I recommended “Understanding Distributed Systems: What every developer should know about large distributed applications” by Roberto Vitillo to all my colleagues back when I worked on SaaS systems.
I’d recommend “Designing Data-Intensive Applications: The Big Ideas Behind Reliable, Scalable, and Maintainable Systems” by Martin Kleppmann as the more advanced deep dive.
Both books provide timeless conceptual advice. Kleppmann’s description of developing a database by starting from an append-only text file really stuck with me.
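For anyone who hasn’t read it, the example goes roughly like this. Kleppmann’s original is a pair of shell one-liners; this Python adaptation is mine, not his:

```python
# A toy key-value store in the spirit of Kleppmann's append-only example.

def db_set(path, key, value):
    # Writes never touch old data: every set just appends a new record.
    with open(path, "a") as f:
        f.write(f"{key},{value}\n")

def db_get(path, key):
    # Reads scan the whole file and keep the *last* value seen for the key,
    # so updates work without rewriting anything, at O(n) cost per lookup.
    value = None
    with open(path) as f:
        for line in f:
            k, _, v = line.rstrip("\n").partition(",")
            if k == key:
                value = v
    return value
```

Everything the book builds afterwards (indexes, SSTables, LSM-trees, B-trees) is motivated as a fix for that O(n) read path.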
> Lamport's paper "Time, Clocks, and the Ordering of Events in a Distributed System"
I know this paper is a classic. I studied it in school, but I've always found it very hard to understand. Maybe I'm wrong, but I have the feeling that relatively few engineers use these formalisms as their mental models when designing distributed systems.
It was surprising that Kleppmann's book was mentioned only at the very end of the article, but at least it came with an understandable caveat. That book is incredible, although in all honesty it requires a solid foundation in distributed systems to make proper sense.
Until you have personally battled with replication lag and the real-life impacts of eventual consistency and distributed writes, Designing Data-Intensive Applications feels like a dry, theoretical read. If you come to the book with those scars and lessons, it opens the world up.
I often like to think that, at a basic level, all a [edit: indexed] db "does" is move our O(n) search of an unordered text file to the O(log n) search of a tree.
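A minimal sketch of that mental model, with Python's bisect standing in for the B-tree a real database would maintain:

```python
import bisect

# The unordered "text file": a lookup has to scan every record, O(n).
records = [("carol", "3"), ("alice", "1"), ("bob", "2")]

def scan_lookup(key):
    for k, v in records:
        if k == key:
            return v
    return None

# The "index": sort once, then binary-search the keys, O(log n) per lookup.
index = sorted(records)
keys = [k for k, _ in index]

def indexed_lookup(key):
    i = bisect.bisect_left(keys, key)
    if i < len(keys) and keys[i] == key:
        return index[i][1]
    return None

assert scan_lookup("bob") == indexed_lookup("bob") == "2"
```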
From a high-altitude view, that's why splitting a huge database table into smaller partitions is not an automatic performance win. If you have M partitions with N rows each, then a lookup might require O(log M) time to find a partition and O(log N) time to find a row within the partition. But O(log M + log N) = O(log MN), which is what you would get from a single big table with appropriate indexing.
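To put made-up numbers on it (1,000 partitions of a million rows each, chosen purely for illustration):

```python
import math

M, N = 1_000, 1_000_000                   # 1,000 partitions of 1M rows each
two_level = math.log2(M) + math.log2(N)   # find the partition, then the row
one_table = math.log2(M * N)              # one big indexed table of 1B rows
print(two_level, one_table)               # both ~29.9: log M + log N = log MN
```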
Of course, in the real world, constant factors and implementation details matter, so this is just a heuristic. But it seems to run contrary to a lot of novice programmers' intuition that a large DB table must automatically be a slow one.
The book Dominik Tornow is writing, “Thinking in Distributed Systems”, has been an excellent next read after DDIA for me (it’s not yet finished, I believe).
It really shows the experience of someone who understands this stuff inside and out (he was one of the main people behind Temporal).