Comment by westurner - Hacker Neue

westurner Jul 23, 2020 parent

From a previous question re: "Ask HN: CS papers for software architecture and design?" (https://www.hackerneue.com/item?id=15778396 and distributed systems we eventually realize were needed in the first place:

> Bulk Synchronous Parallel: https://en.wikipedia.org/wiki/Bulk_synchronous_parallel .

Many/most (?) distributed systems can be described in terms of BSP primitives.

> Paxos: https://en.wikipedia.org/wiki/Paxos_(computer_science) .

> Raft: https://en.wikipedia.org/wiki/Raft_(computer_science) #Safety

> CAP theorem: https://en.wikipedia.org/wiki/CAP_theorem .

Papers-we-love > Distributed Systems: https://github.com/papers-we-love/papers-we-love/tree/master...

awesome-distributed-systems also has many links to theory: https://github.com/theanalyst/awesome-distributed-systems

- Byzantine fault: https://en.wikipedia.org/wiki/Byzantine_fault :

> A [Byzantine fault] is a condition of a computer system, particularly distributed computing systems, where components may fail and there is imperfect information on whether a component has failed. The term takes its name from an allegory, the "Byzantine Generals Problem",[2] developed to describe a situation in which, in order to avoid catastrophic failure of the system, the system's actors must agree on a concerted strategy, but some of these actors are unreliable.

awesome-bigdata lists a number of tools: https://github.com/onurakpolat/awesome-bigdata

Practically, dask.distributed (joblib -> SLURM,), dask ML, dask-labextension (a JupyterLab extension for dask), and the Rapids.ai tools (e.g. cuDF) scale from one to many nodes.

westurner OP Jul 24, 2020

Not without a sense of irony, as the lists above list many papers that could be readings with quizzes,

Distributed systems -> Distributed computing: https://en.wikipedia.org/wiki/Distributed_computing

Category: Distributed computing: https://en.wikipedia.org/wiki/Category:Distributed_computing

Category:Distributed_computing_architecture : https://en.wikipedia.org/wiki/Category:Distributed_computing...

DLT: Distributed Ledger Technology: https://en.wikipedia.org/wiki/Distributed_ledger

Consensus (computer science) https://en.wikipedia.org/wiki/Consensus_(computer_science)

This item has no comments currently.