Distributed Systems: Your Only Guarantee Is Inconsistency

In this presentation titled "Distributed Systems: Your Only Guarantee Is Inconsistency" at Euruko 2017, Anthony Zacharakis discusses the complexities of distributed systems, particularly focusing on the inconsistencies that can arise within data during such transitions. Drawing from his experience at DigitalOcean, he illustrates the importance of understanding these inconsistencies utilizing billing processing as a relatable example.

Key Points Discussed:
- Introduction to Distributed Systems: Zacharakis stresses the growing significance of this topic, especially with the rise of microservices and service-oriented architectures.
- Importance of Month-Close Processing: He explains the month-close pipeline in billing, detailing the actions taken for each customer, like generating invoices and charging users. The increasing scale leads to challenges maintaining this process.
- Challenges of Synchronous Processing: The current reliance on a single class for several tasks presents issues, particularly with performance and handling failures due to external dependencies, which complicates maintenance.
- Role of Background Workers: The presentation highlights the advantages of using background workers to improve workflow. Jobs can be queued, prioritized, and retried automatically, enabling more efficient handling of heavy tasks. Zacharakis notably mentions Sidekiq as a preferred tool in this context.
- Practical Example of Job Handling: He outlines how to implement Sidekiq by developing a separate job class, which can run expensive tasks asynchronously, reducing processing time significantly.
- Dealing with Idempotency Issues: Zacharakis explains the necessity to address idempotency, which is the challenge of ensuring that if a job fails, it can safely retry without causing duplicate charges or errors.
- Unforeseen Bugs and Adjustments: The transition to asynchronous processing led to unexpected issues such as duplicate payments, demonstrating the need to adjust existing assumptions about timing and process flows.
- Acceptance of Dynamic Environments: He encourages acknowledging that conditions may change and emphasizes designing systems to adapt and handle these changes gracefully, thus preparing for a self-healing system.

Conclusions and Takeaways:
- The key lesson is the essential nature of adapting to inherent inconsistencies within distributed systems. By understanding and planning for unexpected behaviors, systems can be designed to handle failures and changes more effectively. Zacharakis underlines that systems must be built with the knowledge that the environment is dynamic, encouraging a shift from synchronous to asynchronous thinking, while fostering resilience within the architecture.