Sleeping on the job

The video titled "Sleeping on the Job" features Kir Shatrov and Julik Tarkhanov discussing the challenges and strategies associated with managing background jobs in software applications. They illustrate how background jobs, while useful for executing delayed tasks, can pose risks, such as crashing applications due to unpredictable execution times. The speakers emphasize the importance of effectively visualizing and monitoring background job workflows to enhance control and reliability.

Key points discussed include:
- Delayed Execution Risks: Background jobs do not execute immediately; instead, they can take varying times to complete, complicating performance predictions and resource management.
- Visualization Techniques: A timeline swim lane view is proposed as an effective way to visualize job execution flows, allowing developers to track multiple worker threads over time.
- Monitoring Metrics: Essential metrics include job throughput and performance distributions, which are crucial for performance assessment. Different companies utilize various monitoring tools like Datadog at Shopify and AppSignal at WeTransfer, reflecting their operational needs.
- Two-step Deployment Strategy: Discussing deployment mechanisms, they suggest using a two-step or blue-green deployment strategy to prevent mismatches between the web application and background workers causing execution errors.

Sleeping on the job
Julik Tarkhanov and Kir Shatrov • August 21, 2020 • online

We all love our Sidekiq’s and our Resque’s. But they do let us down sometimes. Not because they are bad, but because the queueing theory is limiting us. There is a way to break out of the madness though - let’s explore how to get our job queues under control.

Kir Shatrov is a platform engineer at Shopify where he works on scalability and reliability of one of the world’s largest ecommerce platforms. When not into working, Kir enjoys cooking, gastronomic tourism (he even has a GitHub repo with his favourite spots!) and exploring London on the bike.
Julik Tarkhanov is a software developer at WeTransfer where he is responsible for the backend components of the Transfer product, enabling effortless transfer of creative ideas. Prior to WeTransfer he worked in the visual effects industry creating images that inspire and befuddle. On his free time he explores weird user interfaces and plays trumpet.

Welcome to the #NoRuKo conference. A virtual unconference organized by Stichting Ruby NL.

#NoRuKo playlist with all talks and panels: https://www.youtube.com/playlist?list=PL9_A7olkztLlmJIAc567KQgKcMi7-qnjg

Recorded 21th of August, 2020.
NoRuKo website: https://noruko.org/
Stichting Ruby NL website: https://rubynl.org/

NoRuKo 2020