Reverse debugging at scale

Say you receive an email notification that a service is crashing just after your last code change deploys. The crash happens on only 0.1 percent of the servers where it runs. But you’re at a large-scale company, so 0.1 percent equals thousands of servers, and this issue is going to be hard to reproduce. Several [...]

The post Reverse debugging at scale appeared first on Facebook Engineering.
http://dlvr.it/RyYLx0
