Detecting silent errors in the wild: Combining two novel approaches to quickly detect silent data corruptions at scale
Silent data corruptions (SDCs), data errors that go undetected by the larger system, are a widespread problem for large-scale infrastructure systems. Left undetected, these types of corruptions can cause data loss and propagate across the stack and manifest as application-level problems. Silent data corruptions (SDC) in hardware impact computational integrity for large-scale applications. Sources of [...]
Read More...
The post Detecting silent errors in the wild: Combining two novel approaches to quickly detect silent data corruptions at scale appeared first on Engineering at Meta.
http://dlvr.it/SLtFLm
http://dlvr.it/SLtFLm
Komentar
Posting Komentar