Logarithm: A logging engine for AI training workflows and services

Systems and application logs play a key role in operations, observability, and debugging workflows at Meta. Logarithm is a hosted, serverless, multitenant service, used only internally at Meta, that consumes and indexes these logs and provides an interactive query interface to retrieve and view logs. In this post, we present the design behind Logarithm, and [...]


Read More...


The post Logarithm: A logging engine for AI training workflows and services appeared first on Engineering at Meta.


http://dlvr.it/T4Fp7Z

Komentar

Postingan populer dari blog ini

Inside Meta’s first smart glasses

Simulator-based reinforcement learning for data center cooling optimization

Improving machine learning iteration speed with faster application build and packaging