Posts

Showing posts from June 2021

Asicmon: A platform agnostic observability system for AI accelerators

We will be hosting a talk about our work, “A Platform Agnostic Observability System for AI Accelerators,” during our virtual Systems @Scale event at 10:20 a.m. PT on Wednesday, June 30, followed by a live Q&A session. Please submit any questions to systemsatscale@fb.com before the event. Accelerators are special-purpose hardware devices optimized for specific [...] The post Asicmon: A platform agnostic observability system for AI accelerators appeared first on Facebook Engineering. http://dlvr.it/S2dhyb

Driving towards an open internet ecosystem to help tackle the digital divide

Connectivity is an integral part of Facebook’s mission to bring people closer together, and the COVID-19 pandemic has only heightened the demand for critical internet access. According to the latest edition of our Inclusive Internet Index, nearly 70 percent of people around the world believe that increased internet usage in all aspects of their lives [...] The post Driving towards an open internet ecosystem to help tackle the digital divide appeared first on Facebook Engineering. http://dlvr.it/S2cXBx

Consolidating Facebook storage infrastructure with Tectonic file system

What the research is: Tectonic, our data center scale distributed file system, enables better resource utilization, promotes simpler services, and requires less operational complexity than our previous approach. Our previous storage infrastructure consisted of a set of use-case specific storage systems. Clusters, or instances of these storage systems, used to scale to tens of petabytes. [...] The post Consolidating Facebook storage infrastructure with Tectonic file system appeared first on Facebook Engineering. http://dlvr.it/S29PGH

Meet Kats — a one-stop shop for time series analysis

What it is: A new library to analyze time series data. Kats is a lightweight, easy-to-use, and generalizable framework for generic time series analysis, including forecasting, anomaly detection, multivariate analysis, and feature extraction/embedding. To the best of our knowledge, Kats is the first comprehensive Python library for generic time series analysis, which provides both classical [...] The post Meet Kats — a one-stop shop for time series analysis appeared first on Facebook Engineering. http://dlvr.it/S29P7d

Network hose: Managing uncertain network demand with model simplicity

Our production backbone network connects our data centers and delivers content to our users. The network supports a vast number of different services, distributed across a multitude of data centers. Traffic patterns shift over time from one data center to another due to the introduction of new services, service architecture changes, changes in user behavior, [...] The post Network hose: Managing uncertain network demand with model simplicity appeared first on Facebook Engineering. http://dlvr.it/S1nCVj

How Facebook deals with PCIe faults to keep our data centers running reliably

Peripheral component interconnect express (PCIe) hardware continues to push the boundaries of computing thanks to advances in transfer speeds, the number of available lanes for simultaneous data delivery, and a comparatively small footprint on motherboards. Today, PCIe connectivity-based hardware delivers faster data transfers and is one of the de facto methods to connect components to [...] The post How Facebook deals with PCIe faults to keep our data centers running reliably appeared first on Facebook Engineering. http://dlvr.it/S0ws4t