RocksDB is an open-source storage engine widely used inside and outside of Meta. Historically, it was mainly used for storing data on local SSDs.
RocksDB is an open-source storage engine widely used inside and outside of Meta. Historically, it was mainly used for storing data on local SSDs.
This paper describes how we transformed the legacy data lakehouse stack at Meta to adapt to the new realities through a large cross-organizational effort called Shared Foundations.
In this paper, we present a new benchmark, TAOBench, that captures the social graph workload at Meta. We open source workload configurations along with a benchmark that leverages...
Velox provides reusable, extensible, high-performance, and dialect-agnostic data processing components for building execution engines, and enhancing data management systems.
Here, we approached HRTF personalization from a morphological standpoint by calculating the distance between any two three-dimensional models of the ear.
With bumped ribbon retrieval (BuRR), we present the first practical succinct retrieval data structure. In an extensive experimental evaluation BuRR achieves space overheads...
This paper presents Meta’s end-to-end DSI pipeline, composed of a central data warehouse built on distributed storage and a Data PreProcessing Service that scales to eliminate data stalls.
In this paper, we describe a schema-first approach to application telemetry that is being implemented at Meta. It allows the observability platforms to capture metadata about...
In this paper, we’d like to introduce some of the most important features and performance improvements the open source Presto community made in recent years, which enables...
In this paper we describe the Smarter Warehouse initiative that aims to automate or simplify many of these optimization decisions. Our long term vision is for a large portion of...