National and Kapodistrian University of Athens
Every day, billions of people connect with each other through Meta using services like Facebook, Instagram, WhatsApp, and Messenger. Meta’s services rely on fleets of servers in data centers across the globe, all running applications and delivering the performance the services need. However, silent data corruption, or data errors that go undetected by the larger system, remain a widespread challenge for large-scale infrastructure systems. This type of corruption can propagate across the stack and manifest as application-level problems. It can also result in data loss and require months to debug and resolve. Our teams enable and support hardware testing and large-scale experiments in Meta's data centers including detecting and remediating silent data corruptions on a scale of hundreds of thousands of machines.
Within this novel research domain, we identify research opportunities that range from architectural solutions to data corruption, to fleetwide testing strategies and distributed computing resiliency models, to software and library resiliency, to silicon level design, simulation and manufacturing approaches. Solutions could be cross-layered with proposals combining different domains within the above. This RFP is not limited to solutions specific to CPUs, but instead is pursuing all the components typically used within a server infrastructure.
To foster further innovation in this area, and to deepen our collaboration with academia, Meta is pleased to invite faculty to respond to this call for research proposals pertaining to the aforementioned topics. We anticipate awarding up to five awards, each in the $50,000 range. Payment will be made to the proposer's host university as an unrestricted gift.
National and Kapodistrian University of Athens
University of British Columbia
Northeastern University
Stanford University
Carnegie Mellon University
Applications Are Currently CLosed
We are soliciting proposals focusing on mitigation of silent data corruptions within internet applications due to hardware faults affecting the data center computing stack (from hardware to compilers to applications). Proposals could range from hardware and architectural level mitigations and design strategies to test architecture evolution to software resiliency for silent data corruption.
The example topics include the following:
1. Computer architecture approaches to handle silent data corruptions
2. Distributed computing solutions to Silent error propagation
3. Service resiliency, software redundancy
4. Silicon design
Most of the RFP awards are an unrestricted gift. Because of its nature, salary/headcount could be included as part of the budget presented for the RFP. Since the award/gift is paid to the university, they will be able to allocate the funds to that winning project and have the freedom to use as they need. All Facebook teams are different and have different expectations concerning deliverables, timing, etc. Long story short – yes, money for salary/headcount can be included. It’s up to the reviewing team to determine if the percentage spend is reasonable and how that relates to the decision if the project is a winner or not.
We are flexible, but ideally proposals submitted are single-spaced, Times New Roman, 12 pt font.
Yes, award funds can be used to cover a researcher’s salary.
Budgets can vary by institution and geography, but overall research funds ideally cover the following: graduate or post-graduate students’ employment/tuition; other research costs (e.g., equipment, laptops, incidental costs); travel associated with the research (conferences, workshops, summits, etc.); overhead for research gifts is limited to 5%
One person will need to be the primary PI (i.e., the submitter that will receive all email notifications); however, you’ll be given the opportunity to list collaborators/co-PIs in the submission form. Please note in your budget breakdown how the funds should be dispersed amongst PIs.
Meta’s decisions will be final in all matters relating to Meta RFP solicitations, including whether or not to grant an award and the interpretation of Meta RFP Terms and Conditions. By submitting a proposal, applicants affirm that they have read and agree to these Terms and Conditions.