I am a Research Scientist at Facebook on the Core Systems team. My current work focuses on improving the disaster readiness and fault tolerance of Facebook’s geo-replicated infrastructure through empirical study, tooling development and strategy making.
I obtained my PhD in Computer Science and Engineering from University of Michigan and BS/MPhil in Computer Science and Engineering from the Hong Kong University of Science and Technology. My prior research includes software safety & security for autonomous vehicles, performance diagnosis and acceleration for mobile systems, and traffic management in software-defined networks.
Data centers, distributed systems, fault tolerance and reliability