Distilled Thompson Sampling: Practical and Efficient Thompson Sampling via Imitation Learning
Hongseok Namkoong, Sam Daulton, Eytan Bakshy
I am a Research Scientist in the Core Data Science team’s Adaptive Experimentation group. My research focuses on developing methods for contextual bandit optimization, reinforcement learning, and Bayesian optimization. Prior to joining Meta, I was at Harvard University, where my work focused on developing robust and efficient transfer learning methods to advance towards disseminating reinforcement learning to high-stakes human applications.
Bayesian optimization, contextual bandits, reinforcement learning, transfer learning
Hongseok Namkoong, Sam Daulton, Eytan Bakshy
Sam Daulton, Max Balandat, Eytan Bakshy
Sam Daulton, Max Balandat, Eytan Bakshy
Ryan M. Dreifuerst, Sam Daulton, Yuchen Qian, Paul Parayil Varkey, Max Balandat, Sanjay Kasturia, Anoop Tomar, Ali Yazdan Panah, Vish Ponnampalam, Robert W. Heath Jr
David Eriksson, Pierce I-Jen Chuang, Sam Daulton, Peng Xia, Akshat Shrivastava, Arun Babu, Shicong Zhao, Ahmed Aly, Ganesh Venkatesh, Max Balandat
Sam Daulton, Shaun Singh, Vashist Avadhanula, Drew Dimmery, Eytan Bakshy