Learnings from Data Integration for Augmented Language Models

arXiv

Abstract

One of the limitations of large language models is that they do not have access to up-to-date, proprietary, or personal data. As a result, there are multiple efforts to extend language models with techniques for accessing external data. In that sense, LLMs share the vision of data integration systems whose goal is to provide seamless access to a large collection of heterogeneous data sources. While the details and the techniques of LLMs differ greatly from those of data integration, this paper shows that some of the lessons learned from research on data integration can elucidate the research path we are conducting today on language models.

Featured Publications

All Publications

Research