
Retrieval Augmented Language Models

Speaker(s)
Konrad Staniszewski
Affiliation
Uniwersytet Warszawski
Date
June 1, 2023, 12:15 p.m.
Information about the event
https://uw-edu-pl.zoom.us/j/92107522918
Seminar
"Machine Learning" Seminar

Large language models store their knowledge in their parameters, and updating that knowledge requires costly fine-tuning. An interesting alternative is to provide new knowledge in the model's context. However, typical models have relatively short context lengths.
In this presentation, I will discuss one potential solution to this problem: retrieval-augmented transformer models. These models use a large external database to store information about already-processed parts of the text and retrieve the best-matching entries to improve performance during inference.
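
The store-and-retrieve idea described above can be illustrated with a minimal sketch. It is not the speaker's method: the `embed` function, the `ChunkMemory` class, and the example chunks are all hypothetical, and a bag-of-words count vector stands in for the learned representations a real retrieval-augmented transformer would use.

```python
import math
from collections import Counter


def embed(text):
    """Toy stand-in for a learned encoder (assumption): bag-of-words counts."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class ChunkMemory:
    """Hypothetical external database of already-processed text chunks."""

    def __init__(self):
        self.entries = []  # list of (key vector, chunk text) pairs

    def add(self, chunk):
        self.entries.append((embed(chunk), chunk))

    def retrieve(self, query, k=2):
        """Return the k stored chunks most similar to the query."""
        q = embed(query)
        ranked = sorted(self.entries, key=lambda e: cosine(q, e[0]), reverse=True)
        return [chunk for _, chunk in ranked[:k]]


memory = ChunkMemory()
for chunk in [
    "the seminar takes place on zoom",
    "retrieval models query an external database",
    "fine-tuning updates model parameters",
]:
    memory.add(chunk)

question = "which database do retrieval models query"
retrieved = memory.retrieve(question, k=1)

# The retrieved entries are placed in front of the current input, so the
# model sees relevant earlier text without needing a longer context window.
augmented_input = "\n".join(retrieved + [question])
print(augmented_input)
```

In this sketch the retrieved text is simply prepended to the input. Actual architectures may instead feed retrieved entries into the attention layers, which is a design choice this example does not attempt to reproduce.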