Feb. 23, 2024
Nelson F. Liu et al., "Lost in the Middle: How Language Models Use Long Contexts"
1. Problem
- Modern LLMs accept increasingly long input contexts, and applications such as RAG (Retrieval-Augmented Generation) routinely pack many retrieved documents into a single prompt.
- However, it is unclear whether LLMs can actually use information spread throughout these long contexts, rather than just accept them.
2. Key Findings
- LLM performance degrades significantly when the relevant information sits in the middle of a long input context, yielding a U-shaped curve of accuracy versus position (a minimal evaluation sketch follows this list).
- LLMs perform best when the relevant information appears at the beginning (primacy bias) or the end (recency bias) of the input context.
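The evaluation idea is simple: slide one relevant document through every position among distractor documents and measure accuracy at each position. Below is a minimal sketch, assuming a hypothetical `ask_model` callable that wraps whichever LLM is under test; the prompt template is illustrative, not the paper's exact format.

```python
# Position-sweep sketch for the "lost in the middle" evaluation.
# Assumption: `ask_model` is any callable mapping a prompt string to the
# model's answer string (e.g., a thin wrapper around an LLM API).
from typing import Callable, List


def build_prompt(question: str, documents: List[str]) -> str:
    """Pack the documents and the question into a single long prompt."""
    numbered = "\n\n".join(
        f"Document [{i + 1}]: {doc}" for i, doc in enumerate(documents)
    )
    return (
        "Answer the question using only the documents below.\n\n"
        f"{numbered}\n\nQuestion: {question}\nAnswer:"
    )


def accuracy_by_position(
    ask_model: Callable[[str], str],
    question: str,
    answer: str,
    relevant_doc: str,
    distractors: List[str],
) -> List[float]:
    """Slide the relevant document through every position among the
    distractors and record whether the model still finds the answer."""
    scores = []
    for pos in range(len(distractors) + 1):
        docs = distractors[:pos] + [relevant_doc] + distractors[pos:]
        prediction = ask_model(build_prompt(question, docs))
        scores.append(float(answer.lower() in prediction.lower()))
    return scores
```

Plotting the returned scores against position reproduces the U-shape the paper reports: high at both ends, depressed in the middle.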
3. Analysis of Causes
- LLMs struggle to robustly access information buried in the middle of long contexts; even models explicitly trained with extended context windows are not necessarily better at using that context.
- The limitation appears partly architectural: decoder-only LLMs, which attend only to preceding tokens, are most affected, while encoder-decoder models that contextualize the input bidirectionally are more robust, at least within their training-time sequence lengths.
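A practical takeaway from the U-shaped finding (my own illustration, not code from the paper): in RAG pipelines, reorder retrieved documents so the highest-ranked ones land at the edges of the context rather than concatenating them in rank order.

```python
# Hedged sketch of an "edges-first" reordering mitigation suggested by the
# position findings above. Assumption: documents arrive sorted by retrieval
# score, best first.
from typing import List


def reorder_for_long_context(docs_best_first: List[str]) -> List[str]:
    """Interleave documents so the highest-ranked ones end up at the
    beginning and end of the context, pushing weak ones to the middle."""
    front, back = [], []
    for i, doc in enumerate(docs_best_first):
        # Alternate: rank 1 -> front, rank 2 -> back, rank 3 -> front, ...
        (front if i % 2 == 0 else back).append(doc)
    return front + back[::-1]


# Example: ranks 1..6 become [1, 3, 5, 6, 4, 2], placing the two best
# documents at the positions the paper finds models attend to most.
print(reorder_for_long_context([f"doc{r}" for r in range(1, 7)]))
```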