Menu

Post image 1
Post image 2
1 / 2
0

Lost in the Middle: Why LLMs Quietly Ignore the Centre of Their Own Context Window

DEV Community·Vikrant Shukla·23 days ago
#dEwqw0cP
#llm#ai#deeplearning#context#model#attention
Reading 0:00
15s threshold

Every time you hand a long document to an LLM and ask it to summarise or answer a question, something quietly goes wrong. The model reads the whole thing — or appears to — but its answers disproportionately reflect what was at the beginning and the end. Whatever sat in the middle? Largely ignored. This isn't a rumour. It was rigorously documented in a 2023 paper titled "Lost in the Middle: How Language Models Use Long Contexts" (Liu et al., Stanford/UC Berkeley), and it remains one of the most practically important — and underappreciated — findings in applied LLM science. The Shape of the Problem The researchers ran a controlled experiment: they placed the answer to a multi-document QA question inside a set of retrieved documents, then varied which position the relevant document occupied — first, middle, or last. Performance dropped sharply when the relevant document was positioned in the middle of the context, even when the total context length was well within the model's stated window.…

Continue reading — create a free account

Join HashtagPLUS to read full articles, follow hashtags, vote, and join the conversation.

Read More