Chapter 16·Beginner
The Context Window: How Much an LLM Can Actually See
01 / 06
What it is
The model's fixed working memory, in tokens.
The context window is the maximum number of tokens an LLM can take into account at once — your prompt, the conversation, any documents, and its own reply, all together.