Context window
The total text a model can see in one call — input and output combined. Not memory. When it's full, things get dropped or summarized.
The total text a model can see in one call — input and output combined. Not memory. When it's full, things get dropped or summarized.