Once a session exceeds the token limit, the oldest messages get trimmed out so the model can focus on the recent ones. Once a session exceeds the model’s token limit, it starts forgetting the earliest parts of the conversation to make room for newer messages. - by ChatGPT itself