feat(ai-chat): add automatic chat session summarization for long conversations #16742
base: master
Conversation
Automatically summarizes chat sessions when token usage approaches the context
limit (90% of 200k tokens), enabling continued conversations without losing
context from earlier messages.
Core functionality:
- Add `ChatSessionSummarizationService` to orchestrate summarization
- Add `insertSummary()` method to `MutableChatModel` for inserting summary nodes
- Add `isStale` flag to mark pre-summary messages (excluded from future prompts)
- Add `kind` field to `ChatRequest` interface ('user' | 'summary')
Budget-aware tool loop:
- Add `singleRoundTrip` flag to `UserRequest` for controlled tool execution
- Extend `ChatLanguageModelServiceImpl` with budget checking before/during requests
- Trigger mid-turn summarization when threshold exceeded during tool loops
- Support both threshold-triggered and explicit summarization
Token usage tracking:
- Add `TokenUsageService` for recording token usage across providers
- Add `TokenUsageServiceClient` for frontend notification of usage updates
- Display token count indicator in chat UI with session switching support
UI components:
- Add collapsible summary node rendering with bookmark icon
- Add `SummaryPartRenderer` for displaying summary content
- Add token usage indicator showing current session token count
fixes #16703
fixes #16724
Current Limitations:
- Only supported by Anthropic
- Hard-coded budget of 200k tokens
- Hard-coded trigger when reaching 90% of the budget
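As a rough sketch of the trigger logic described above (the constant and function names here are illustrative, not the PR's actual identifiers):

```typescript
// Illustrative sketch of the hard-coded threshold check; names are hypothetical.
const TOKEN_BUDGET = 200_000;  // hard-coded context budget
const TRIGGER_RATIO = 0.9;     // summarize when 90% of the budget is used

function shouldSummarize(usedTokens: number): boolean {
    // 90% of 200k = 180k tokens triggers automatic summarization
    return usedTokens >= TOKEN_BUDGET * TRIGGER_RATIO;
}
```

The limitations above would disappear if the budget and ratio were read from preferences instead of constants.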
I will review this next week at the latest.
I did a quick test; sadly, it does not work for me: the tokens are not counted correctly. They reset all the time, so they never go above 500.
I tried with
@Coder Check all typescript files for spelling errors
and Opus 4.5.
I had a rough look over the code and left some comments.
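One plausible cause of the reset symptom described above is each response's usage overwriting the session counter instead of adding to it. A minimal session-scoped accumulator, with hypothetical names (not the PR's actual `TokenUsageService` code), would look like:

```typescript
// Illustrative session-scoped accumulator; identifiers are hypothetical.
const usageBySession = new Map<string, number>();

function recordTokenUsage(sessionId: string, inputTokens: number, outputTokens: number): number {
    // Accumulate onto the running session total rather than overwriting it,
    // so the indicator grows monotonically within a session.
    const total = (usageBySession.get(sessionId) ?? 0) + inputTokens + outputTokens;
    usageBySession.set(sessionId, total);
    return total;
}
```

If the real service instead stores only the latest response's usage, the UI would keep snapping back to small per-request values, matching the observed behavior.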
.prompts/project-info.prompttemplate (outdated)

| Command (from root) | Purpose |
|---------------------|---------|
| `npm install` | Install dependencies (required first) |
Suggested change: use `npm ci` instead of `npm install`:

| `npm ci` | Install dependencies (required first) |
| `npm run build:browser` | Build all packages + browser app |
| `npm run start:browser` | Start browser example at localhost:3000 |
| `npm run start:electron` | Start Electron desktop app |
const SummaryContent: React.FC<SummaryContentProps> = ({ content, openerService }) => {
    const contentRef = useMarkdownRendering(content, openerService);
Likely a follow-up, but it would be amazing if the summary were editable, in case the user is not satisfied with it afterwards and, for example, wants to highlight a specific fact.
// Skip empty branches (can occur during insertSummary operations)
if (branch.items.length === 0) {
    return;
}
The whole empty-branch situation seems a bit brittle. Can we switch to more deterministic and stable invariants so that code like this is not necessary? It should be possible to guarantee a proper branch structure throughout.
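A minimal sketch of the kind of invariant asked for here, using a simplified branch model (not the actual `MutableChatModel` types): a branch is only ever created together with its first item, so an empty branch cannot exist and the defensive check above becomes unnecessary.

```typescript
// Simplified illustration of a "branches are never empty" invariant.
interface Branch<T> {
    items: T[];
}

// The only way to create a branch is with its first item,
// so `branch.items.length === 0` can never be observed.
function addBranch<T>(branches: Branch<T>[], firstItem: T): Branch<T> {
    const branch: Branch<T> = { items: [firstItem] };
    branches.push(branch);
    return branch;
}
```

With this construction rule, `insertSummary()` would split or move items between branches but never leave one behind with zero items.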
if (budgetAwareEnabled && request.tools?.length) {
    return this.sendRequestWithBudgetAwareness(languageModel, request);
}
This new budget loop does not properly handle the history mechanism, leading to odd history-view behavior: a lot of requests are rendered without responses.
const budgetAwareEnabled = this.preferenceService.get<boolean>(BUDGET_AWARE_TOOL_LOOP_PREF, false);

if (budgetAwareEnabled && request.tools?.length) {
    return this.sendRequestWithBudgetAwareness(languageModel, request);
The same kind of strategy pattern would be good here, I think. Everyone will need to handle the tool loop, but adopters might want to do different things than summarization.
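A minimal sketch of such a strategy pattern, with hypothetical names (not Theia APIs): the service keeps running the tool loop itself and delegates only the budget-exceeded decision to a pluggable strategy, so adopters can summarize, abort, or do something else entirely.

```typescript
// Hypothetical strategy interface for budget handling in the tool loop.
interface ToolLoopStrategy {
    onBudgetExceeded(usedTokens: number): 'summarize' | 'abort' | 'continue';
}

// The default behavior from this PR: trigger mid-turn summarization.
class SummarizingStrategy implements ToolLoopStrategy {
    onBudgetExceeded(): 'summarize' {
        return 'summarize';
    }
}

// An adopter that prefers to stop the loop instead of summarizing.
class AbortingStrategy implements ToolLoopStrategy {
    onBudgetExceeded(): 'abort' {
        return 'abort';
    }
}

// The service consults the strategy only when the budget is hit.
function handleBudget(strategy: ToolLoopStrategy, used: number, budget: number): string {
    return used >= budget ? strategy.onBudgetExceeded(used) : 'continue';
}
```

The preference check in the snippet above would then select a bound strategy instead of branching into a hard-coded `sendRequestWithBudgetAwareness` path.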
How to test
Enable budget awareness for Anthropic in the settings. Start a chat using an Anthropic model and let it grow. Verify that a summary is automatically triggered when the session reaches 180k tokens.
Follow-ups
Extend the tool handling to all other LLM wrappers.
Breaking changes
Attribution
Review checklist
- New UI strings are localized via the `nls` service (for details, please see the Internationalization/Localization section in the Coding Guidelines)
Reminder for reviewers