
Conversation

@imkhalil

Summary

Adds support for Gemini's explicit context caching API, enabling significant token savings (a 99.8% reduction in testing) on repeated requests that share static content such as system instructions, tool definitions, and reference documents.

Changes

  • Added cached_content parameter to Chat.__init__() to accept existing cache references
  • Added 4 cache management methods:
    • create_cache() - Create a new cache from system instructions, tools, and contents
    • delete_cache() - Delete the cache and clear local cache state
    • update_cache() - Update the cache's TTL (time to live)
    • get_cache() - Retrieve cache metadata
  • Modified _call() to pass cached_content to litellm when cache exists
  • Modified _prep_msg() to exclude system instruction when it's already in cache
  • Requires a pinned Gemini model version with the -001 suffix (e.g., gemini-2.0-flash-001), since explicit caching is tied to a specific model version
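The interaction between `cached_content` and the system-instruction exclusion in `_prep_msg` can be sketched as follows. This is a minimal, self-contained illustration rather than the PR's actual code; the function name `prep_msgs` and the message shapes are placeholders:

```python
def prep_msgs(msgs, system=None, cached_content=None):
    """Build the message list for a request.

    When `cached_content` references an existing cache, the system
    instruction is assumed to already live inside that cache, so it is
    excluded here to avoid sending (and paying for) it twice.
    """
    out = []
    if system and not cached_content:
        out.append({"role": "system", "content": system})
    out.extend(msgs)
    return out

# Without a cache, the system instruction is sent inline.
full = prep_msgs([{"role": "user", "content": "hi"}], system="You are terse.")

# With a cache reference, it is omitted from the request payload.
cached = prep_msgs([{"role": "user", "content": "hi"}],
                   system="You are terse.",
                   cached_content="cachedContents/abc123")
```

In the real implementation, `_call()` would additionally forward `cached_content` to litellm so the provider resolves the cached prefix server-side.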

Testing

All methods were tested against the Gemini API (paid tier). Verified:

  • Cache creation and usage
  • Token savings (99.8% reduction confirmed)
  • All CRUD operations work correctly
  • Regular chat functionality remains unaffected
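The headline 99.8% figure is what you would expect when a large static prefix is cached and only a small live prompt is sent per request. Illustrative numbers (not taken from the PR's test run):

```python
cached_tokens = 50_000  # system prompt + tools + reference docs held in the cache
live_tokens = 100       # new user turn sent with each request

# Fraction of the prompt that no longer needs to be resent on each call
reduction = cached_tokens / (cached_tokens + live_tokens)
print(f"{reduction:.1%}")  # → 99.8%
```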

Related

Addresses a request from a community member for Gemini caching support.

@jph00
Contributor

jph00 commented Nov 18, 2025

Thanks for the PR! This is an nbdev project, so the source is the notebooks, not the .py file. You should add your changes to the notebooks. Also add documentation and examples and tests there. Try to follow the coding style in the rest of the notebook, which is based on: https://docs.fast.ai/dev/style.html.

I see you're heavily leaning on AI here, which is fine, but do it in a way where you understand and check each line of code. Ideally, open the repo in Solveit, tell the AI my feedback, and have it help you with the process, including using nbdev_export to get the code exported.

@jph00 jph00 marked this pull request as draft November 24, 2025 06:24
