caiasprojects
Repositories
Showing 3 of 3 repositories
- kvfun Public
  A research implementation for accelerating Large Language Model (LLM) inference through KV cache projection and selective recomputation. This project explores novel approaches to reduce Time-to-First-Token (TTFT) while maintaining generation quality by using smaller auxiliary models to predict KV caches for larger base models.
- LeetcodeEvaluationMetric Public
- SyntheticDataTraining Public
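
The kvfun description above names two ideas: projecting a small model's KV cache into the base model's space, and selectively recomputing part of it. The following is a minimal sketch of one plausible reading of that description, not the repository's actual code. Every name here (`project_kv`, `select_for_recompute`, `build_cache`), the linear projection matrix `proj`, the per-token `importance` scores, and the `exact_kv_fn` callback are assumptions made for illustration.

```python
def matmul(a, b):
    """Multiply an (n x k) matrix by a (k x m) matrix, both given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def project_kv(small_kv, proj):
    """Map per-token KV vectors from the small model's width to the base model's width.

    Assumption: the mapping is a single learned linear projection.
    """
    return matmul(small_kv, proj)


def select_for_recompute(importance, budget):
    """Return the indices of the `budget` highest-scoring tokens, in ascending order."""
    ranked = sorted(range(len(importance)), key=lambda i: importance[i], reverse=True)
    return sorted(ranked[:budget])


def build_cache(small_kv, proj, importance, budget, exact_kv_fn):
    """Build a projected cache, then overwrite the selected positions with exact values.

    `exact_kv_fn(i)` is a hypothetical stand-in for running the base model on token i.
    """
    cache = project_kv(small_kv, proj)
    chosen = select_for_recompute(importance, budget)
    for i in chosen:
        cache[i] = exact_kv_fn(i)
    return cache, chosen


# Toy example: 4 tokens, small width 2, base width 3.
small_kv = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [2.0, 0.0]]
proj = [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]]
importance = [0.1, 0.9, 0.3, 0.7]

cache, chosen = build_cache(small_kv, proj, importance, 2, lambda i: [float(i)] * 3)
print(chosen)  # positions recomputed exactly
print(cache)   # projected entries everywhere else
```

In this reading, the time-to-first-token saving would come from the base model processing only the `budget` selected tokens during prefill, while the remaining cache entries come from the cheaper auxiliary model. How the real project chooses tokens or trains the projection is not stated in the description.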
People
This organization has no public members.