
Conversation

@wooway777 (Collaborator)

resolves #125

Copilot AI left a comment

Pull request overview

This PR introduces a cache interface abstraction (issue #125) to enable support for multiple cache implementations (Dynamic, Paged, etc.) through a common interface. The refactoring extracts the existing DynamicCache implementation into a separate directory and establishes a factory pattern for cache creation.

Key Changes:

  • Introduced CacheInterface abstract base class with factory method for polymorphic cache creation
  • Refactored DynamicCache from a standalone class to implement CacheInterface
  • Updated cache references throughout the codebase to use the interface type instead of concrete types
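For readers skimming the diff, a minimal sketch of the shape these changes introduce is below. The names mirror the PR description, but the actual signatures in csrc/cache/cache_interface.hpp and csrc/cache/cache_factory.cpp may differ; infinicore::Tensor is stubbed out and the string-based create() parameter is an assumption.

```cpp
// Minimal sketch of the interface + factory described in this PR (assumed names;
// see cache_interface.hpp / cache_factory.cpp for the real definitions).
#include <memory>
#include <stdexcept>
#include <string>
#include <utility>

namespace infinicore {
struct Tensor {};  // placeholder for the real infinicore::Tensor
}  // namespace infinicore

class CacheInterface {
public:
    virtual ~CacheInterface() = default;

    // Append new key/value tensors and return views over the full cached k/v.
    virtual std::pair<infinicore::Tensor, infinicore::Tensor>
    update(const infinicore::Tensor &k_new, const infinicore::Tensor &v_new) = 0;

    // Factory method: choose a concrete cache implementation from configuration.
    static std::unique_ptr<CacheInterface> create(const std::string &cache_type);
};

class DynamicCache final : public CacheInterface {
public:
    std::pair<infinicore::Tensor, infinicore::Tensor>
    update(const infinicore::Tensor &k_new, const infinicore::Tensor &v_new) override {
        // The real implementation (dynamic_cache.cpp) grows the cache and copies
        // k_new/v_new in at the current position; this stub just echoes the inputs.
        return {k_new, v_new};
    }
};

std::unique_ptr<CacheInterface> CacheInterface::create(const std::string &cache_type) {
    if (cache_type == "dynamic") {
        return std::make_unique<DynamicCache>();
    }
    // Additional implementations (e.g. a paged cache) would be dispatched here.
    throw std::runtime_error("Unknown cache type: " + cache_type);
}
```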

Reviewed changes

Copilot reviewed 11 out of 11 changed files in this pull request and generated 5 comments.

Summary per file:

csrc/cache/cache_interface.hpp - New abstract interface defining the contract for all cache implementations
csrc/cache/cache_factory.cpp - Factory implementation for creating cache instances based on configuration
csrc/cache/cache.hpp - Updated to include both the interface and implementation headers
csrc/cache/dynamic_cache/dynamic_cache.hpp - Refactored DynamicCache header to inherit from CacheInterface
csrc/cache/dynamic_cache/dynamic_cache.cpp - Implementation moved from inline definitions to a separate compilation unit
csrc/cache/kv_cache.hpp - Removed (functionality moved to dynamic_cache/)
csrc/models/llama/llama_model.hpp - Updated the external_cache_ member to use the interface pointer
csrc/models/llama/llama_attention.hpp - Updated include to use the new cache.hpp header
csrc/engine/rank_worker.hpp - Updated cache_ptr_ to use the interface type
csrc/engine/rank_worker.cpp - Updated to create the cache through the factory method, with logging
csrc/pybind11/models/llama.hpp - Updated includes and reformatted method definitions
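As a complement to the sketch above, this hypothetical snippet shows how a consumer such as RankWorker might hold the cache through the interface pointer, per the rank_worker rows in the summary; the constructor shape, configuration parameter, and log line are illustrative assumptions, not the real csrc/engine/rank_worker.cpp code.

```cpp
// Hypothetical consumer-side sketch, built on the CacheInterface sketch above;
// the real RankWorker in csrc/engine/rank_worker.cpp will differ in detail.
#include <iostream>
#include <memory>
#include <string>

class RankWorker {
public:
    explicit RankWorker(const std::string &cache_type)
        : cache_ptr_(CacheInterface::create(cache_type)) {
        // Holding the interface pointer means switching from the dynamic cache to a
        // future paged cache is a configuration change, not a code change here.
        std::cout << "RankWorker: created cache of type " << cache_type << std::endl;
    }

private:
    std::unique_ptr<CacheInterface> cache_ptr_;  // interface type, not DynamicCache*
};
```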



// Copy new k/v into cache at current position
bool all_equal = cache_positions.empty() || std::equal(cache_positions.begin() + 1, cache_positions.end(), cache_positions.begin());
if (all_equal) {
Copilot AI Dec 12, 2025

If cache_positions is empty after ensure_capacity (which could happen if initial_batch_size is 0), accessing cache_positions[0] on line 143 would cause undefined behavior. Consider adding a check to ensure cache_positions is not empty before accessing it, or validate that initial_batch_size is at least 1 in ensure_capacity.

Suggested change:

  - if (all_equal) {
  + if (all_equal) {
  +     if (cache_positions.empty()) {
  +         throw std::runtime_error("KVCache update: cache_positions is empty after ensure_capacity, cannot update cache.");
  +     }
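For context, here is an illustrative, self-contained reconstruction of how the suggested guard sits relative to the cache_positions[0] access the comment is concerned about; only the all_equal computation comes from the excerpt above, the surrounding function and names are assumptions.

```cpp
#include <algorithm>
#include <cstddef>
#include <stdexcept>
#include <vector>

// Illustrative reconstruction: only cache_positions and the all_equal check come
// from the excerpt above; everything else is assumed.
void copy_into_cache_at_common_position(const std::vector<std::size_t> &cache_positions) {
    bool all_equal = cache_positions.empty() ||
                     std::equal(cache_positions.begin() + 1, cache_positions.end(),
                                cache_positions.begin());
    if (all_equal) {
        if (cache_positions.empty()) {
            // Without this guard, cache_positions[0] below is undefined behavior
            // whenever ensure_capacity leaves the vector empty.
            throw std::runtime_error(
                "KVCache update: cache_positions is empty after ensure_capacity, cannot update cache.");
        }
        const std::size_t pos = cache_positions[0];  // safe: emptiness ruled out above
        (void)pos;  // ... copy k_new / v_new into the cache starting at `pos` ...
    }
}
```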

Copilot AI left a comment

Pull request overview

Copilot reviewed 14 out of 14 changed files in this pull request and generated 2 comments.



*/
std::pair<infinicore::Tensor, infinicore::Tensor> update(
const infinicore::Tensor &k_new,
const infinicore::Tensor &v_new);
Copilot AI Dec 12, 2025

The update method that takes two parameters should be marked with the override keyword for consistency and clarity. The base class CacheInterface provides a virtual implementation of this method, and marking overrides explicitly helps prevent errors and improves code maintainability.

Suggested change:

  -     const infinicore::Tensor &v_new);
  +     const infinicore::Tensor &v_new) override;
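To illustrate the general C++ point behind this suggestion (not code from this PR, and with infinicore::Tensor stubbed out): with override, a later change to the base-class signature becomes a compile error instead of silently declaring an unrelated overload in the derived class.

```cpp
// General C++ illustration, not code from this PR; infinicore::Tensor is a stub.
#include <utility>

namespace infinicore {
struct Tensor {};  // placeholder for the real tensor type
}  // namespace infinicore

struct CacheInterface {
    virtual ~CacheInterface() = default;
    // Per the comment above, the base class provides a virtual default implementation.
    virtual std::pair<infinicore::Tensor, infinicore::Tensor>
    update(const infinicore::Tensor &k_new, const infinicore::Tensor &v_new) {
        return {k_new, v_new};
    }
};

struct DynamicCache : CacheInterface {
    // `override` asserts this really overrides a base-class virtual; if the base
    // signature ever changes, this fails to compile instead of silently becoming
    // a new, unrelated overload.
    std::pair<infinicore::Tensor, infinicore::Tensor>
    update(const infinicore::Tensor &k_new, const infinicore::Tensor &v_new) override {
        return {k_new, v_new};
    }
};
```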


Development

Successfully merging this pull request may close these issues.

[DEV] Unified Cache 2.0
