This is the course project repo for AI+X: Speeding up LLMs, Summer 2025, Tsinghua University.
D2C is a novel approach to optimizing diffusion large language models (dLLMs) through dual dynamic caching mechanisms. This repository contains the D2C implementation and instructions for using it.
See our course report here.