    Repositories list

    • DDT

      Public
      DDT: Decoupled Diffusion Transformer
      Python
      1124900 Updated May 20, 2025
    • [ICML 2025] Differentiable Solver Search for Fast Diffusion Sampling
      Python
      MIT License
      01800 Updated May 1, 2025
    • DMM

      Public
      DMM: Building a Versatile Image Generation Model via Distillation-Based Model Merging
      Python
      44320 Updated Apr 27, 2025
    • MOTIP

      Public
      [CVPR 2025] Multiple Object Tracking as ID Prediction
      Python
      Apache License 2.0
      1723740 Updated Apr 21, 2025
    • [TPAMI] JointFormer: A Unified Framework with Joint Modeling for Video Object Segmentation
      Python
      0800 Updated Apr 16, 2025
    • [CVPR 2025] Online Video Understanding: OVBench and VideoChat-Online
      Python
      23840 Updated Apr 6, 2025
    • Tra-MoE

      Public
      [CVPR 2025] Tra-MoE: Learning Trajectory Prediction Model from Multiple Domains for Adaptive Policy Conditioning
      Python
      13300 Updated Apr 1, 2025
    • TPM

      Public
      [WACV 2025 Oral] Transferring Foundation Models for Generalizable Robotic Manipulation
      Python
      02100 Updated Mar 28, 2025
    • Video-DC

      Public
      Python
      Apache License 2.0
      11010 Updated Mar 21, 2025
    • CaReBench

      Public
      A Fine-grained Benchmark for Video Captioning and Retrieval
      Python
      01510 Updated Mar 20, 2025
    • MoG_Web

      Public
      JavaScript
      0000 Updated Mar 11, 2025
    • MoG-VFI

      Public
      Motion-Aware Generative Frame Interpolation
      Python
      Apache License 2.0
      12420 Updated Mar 11, 2025
    • HTML
      0100 Updated Jan 13, 2025
    • VideoEval

      Public
      VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Model
      Python
      01000 Updated Jan 12, 2025
    • p-MoD

      Public
      p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay
      Python
      Apache License 2.0
      23510 Updated Jan 8, 2025
    • FlowDCN

      Public
      [NeurIPS 2024] Exploring DCN-like Architectures for Fast Image Generation with Arbitrary Resolution
      Python
      03200 Updated Dec 23, 2024
    • SPLAM

      Public
      [ECCV 2024 Oral] SPLAM: Accelerating Image Generation with Sub-path Linear Approximation Model
      Python
      MIT License
      12010 Updated Nov 1, 2024
    • AWT

      Public
      [NeurIPS 2024] AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation
      Python
      Apache License 2.0
      410010 Updated Oct 5, 2024
    • VFIMamba

      Public
      [NeurIPS 2024] VFIMamba: Video Frame Interpolation with State Space Models
      Python
      Apache License 2.0
      910550 Updated Sep 26, 2024
    • [TPAMI 2024] Dynamic MDETR: A Dynamic Multimodal Transformer Decoder for Visual Grounding
      Python
      Apache License 2.0
      02500 Updated Sep 11, 2024
    • PRVG

      Public
      [CVIU 2024] End-to-end dense video grounding via parallel regression
      Python
      Apache License 2.0
      0600 Updated Sep 11, 2024
    • BIVDiff

      Public
      [CVPR 2024] BIVDiff: A Training-free Framework for General-Purpose Video Synthesis via Bridging Image and Video Diffusion Models
      Python
      Apache License 2.0
      27300 Updated Sep 11, 2024
    • CoMAE

      Public
      [AAAI 2023 Oral] CoMAE: Single Model Hybrid Pre-training on Small-Scale RGB-D Datasets
      Python
      43631 Updated Aug 20, 2024
    • SparseOcc

      Public
      [ECCV 2024] Fully Sparse 3D Occupancy Prediction & RayIoU Evaluation Metric
      Python
      Apache License 2.0
      27338221 Updated Aug 15, 2024
    • ProVP

      Public
      [IJCV] Progressive Visual Prompt Learning with Contrastive Feature Re-formation
      Python
      01300 Updated Aug 10, 2024
    • CamLiFlow

      Public
      [CVPR 2022 Oral & TPAMI 2023] Learning Optical Flow and Scene Flow with Bidirectional Camera-LiDAR Fusion
      Python
      2323910 Updated Jul 29, 2024
    • ZeroI2V

      Public
      [ECCV 2024] ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to Video
      Python
      Apache License 2.0
      12100 Updated Jul 29, 2024
    • LinK

      Public
      [CVPR 2023] LinK: Linear Kernel for LiDAR-based 3D Perception
      Python
      MIT License
      59440 Updated Jul 27, 2024
    • SGM-VFI

      Public
      [CVPR 2024] Sparse Global Matching for Video Frame Interpolation with Large Motion
      Python
      57310 Updated Jul 4, 2024
    • ViT-TAD

      Public
      [CVPR 2024] Adapting Short-Term Transformers for Action Detection in Untrimmed Videos
      Python
      11100 Updated Jun 11, 2024