I may be slow to respond.
AIGC and VLM
-
Alibaba Cloud
- Hangzhou, China
-
06:40
(UTC +08:00) - https://xiaosu-zhu.github.io
Pinned Loading
-
D2I-ai/Qwen-VL-Narrator
D2I-ai/Qwen-VL-Narrator PublicQwen-VL-Narrator is an expert model for understanding video clips to generate fine-grained descriptions. Qwen-VL-Narrator 是一个影视领域专家模型,能够提供影视频片段详细描述,应用于视频检索、摘要、理解、细粒度标注等场景,也能用于视频生成工作流来实现视频反推。
-
roscenes/RoScenes
roscenes/RoScenes Public[ECCV 2024] RoScenes: A large-scale multi-view 3d dataset for roadside perception
-
-
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.





