Egocentric videos are long and unstructured, which makes retrieving specific information from them challenging. This project extends natural-language temporal localization by also generating textual answers from the localized video segments, so queries can be answered directly instead of only returning a time span, and the model's predictions become easier to interpret.
✔ Two Model Architectures
- VSLBase (a simplified baseline) and VSLNet (adds Query-Guided Highlighting; see the sketch after this list)
- Supports both Omnivore and EgoVLP pre-extracted video features
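A minimal sketch of the Query-Guided Highlighting idea behind VSLNet, assuming pre-extracted clip features and a pooled query vector; the module name, tensor shapes, and layer choices here are illustrative assumptions, not the exact implementation in this repo:

```python
import torch
import torch.nn as nn

class QueryGuidedHighlight(nn.Module):
    """Scores each video feature by its relevance to the query, then
    re-weights the features so later layers focus on highlighted moments.
    Illustrative sketch; dimensions and layers are assumptions."""

    def __init__(self, dim: int):
        super().__init__()
        # Projects [video_feature ; query_summary] -> scalar highlight score
        self.scorer = nn.Linear(2 * dim, 1)

    def forward(self, video: torch.Tensor, query: torch.Tensor):
        # video: (batch, time, dim) clip features (e.g. Omnivore or EgoVLP)
        # query: (batch, dim) pooled sentence representation of the query
        q = query.unsqueeze(1).expand_as(video)  # broadcast query over time
        scores = torch.sigmoid(self.scorer(torch.cat([video, q], dim=-1)))
        highlighted = video * scores             # suppress irrelevant moments
        return highlighted, scores.squeeze(-1)   # features + per-step scores
```

VSLBase omits this module and passes the video features through unweighted, which is what makes it the simpler of the two architectures.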
✔ End-to-End Pipeline
- Localizes relevant video segments
- Generates textual answers
- Evaluates generated answers with ROUGE and METEOR metrics (see the sketch below)
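To illustrate the evaluation step, here is a minimal sketch using the Hugging Face `evaluate` library as one possible backend (this repo may compute the metrics differently); the prediction and reference strings are hypothetical:

```python
import evaluate  # pip install evaluate rouge_score nltk

# Hypothetical example: generated answers vs. ground-truth answers.
predictions = ["the keys are on the kitchen counter"]
references = ["the keys were left on the kitchen counter"]

rouge = evaluate.load("rouge")
meteor = evaluate.load("meteor")  # requires NLTK's WordNet data

print(rouge.compute(predictions=predictions, references=references))
# -> {'rouge1': ..., 'rouge2': ..., 'rougeL': ..., 'rougeLsum': ...}
print(meteor.compute(predictions=predictions, references=references))
# -> {'meteor': ...}
```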
✔ Optimized for Egocentric Videos
- Handles long, unstructured first-person recordings
- Focuses computational resources on the localized key moments rather than the full video (see the sketch below)
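A sketch of how localization narrows the work for answer generation: only the features inside the predicted span are passed downstream. The span-to-index conversion and the `answer_generator` call are hypothetical placeholders, not this repo's API:

```python
import torch

def focus_on_span(features: torch.Tensor, start_s: float, end_s: float,
                  fps: float) -> torch.Tensor:
    """Keep only the clip features inside the predicted [start_s, end_s] span.
    features: (time, dim) pre-extracted features sampled at `fps` per second."""
    lo = max(0, int(start_s * fps))
    hi = min(features.size(0), int(end_s * fps) + 1)
    return features[lo:hi]

# Hypothetical usage: localize first, then answer from the short segment only.
# segment = focus_on_span(video_features, start_s=12.0, end_s=18.5, fps=1.87)
# answer = answer_generator(segment, query)  # placeholder for the generator
```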