neurips25-eai

Embodied Agent Interface (EAI) Challenge @ NeurIPS 2025

Benchmark

https://embodied-agent-interface.github.io/

benchmark dataset: 338 tasks in VirtualHome, 100 tasks in BEHAVIOR

Challenge

https://neurips25-eai.github.io/

https://eval.ai/web/challenges/challenge-page/2621/overview

09/02 started working on this challenge and learning LLM from zero
11/15 last day of the challenge development phase

set up conda environment

conda create -n EAIenv python=3.12
conda activate EAIenv
pip install vllm==0.11.0 --extra-index-url https://download.pytorch.org/whl/cu128
pip install transformers==4.57.1
pip install flashinfer-python==0.4.1
pip install scikit-learn matplotlib pandas

Changelog

09/23 add Qwen3
09/30 add GPT-OSS
10/15 update conda environment

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
code		code
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

neurips25-eai

Benchmark

Challenge

set up conda environment

Changelog

About

Uh oh!

Languages

License

artcmd/neurips25-eai

Folders and files

Latest commit

History

Repository files navigation

neurips25-eai

Benchmark

Challenge

set up conda environment

Changelog

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Languages