GitHub - RaghavPrabhakar66/COAT-code

Official Codebase for 'Physical Reasoning and Object Planning for Household Embodied Agents' [TMLR May 2024]

This repository contains code for the following:

1. AI2Thor experiment
2. Dataset creation
3. Evaluating the datasets

AI2Thor Experiment

We query LLMs and ConceptNet to generate object utilities for all visible objects, and save the RGB frames with bounding-box and utility labels. This experiment highlighted the shortcomings of knowledge graphs such as ConceptNet and helped us conclude, qualitatively, that LLMs contain proper object-utility mappings.

| ConceptNet | PaLM | Alpaca7B |
| --- | --- | --- |
| Closed-set utilities from ConceptNet | Open-set utilities from PaLM | Open-set utilities from Alpaca7B |

The code is in the `thortils/` folder.

Install AI2Thor

pip install ai2thor

Install thortils repository

- `concept_query.py`: standalone code to query ConceptNet.
- `bbox_conceptnet_query_teleop.py`: [teleop] builds a dictionary of object-utility pairings (ConceptNet) and saves RGB frames with bounding-box utility labels.
- `precoded_traj_llm_query.py`: [pre-coded trajectory] builds a dictionary of object-utility pairings (PaLM/Alpaca7B) and saves RGB frames with bounding-box utility labels.
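
The object-utility dictionary described above can be sketched roughly as follows. This is a minimal illustration, not the repository's implementation: `visible_object_types`, `build_utility_map`, and `toy_query` are hypothetical names, and `toy_query` stands in for the real ConceptNet or LLM call. The sample entries assume the `objectType` and `visible` fields found in AI2Thor's per-object metadata.

```python
from typing import Callable, Dict, Iterable, List


def visible_object_types(objects: Iterable[dict]) -> List[str]:
    """Return the sorted, de-duplicated types of objects marked visible."""
    return sorted({obj["objectType"] for obj in objects if obj.get("visible")})


def build_utility_map(
    object_types: Iterable[str],
    query_fn: Callable[[str], List[str]],
) -> Dict[str, List[str]]:
    """Map each object type to the utilities returned by query_fn."""
    return {obj_type: query_fn(obj_type) for obj_type in object_types}


def toy_query(object_type: str) -> List[str]:
    """Hypothetical stand-in for a ConceptNet or LLM query (no network access)."""
    toy_knowledge = {"Knife": ["cutting"], "Mug": ["drinking"]}
    return toy_knowledge.get(object_type, [])


# Assumed to be shaped like entries of event.metadata["objects"] in AI2Thor.
sample_objects = [
    {"objectType": "Knife", "visible": True},
    {"objectType": "Mug", "visible": True},
    {"objectType": "Fridge", "visible": False},
]

utility_map = build_utility_map(visible_object_types(sample_objects), toy_query)
print(utility_map)
```

Passing the query as a function keeps the dictionary-building step the same whether the utilities come from a closed-set source such as ConceptNet or an open-set source such as an LLM.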

Video summary of the AI2Thor experiment: YouTube Video Link

Dataset Creation Code:

Download the datasets and store them under `thortils/data/` in the `task_u`, `task_0`, `task_1`, `task_2`, `task_fi`, and `task_fm` folders.
The Google Drive folder already follows this directory structure.

The code for creating the datasets is located in `thortils/data`.

When running the scripts, pass `/thortils` as `args.root`.
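
The name `args.root` suggests an `argparse` namespace. A minimal sketch of how such an argument could be declared is shown below; the `--root` flag name, its default, and the help text are assumptions for illustration and may not match the actual scripts.

```python
import argparse
from typing import List, Optional


def parse_args(argv: Optional[List[str]] = None) -> argparse.Namespace:
    """Parse a root directory argument (the flag name is an assumption)."""
    parser = argparse.ArgumentParser(description="Dataset generation (sketch)")
    parser.add_argument(
        "--root",
        default="/thortils",
        help="Root directory that the script resolves its data files against",
    )
    return parser.parse_args(argv)


# An explicit argument list is passed here so the example runs anywhere.
args = parse_args(["--root", "/thortils/data"])
print(args.root)
```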
## Task-u:
Requires: `objects.json`
- Run `task_u_dataset_gen.py` to create the task_u dataset.

## Task-0:
Requires: `objects.json`, `tasks.json`, `oracle.json`
- Add your task, concept, concept objects, and oracle objects to the JSON files.
- Run `task_0-1-2_dataset_gen.py` to create the task_0 dataset.

## Task-1:
Requires: `tasks.json`, `oracle.json`, `pouch_oracle.json`, `all_config.json`
- Run `task_0-1-2_dataset_gen.py` to create the task_1 dataset.

## Task-2:
Requires: `tasks.json`, `pouch_subop.json`
- Run `task_0-1-2_dataset_gen.py` to create the task_2 dataset.

## Full Pipeline Data Creation:
When running the scripts, pass `/thortils/data` as `args.root`.

Requires: `tasks.json`, `oracle.json`, `pouch_subop.json`, `all_config.json`, and `full_pipeline_datasets_helper.py`
- Run `full-pipeline-dataset-creation.py`.
- The script can generate both the F_ideal and F_moderate datasets.
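
Before running a generation script, it can help to confirm that its prerequisites are in place. The sketch below encodes the requirements listed above. `REQUIRED_FILES` and `missing_inputs` are hypothetical names, and the assumption that all inputs sit in a single directory may not match the repository layout.

```python
from pathlib import Path
from typing import Dict, List

# Prerequisites per dataset, copied from the lists above.
REQUIRED_FILES: Dict[str, List[str]] = {
    "task_u": ["objects.json"],
    "task_0": ["objects.json", "tasks.json", "oracle.json"],
    "task_1": ["tasks.json", "oracle.json", "pouch_oracle.json", "all_config.json"],
    "task_2": ["tasks.json", "pouch_subop.json"],
    "full_pipeline": [
        "tasks.json",
        "oracle.json",
        "pouch_subop.json",
        "all_config.json",
        "full_pipeline_datasets_helper.py",
    ],
}


def missing_inputs(task: str, input_dir: Path) -> List[str]:
    """Return the required files for `task` that are absent from input_dir."""
    return [name for name in REQUIRED_FILES[task] if not (input_dir / name).is_file()]


if __name__ == "__main__":
    # Assumption: inputs live in the current directory; adjust as needed.
    for task in REQUIRED_FILES:
        print(task, "missing:", missing_inputs(task, Path(".")))
```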

Evaluation Code [for task-0,1,2 evaluations]

export API_KEY='your PALM API KEY'
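
Once exported, the key can be read from the environment. How the evaluation code actually reads it is not shown here, so the following is only a sketch; `get_api_key` is a hypothetical helper that fails early with a clear message when the variable is unset.

```python
import os


def get_api_key(env_var: str = "API_KEY") -> str:
    """Return the API key from the environment, or raise if it is unset."""
    key = os.environ.get(env_var, "").strip()
    if not key:
        raise RuntimeError(
            f"{env_var} is not set; export it before running the evaluation."
        )
    return key
```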

The code for evaluating language models is located in `/commonsense`.

When running the scripts, pass `/thortils/data` as `args.root`.

- Run `evaluation_script.py`.
- `LLM.py` defines a class for configuring a language model and prompting it.
- `database.py` provides a caching mechanism so that evaluations can be resumed.
- `constants.py` sets the constants needed to run the evaluations.
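
To illustrate the idea of caching responses so that an interrupted evaluation can resume without repeating model calls, here is a minimal sketch using Python's built-in `sqlite3`. It is not the code in `database.py`: `ResponseCache`, `cached_query`, and `fake_model` are hypothetical names, and the storage backend is an assumption.

```python
import sqlite3
from typing import Callable, List, Optional


class ResponseCache:
    """Store prompt/response pairs so repeated prompts skip the model call."""

    def __init__(self, path: str = ":memory:") -> None:
        # Pass a file path instead of ":memory:" to persist across runs.
        self.conn = sqlite3.connect(path)
        self.conn.execute(
            "CREATE TABLE IF NOT EXISTS responses "
            "(prompt TEXT PRIMARY KEY, response TEXT NOT NULL)"
        )
        self.conn.commit()

    def get(self, prompt: str) -> Optional[str]:
        row = self.conn.execute(
            "SELECT response FROM responses WHERE prompt = ?", (prompt,)
        ).fetchone()
        return row[0] if row else None

    def put(self, prompt: str, response: str) -> None:
        self.conn.execute(
            "INSERT OR REPLACE INTO responses (prompt, response) VALUES (?, ?)",
            (prompt, response),
        )
        self.conn.commit()


def cached_query(
    cache: ResponseCache, prompt: str, model_fn: Callable[[str], str]
) -> str:
    """Return the cached response if present; otherwise query and store it."""
    cached = cache.get(prompt)
    if cached is not None:
        return cached
    response = model_fn(prompt)
    cache.put(prompt, response)
    return response


calls: List[str] = []


def fake_model(prompt: str) -> str:
    """Stand-in for a real language model call; records each invocation."""
    calls.append(prompt)
    return f"answer to: {prompt}"


cache = ResponseCache()
first = cached_query(cache, "Which object can cut bread?", fake_model)
second = cached_query(cache, "Which object can cut bread?", fake_model)
print(first == second, len(calls))  # prints "True 1"
```

Keying the cache on the full prompt text means a resumed run only pays for prompts it has not already answered.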
