Name		Name	Last commit message	Last commit date
parent directory ..
README.md		README.md
__init__.py		__init__.py
claude-vision.py		claude-vision.py
gemini-pro-vision.py		gemini-pro-vision.py
gemini-pro.py		gemini-pro.py
gpt4.py		gpt4.py
gpt4v.py		gpt4v.py
llama.py		llama.py

README.md

OpenEQA Baselines

The commands required to run serveral baselines are listed below. Some baselines are labeled (language-only) because the model only receives an EQA question $Q$ and must answer based on its prior knowledge of the world. Others baselines are vision-language models (VLMs), which are able to jointly process the question $Q$ and image frames from the episode history $H$.

GPT-4 (language-only)

# requires setting the OPENAI_API_KEY environment variable
python openeqa/baselines/gpt4.py --dry-run  # remove --dry-run to process the full benchmark

LLaMA (language-only)

First, download LLaMA weights in the Hugging Face format from here. Then, run:
```
python openeqa/baselines/llama.py -m <path/to/hf/weights>
```

GPT-4V (vision + language)

# requires setting the OPENAI_API_KEY environment variable
python openeqa/baselines/gpt4v.py --num-frames 50 --dry-run  # remove --dry-run to process the full benchmark

Gemini Pro (language-only)

# requires setting the GOOGLE_API_KEY environment variable
python openeqa/baselines/gemini-pro.py --dry-run  # remove --dry-run to process the full benchmark

Gemini Pro Vision (vision + language)

# requires setting the GOOGLE_API_KEY environment variable
python openeqa/baselines/gemini-pro-vision.py --num-frames 15 --dry-run  # remove --dry-run to process the full benchmark

Claude 3 (vision + language)

# requires setting the ANTHROPIC_API_KEY environment variable
python openeqa/baselines/claude-vision.py --num-frames 20 --dry-run  # remove --dry-run to process the full benchmark

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

baselines

baselines

README.md

OpenEQA Baselines

Files

baselines

Directory actions

More options

Directory actions

More options

Latest commit

History

baselines

Folders and files

parent directory

README.md

OpenEQA Baselines