This is the official repository of the article "Bongard in Wonderland: Visual Puzzles that Still Make AI Go Mad?".
To run the code, you can either set up a conda environment and install `requirements.txt` (without LLaVA) or build the Docker container to launch LLaVA on your machine. You can find more details on this in `llava_steps.md`.
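
For the conda route, a minimal setup could look like the following sketch (the environment name and Python version are illustrative assumptions, not pinned by the repository):

```bash
# Minimal conda setup; "bongard" and the Python version are placeholders
conda create -n bongard python=3.10
conda activate bongard
pip install -r requirements.txt
```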
The experimental scripts can be found in `experiments/`. You can execute them from the command line, e.g.,

```bash
python experiments/zero_shot_bp.py --model "gpt-4o"
```
Make sure to include your API access keys in the respective folder of each model, e.g., `gpt-4o/open-ai-key`.
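
For example, for GPT-4o the key could be stored like this (a sketch; it assumes the scripts read the key as a plain-text file, which may differ from the exact format they expect):

```bash
# Place your OpenAI API key in the model's folder (path from above);
# "sk-..." is a placeholder for your actual key
echo "sk-..." > gpt-4o/open-ai-key
```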
The results of the evaluations will be stored in `results/`. The evaluation scripts, including the LLM judge, can be found in `experiments/evaluate`. You can run those from the command line as well, e.g.,

```bash
python experiments/zero_shot_bp.py --model "gpt-4o" --mode "zero_shot"
```
We use the dataset provided by Depeweg et al. [1], which contains the 100 original Bongard Problems in high resolution (link here).
For the perception-focused evaluation, we considered the individual diagrams of BPs 16, 19, 29, and 36. These are stored in `data/bongard-problems-high-res/`.
[1] Depeweg, S., Rothkopf, C.A., Jäkel, F. (2024). Solving Bongard Problems with a Visual Language and Pragmatic Constraints. Cognitive Science, 48(5), e13432.
If you find the code in this repository helpful, please consider citing us:
```bibtex
@inproceedings{wust2bongard,
  title={Bongard in Wonderland: Visual Puzzles that Still Make AI Go Mad?},
  author={W{\"u}st, Antonia and Tobiasch, Tim and Helff, Lukas and Dhami, Devendra Singh and Rothkopf, Constantin A and Kersting, Kristian},
  booktitle={The First Workshop on System-2 Reasoning at Scale, NeurIPS'24},
  year={2024}
}
```