Official PyTorch implementation of the paper: "Active Visual Exploration Based on Attention-Map Entropy"
Active Visual Exploration Based on Attention-Map Entropy
Adam Pardyl, Grzegorz Rypeść, Grzegorz Kurzejamski, Bartosz Zieliński, Tomasz Trzciński
https://doi.org/10.24963/ijcai.2023/145

Abstract: Active visual exploration addresses the issue of limited sensor capabilities in real-world scenarios, where successive observations are actively chosen based on the environment. To tackle this problem, we introduce a new technique called Attention-Map Entropy (AME). It leverages the internal uncertainty of the transformer-based model to determine the most informative observations. In contrast to existing solutions, it does not require additional loss components, which simplifies the training. Through experiments, which also mimic retina-like sensors, we show that such simplified training significantly improves the performance of reconstruction, segmentation and classification on publicly available datasets.
git clone https://github.com/apardyl/AME.git && cd AME
conda env create -f environment.yml -n wtln # we recommend using mamba instead of conda for faster environment solving
conda activate wtln
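Optionally, verify the environment before training with a quick sanity check (not part of the original setup, just a convenience):
python -c "import torch; print(torch.__version__, torch.cuda.is_available())"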
- download and extract the dataset of your choice
- run training with:
python train.py <dataset> <model> [params]
where dataset is one of:
- Reconstruction task:
  - ADE20KReconstruction
  - Coco2014Reconstruction
  - Sun360Reconstruction
  - TestImageDirReconstruction
- Segmentation task:
  - ADE20KSegmentation
- Classification task:
  - Sun360Classification (for the train-all configuration)
  - EmbedClassification (for the head-only configuration; prepare embeddings with predict.py first)
and the model is one of:
- {Attention/Random/Checkerboard}Mae for reconstruction
- {Attention/Random/Checkerboard}SegMae for segmentation
- {Attention/Random/Checkerboard}ClsMae for train-all classification
- EmbedClassifier for head-only classification
Example: run AttentionMae on MS COCO 2014 with the reconstruction task:
python train.py Coco2014Reconstruction AttentionMae --data-dir DATASET_DIR
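Other task/model combinations follow the same pattern; for instance, a segmentation run on ADE20K could look like this (assuming the --data-dir param is shared across tasks):
python train.py ADE20KSegmentation AttentionSegMae --data-dir DATASET_DIR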
Run python train.py <dataset> <model> --help for a list of available training params.
Visualizations from the paper can be generated using predict.py (use the --visualization-path param).
Embeddings for head-only classification are generated with predict.py (--dump-path param).
Average glimpse selection maps and evaluation of a trained model can be obtained with predict.py using the --avg-glimpse-path and --test params, respectively.
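The full argument list of predict.py is not spelled out here; assuming it mirrors the <dataset> <model> positional arguments of train.py, the calls could look roughly as follows (check python predict.py --help for the actual interface and checkpoint options):
python predict.py Coco2014Reconstruction AttentionMae --visualization-path VIS_DIR
python predict.py Sun360Classification AttentionClsMae --dump-path EMBED_DIR
python predict.py Coco2014Reconstruction AttentionMae --avg-glimpse-path MAPS_DIR --test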
Models trained by us are available at: https://huggingface.co/apardyl/AME/tree/main
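One way to fetch the checkpoints is to clone the Hugging Face repository directly (a sketch, requires git-lfs; the repository can also be browsed and downloaded manually in a web browser):
git lfs install
git clone https://huggingface.co/apardyl/AME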
If you find this work useful for your research, please cite the following:
@inproceedings{pardyl2023active,
title = {Active Visual Exploration Based on Attention-Map Entropy},
author = {Pardyl, Adam and Rypeść, Grzegorz and Kurzejamski, Grzegorz and Zieliński, Bartosz and Trzciński, Tomasz},
booktitle = {Proceedings of the Thirty-Second International Joint Conference on
Artificial Intelligence, {IJCAI-23}},
publisher = {International Joint Conferences on Artificial Intelligence Organization},
editor = {Edith Elkind},
pages = {1303--1311},
year = {2023},
month = {8},
note = {Main Track},
doi = {10.24963/ijcai.2023/145},
url = {https://doi.org/10.24963/ijcai.2023/145}
}