VQA-HUD

This is the repository for the AAAI 2025 paper "Mind the Uncertainty in Human Disagreement: Evaluating Discrepancies Between Model Predictions and Human Responses in VQA".

🛠 Installation & Usage

Clone the repository:

git clone https://github.com/mainlp/vqa-hud.git
cd vqa-hud

Prepare the dataset and base models: download the VQA 2.0 dataset, then follow the LXMERT and BEIT3 instructions to fine-tune their provided checkpoints.

TODO

[☑️] script for HUD scores
[☑️] script for Evaluation

Compute the HUD scores and create the data splits by running:

 python HUD_score.py
 python HUD_split.py --ascending
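To make the scoring-and-splitting step more concrete, here is a minimal sketch of one way to score human disagreement and split samples by that score. It assumes the score is the Shannon entropy of the annotator answers for each question, and that an ascending order runs from least to most disagreement; neither assumption is confirmed by this README, and the function names and toy data are hypothetical rather than taken from the repository scripts.

```python
import math
from collections import Counter


def answer_entropy(answers):
    """Shannon entropy (in bits) of the empirical answer distribution."""
    counts = Counter(answers)
    total = len(answers)
    return sum((c / total) * math.log2(total / c) for c in counts.values())


def split_by_score(scores, n_splits=3, ascending=True):
    """Sort question ids by score and cut them into n_splits contiguous parts."""
    ordered = sorted(scores, key=scores.get, reverse=not ascending)
    n = len(ordered)
    return [ordered[n * i // n_splits: n * (i + 1) // n_splits]
            for i in range(n_splits)]


# Hypothetical toy data: question id -> ten annotator answers.
human_answers = {
    "q1": ["yes"] * 10,                 # full agreement
    "q2": ["yes"] * 5 + ["no"] * 5,     # maximal two-way disagreement
    "q3": ["red"] * 8 + ["orange"] * 2, # mild disagreement
}

scores = {qid: answer_entropy(ans) for qid, ans in human_answers.items()}
splits = split_by_score(scores, n_splits=3, ascending=True)
```

With this toy data, `splits` orders the questions from full agreement to maximal disagreement. The actual scoring rule and split sizes are defined in the repository scripts.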

All evaluation functions are in evaluations.py; use them as building blocks for customized evaluations on your own data.
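As an illustration of what a customized evaluation might look like, the sketch below compares a model's predicted answer distribution with the distribution of human answers using KL divergence and total variation distance. These two measures are chosen here as common examples; the function names, the toy vocabulary, and the model probabilities are hypothetical and do not reflect the repository's evaluation code.

```python
import math
from collections import Counter


def human_distribution(answers, vocab):
    """Relative frequency of each vocab answer among the human answers.

    Answers outside vocab are ignored, so the result may sum to less than 1.
    """
    counts = Counter(answers)
    total = len(answers)
    return [counts.get(a, 0) / total for a in vocab]


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) in nats; terms with p_i = 0 contribute nothing."""
    return sum(pi * math.log(pi / max(qi, eps))
               for pi, qi in zip(p, q) if pi > 0)


def total_variation(p, q):
    """Total variation distance: half the L1 distance between p and q."""
    return 0.5 * sum(abs(pi - qi) for pi, qi in zip(p, q))


vocab = ["yes", "no"]                                        # hypothetical answer vocabulary
human = human_distribution(["yes"] * 7 + ["no"] * 3, vocab)  # [0.7, 0.3]
model = [0.9, 0.1]                                           # hypothetical model probabilities

kl = kl_divergence(human, model)
tv = total_variation(human, model)
```

Lower values of either measure mean the model's distribution is closer to the human one; both are zero when the two distributions coincide.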

📄 Citation

@article{Lan_Frassinelli_Plank_2025,
  title={Mind the Uncertainty in Human Disagreement: Evaluating Discrepancies Between Model Predictions and Human Responses in VQA},
  volume={39},
  url={https://ojs.aaai.org/index.php/AAAI/article/view/32468},
  DOI={10.1609/aaai.v39i4.32468},
  number={4},
  journal={Proceedings of the AAAI Conference on Artificial Intelligence},
  author={Lan, Jian and Frassinelli, Diego and Plank, Barbara},
  year={2025},
  month={Apr.},
  pages={4446-4454}
}
