AdvisorQA

This is the GitHub repository for "AdvisorQA: Towards Helpful and Harmless Advice-seeking Question Answering with Collective Intelligence", accepted at NAACL 2025. The paper discusses the hurdles to progress in subjective QA, chiefly those that arise in post-processing (alignment).

The AdvisorQA dataset is available at "[data link]". If you download it as JSON files, move them to the 'data' directory before running post-training: SFT, DPO, and PPO.
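As a rough illustration of the step above, the sketch below moves downloaded JSON files into the 'data' directory and parses one to check that it is valid JSON. The helper names `stage_json_files` and `load_json` and the `downloads` folder are assumptions made for this example; they are not part of the repository, and the actual file names and schema of the dataset are not specified here.

```python
import json
import shutil
from pathlib import Path


def stage_json_files(download_dir, data_dir="data"):
    """Move every *.json file from download_dir into data_dir.

    Hypothetical helper: the repository does not ship this function.
    Returns the list of destination paths.
    """
    source = Path(download_dir)
    target = Path(data_dir)
    target.mkdir(parents=True, exist_ok=True)  # create 'data' if it is missing
    moved = []
    for path in sorted(source.glob("*.json")):
        destination = target / path.name
        shutil.move(str(path), str(destination))
        moved.append(destination)
    return moved


def load_json(path):
    """Parse one staged file to confirm it is valid JSON."""
    with open(path, encoding="utf-8") as handle:
        return json.load(handle)
```

Calling `stage_json_files("downloads")` from the repository root would place the files under `data`, where the post-training scripts are expected to look for them; adjust the source folder to wherever the download actually landed.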

Use the following to cite our paper:

@article{kim2024advisorqa,
  title={AdvisorQA: Towards Helpful and Harmless Advice-seeking Question Answering with Collective Intelligence},
  author={Kim, Minbeom and Lee, Hwanhee and Park, Joonsuk and Lee, Hwaran and Jung, Kyomin},
  journal={arXiv preprint arXiv:2404.11826},
  year={2024}
}

Repository files:

data
log
model
results
reward_model
DPO.py
Fine-grained-views.png
Overviews.pdf
PPO.py
README.md
SFT.py
generator.py
reward_modeling.py
