baskargroup/Ag_reasoning

AgReasoning Benchmark

This repository accompanies "AgReasoning Benchmark", which introduces a large-scale question-answering (QA) benchmark tailored to the agricultural domain.

📄 Paper Summary

  • Goal: Benchmark LLMs and reasoning models on domain-specific agronomic QA tasks.
  • Dataset: 55K expert-in-the-loop QA pairs covering diverse agricultural question categories.
  • Key Contributions:
    • A multi-stage flowchart-driven pipeline for dataset curation.
    • An evaluation framework using LLM-as-a-Judge.
    • A distilled model that matches larger models in performance with higher efficiency.
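
The LLM-as-a-Judge idea listed above can be illustrated with a minimal sketch. This is not the repository's evaluation code: the `QAPair` type, the prompt wording, the CORRECT/INCORRECT verdict format, and the `answer_fn` / `judge_fn` callables are all assumptions made for illustration. In practice, `answer_fn` would wrap the model under test and `judge_fn` would wrap a call to the judge model.

```python
from dataclasses import dataclass
from typing import Callable, Iterable


@dataclass
class QAPair:
    """Hypothetical record type: one question with its reference answer."""
    question: str
    reference: str


def build_judge_prompt(pair: QAPair, candidate: str) -> str:
    """Format a grading prompt; the wording is illustrative only."""
    return (
        "You are grading an answer to an agronomy question.\n"
        f"Question: {pair.question}\n"
        f"Reference answer: {pair.reference}\n"
        f"Candidate answer: {candidate}\n"
        "Reply with exactly one word: CORRECT or INCORRECT."
    )


def parse_verdict(reply: str) -> bool:
    """Map the judge's reply to a boolean; anything unexpected counts as wrong."""
    words = reply.strip().upper().split()
    return bool(words) and words[0].strip(".,:;!") == "CORRECT"


def judge_accuracy(
    pairs: Iterable[QAPair],
    answer_fn: Callable[[str], str],  # model under test: question -> answer
    judge_fn: Callable[[str], str],   # judge model: prompt -> verdict text
) -> float:
    """Return the fraction of answers the judge marks as correct."""
    pairs = list(pairs)
    if not pairs:
        return 0.0
    correct = 0
    for pair in pairs:
        candidate = answer_fn(pair.question)
        reply = judge_fn(build_judge_prompt(pair, candidate))
        correct += parse_verdict(reply)
    return correct / len(pairs)
```

Passing both models in as plain callables keeps the scoring loop independent of any particular API client, so the same loop can be exercised with stub functions before real models are connected.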
