This repo demonstrates two methods for syntax-error-free decoding (a minimal sketch of each follows the list):
- using a context-free grammar (defined with lark)
- using a finite-state automaton that is evaluated directly on the GPU and is thus 4x faster
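
For the grammar-based variant, the core idea is to reject any continuation that can no longer be completed into a valid string of the grammar. Below is a minimal, hypothetical sketch of such a prefix-validity check built on lark; the toy grammar, function names, and exception handling are assumptions for illustration, not this repo's actual implementation:

```python
from lark import Lark
from lark.exceptions import UnexpectedCharacters, UnexpectedEOF, UnexpectedToken

# Toy arithmetic grammar for illustration; the grammar used in this repo may differ.
GRAMMAR = r"""
?start: expr
?expr: expr "+" term | expr "-" term | term
?term: term "*" factor | term "/" factor | factor
?factor: NUMBER | "(" expr ")"
%import common.NUMBER
%ignore " "
"""

parser = Lark(GRAMMAR, parser="lalr")

def is_valid_prefix(text: str) -> bool:
    """True if `text` is a valid expression or can still be extended into one."""
    try:
        parser.parse(text)
        return True                       # already a complete, valid expression
    except UnexpectedEOF:
        return True                       # input ended early: still extensible
    except UnexpectedToken as exc:
        return exc.token.type == "$END"   # only "ran out of input" is recoverable
    except UnexpectedCharacters:
        return False                      # lexer error: no continuation can fix it

# During decoding, a candidate token t is kept only if
# is_valid_prefix(generated_text + tokenizer.decode([t])) holds.
```

Note that this simple sketch can wrongly reject prefixes that end in the middle of a multi-character token; a production implementation has to handle partial tokens more carefully.

For the automaton-based variant, the point of evaluating the automaton on the GPU is that the per-step token mask becomes a single tensor lookup instead of a Python loop over the vocabulary. A minimal sketch, assuming a hypothetical precomputed token-level transition table (sizes and names are illustrative, not this repo's actual code):

```python
import torch

device = "cuda" if torch.cuda.is_available() else "cpu"

# Hypothetical token-level DFA: transitions[state, token_id] = next state, -1 = reject.
# Precomputing this table once lets every decoding step stay on the GPU.
NUM_STATES, VOCAB_SIZE = 8, 32000
transitions = torch.full((NUM_STATES, VOCAB_SIZE), -1, dtype=torch.long, device=device)
# ... fill in the edges that encode the allowed tool-call syntax here ...

def mask_logits(logits: torch.Tensor, state: int) -> torch.Tensor:
    """Set the logits of every token the automaton rejects in `state` to -inf."""
    allowed = transitions[state] >= 0                  # (VOCAB_SIZE,) boolean mask
    return logits.masked_fill(~allowed, float("-inf"))

def advance(state: int, token_id: int) -> int:
    """Follow the DFA edge for the token that was actually sampled."""
    return int(transitions[state, token_id])
```

In both cases the mask is applied to the model's next-token logits before sampling, so the model can only emit syntactically valid continuations.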
These approaches are used to improve the performance of the Zephyr 7B LLM on FuncQA, a math equation benchmark, where they outperform the current state-of-the-art.
It was developed as part of a seminar at HPI; additional resources and details are given in the report.
The results of our approach, compared with the ToolDec baseline from the literature and with ChatGPT, are as follows.
| Model | FuncQA Accuracy |
|---|---|
| Zephyr 7B Chat (ours) + CFG | 14.7% |
| Zephyr 7B Chat (ours) + CFG + SFT | 19.1% |
| ToolDec | 13.2% |
| ChatGPT (0-shot) | 9.0% |
The methods implemented here are inspired by the following two papers:
- ToolDec: Syntax Error-Free and Generalizable Tool Use for LLMs via Finite-State Decoding [arxiv]
- ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings [arxiv]
You can reproduce the experiments by running:
pip install -r requirements.txt
sh scripts/training_commands.sh
sh scripts/eval_commands.sh
