fix: gracefully handle hallucinated dataset IDs in populate_stage to prevent KeyError #152 by veryniceuser · Pull Request #155 · epam/statgpt-backend

veryniceuser · 2026-02-18T13:24:01Z

Skip unknown dataset IDs with a warning log in format_multidataset_queries and V2 populate_stage dataset references, instead of crashing. This prevents a KeyError when the LLM hallucinates a dataset UUID before _remove_hallucinations gets a chance to filter it out.
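A minimal sketch of the skip-and-warn behavior described above. It is not the actual diff: the function name `format_known_dataset_queries`, the dict shapes, and the `name` field are hypothetical stand-ins for whatever `format_multidataset_queries` and `populate_stage` really operate on.

```python
import logging

logger = logging.getLogger(__name__)


def format_known_dataset_queries(queries_by_dataset_id, datasets_by_id):
    """Format queries for known datasets; log and skip unknown dataset IDs.

    Both arguments are hypothetical plain dicts keyed by dataset ID.
    """
    lines = []
    for dataset_id, query in queries_by_dataset_id.items():
        # .get() instead of [] so a hallucinated ID cannot raise KeyError.
        dataset = datasets_by_id.get(dataset_id)
        if dataset is None:
            logger.warning("Skipping unknown dataset ID %r", dataset_id)
            continue
        lines.append(f"{dataset['name']}: {query}")
    return lines
```

With `datasets_by_id = {"a1": {"name": "WEO"}}`, a query dict that also contains an unknown key such as `"zz"` yields output for `"a1"` only, plus one warning in the log.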

Applicable issues

fixes KeyError in populate_stage when LLM hallucinates dataset ID #152

Description of changes

Checklist

Title of the pull request follows Conventional Commits specification
Deployed and tested in a Review environment.

By submitting this pull request, I confirm that my contribution is made under the terms of the MIT license.

navalnica · 2026-02-18T13:44:21Z

The issue is not only with formatting. If the LLM selected an incorrect UUID, the whole query is unusable. We need to delete it inside the indicator selection chain.

navalnica · 2026-02-18T13:46:37Z

Also, it's better not to pass the UUID to the LLM. Instead, we can pass one of:

dataset URN (imf:weo, for example)
source_id/entity_id
1-based index

We also need to make sure there are no duplicates. For example, if there are duplicated URNs (for some reason, though there should not be), we can add an index prefix: "1-imf:weo".
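The aliasing idea above could look roughly like this. It is only a sketch under assumptions: `build_dataset_aliases` and the `urns_by_uuid` mapping are hypothetical names, and the index prefix is applied only to URNs that occur more than once.

```python
from collections import Counter


def build_dataset_aliases(urns_by_uuid):
    """Map LLM-facing aliases back to dataset UUIDs.

    urns_by_uuid is a hypothetical dict of {uuid: urn}. A URN that appears
    more than once gets a 1-based index prefix, e.g. "1-imf:weo".
    """
    counts = Counter(urns_by_uuid.values())
    alias_to_uuid = {}
    for index, (uuid, urn) in enumerate(urns_by_uuid.items(), start=1):
        alias = f"{index}-{urn}" if counts[urn] > 1 else urn
        alias_to_uuid[alias] = uuid
    return alias_to_uuid


aliases = build_dataset_aliases(
    {"u1": "imf:weo", "u2": "imf:ifs", "u3": "imf:weo"}
)
# A hallucinated alias simply has no entry, so the lookup returns None.
resolved = aliases.get("imf:unknown")
```

Only the aliases are shown to the LLM; its answer is translated back through `alias_to_uuid`, and any alias missing from the mapping can be dropped before later stages run.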

veryniceuser · 2026-02-18T13:57:39Z

> The issue is not only with formatting. If the LLM selected an incorrect UUID, the whole query is unusable. We need to delete it inside the indicator selection chain.

I think it gets deleted at a later stage, in _remove_hallucinations().

veryniceuser · 2026-02-18T14:00:22Z

> Also, it's better not to pass the UUID to the LLM. Instead, we can pass one of:
>
> dataset URN (imf:weo, for example)
> source_id/entity_id
> 1-based index
>
> We also need to make sure there are no duplicates. For example, if there are duplicated URNs (for some reason, though there should not be), we can add an index prefix: "1-imf:weo".

Sounds like a bigger change. Should we merge this PR as a hotfix and create a follow-up feature request for these changes?

navalnica · 2026-02-18T15:33:49Z

I don't like the currently proposed change. It implies that, when populating a stage or formatting a query, the query can contain dataset queries for unavailable datasets. This seems incorrect: we should remove any hallucinations from the LLM response and only then call formatting.
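The ordering argued for here (filter first, format afterwards) can be sketched as follows. All names are hypothetical; `remove_unknown_datasets` merely stands in for the role `_remove_hallucinations` plays in the real code.

```python
def remove_unknown_datasets(dataset_queries, available_ids):
    """Drop queries that reference dataset IDs outside the available set."""
    return {k: v for k, v in dataset_queries.items() if k in available_ids}


def format_queries(dataset_queries, names_by_id):
    """Format queries; direct indexing is safe only after filtering."""
    return [f"{names_by_id[k]}: {v}" for k, v in dataset_queries.items()]


names_by_id = {"a1": "WEO"}
llm_output = {"a1": "gdp", "zz": "cpi"}  # "zz" plays a hallucinated ID

cleaned = remove_unknown_datasets(llm_output, names_by_id.keys())
formatted = format_queries(cleaned, names_by_id)
```

In this arrangement the formatter keeps its strict contract, and the guarantee that every ID is known lives in a single place upstream.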

veryniceuser · 2026-02-27T09:34:40Z

> Also, it's better not to pass the UUID to the LLM. Instead, we can pass one of:
>
> dataset URN (imf:weo, for example)
> source_id/entity_id
> 1-based index
>
> We also need to make sure there are no duplicates. For example, if there are duplicated URNs (for some reason, though there should not be), we can add an index prefix: "1-imf:weo".

Created a separate feature request, #172, and pushed the changes in PR #173. This unblocks the hotfix merge and moves review and discussion of the in-depth fix to a separate place.

veryniceuser · 2026-02-27T09:36:20Z

> I don't like the currently proposed change. It implies that, when populating a stage or formatting a query, the query can contain dataset queries for unavailable datasets. This seems incorrect: we should remove any hallucinations from the LLM response and only then call formatting.

I disagree that the proposed change is wrong: populate_stage should not make implicit assumptions, and it must not fail if the code that calls it changes in the future.

On the other hand, swapping remove_hallucinations and populate_stage makes sense. But because of the code structure, such a change seems to require refactoring, so it should be prioritized separately.

veryniceuser requested a review from ypldan as a code owner February 18, 2026 13:24

veryniceuser requested a review from navalnica February 18, 2026 13:24

veryniceuser self-assigned this Feb 18, 2026

DzmitryVabishchewichTR and others added 4 commits February 25, 2026 17:17

Merge remote-tracking branch 'origin/development' into fix/keyerror-hallucinated-dataset-id-in-populate-stage

4434aa7

Add translation logic from UUID to Source ID and back in Indicator Selection Chain

d3ec0bd

Merge branch 'development' into fix/keyerror-hallucinated-dataset-id-in-populate-stage

f0e3657

Revert "Add translation logic from UUID to Source ID and back in Indicator Selection Chain"

557bb32

This reverts commit d3ec0bd.

veryniceuser changed the title ~~fix: handle hallucinated dataset IDs in populate_stage to prevent KeyError #152~~ hotfix: gracefully handle hallucinated dataset IDs in populate_stage to prevent KeyError #152 Feb 27, 2026

veryniceuser removed the request for review from ypldan February 27, 2026 09:36

Log warning for unknown dataset IDs

e1d0957

Add warning log for unknown dataset IDs when populating stage content.

veryniceuser changed the title ~~hotfix: gracefully handle hallucinated dataset IDs in populate_stage to prevent KeyError #152~~ fix: gracefully handle hallucinated dataset IDs in populate_stage to prevent KeyError #152 Feb 27, 2026

DzmitryVabishchewichTR and others added 2 commits February 27, 2026 15:20

Fix formatting

76ca24f

Merge branch 'development' into fix/keyerror-hallucinated-dataset-id-in-populate-stage

2baa79d

veryniceuser wants to merge 8 commits into development from fix/keyerror-hallucinated-dataset-id-in-populate-stage


3 participants
