@anxkhn commented Oct 2, 2025

Currently, Ollama inference defaults to num_ctx=2048, which is too small for longer prompts and outputs. This results in truncated responses, as confirmed by the Ollama logs.

This PR updates our configuration and utility code to set num_ctx explicitly, preventing prompt and response cutoffs for detailed posts. Following Ollama's guidance, we set num_ctx to a higher default of 8192, which seems like a sweet spot that keeps compatibility with smaller machines; Ollama will automatically cap it at the model's maximum supported context size.

Changes:

  • Added ollama_num_ctx setting to config.toml (default: 8192).
  • Updated ollama_predict in leetcomp/utils.py to pass num_ctx from config (see the sketch below).
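For reference, a minimal sketch of the change. This is illustrative rather than the exact diff: it assumes the official ollama Python client and a simple ollama_predict signature, and the actual code in leetcomp/utils.py may load config differently or call the HTTP API directly.

```python
# Sketch only — config key and function signature are assumptions, not the exact repo code.
import tomllib  # stdlib TOML parser, Python 3.11+

import ollama  # official Ollama Python client

with open("config.toml", "rb") as f:
    config = tomllib.load(f)

# config.toml is assumed to contain: ollama_num_ctx = 8192
NUM_CTX = config.get("ollama_num_ctx", 8192)


def ollama_predict(model: str, prompt: str) -> str:
    # options.num_ctx overrides Ollama's 2048-token default for this request;
    # Ollama caps it at the model's maximum supported context size.
    response = ollama.generate(
        model=model,
        prompt=prompt,
        options={"num_ctx": NUM_CTX},
    )
    return response["response"]
```

The same options field ({"num_ctx": ...}) is accepted by the /api/generate and /api/chat HTTP endpoints, so the approach holds whether the code uses the client library or raw requests.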

Result:
Ensures longer prompts and responses are not truncated, without requiring per-model manual tuning.

