llm-reasoning

Here are 89 public repositories matching this topic...

rllm-org / rllm

Democratizing Reinforcement Learning for LLMs

machine-learning reinforcement-learning tinker distributed-training ml-infrastructure ml-platform agent-framework search-agent llm-training llm-reasoning agentic-workflow swe-agent verl coding-agent

Updated Apr 4, 2026
Python

inclusionAI / AReaL

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

agent reinforcement-learning rl machine-learning-systems mlsys llm llm-agent llm-reasoning

Updated Apr 3, 2026
Python

Gen-Verse / MMaDA

MMaDA - Open-Sourced Multimodal Large Diffusion Language Models (dLLMs with block diffusion, mixed-CoT, unified RL)

diffusion-models llm-reasoning unified-multimodal-understanding-and-generation

Updated Feb 14, 2026
Python

YangLing0818 / buffer-of-thought-llm

[NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models

large-language-models chain-of-thought-reasoning retrieval-augmented-generation llm-reasoning

Updated Jun 28, 2025
Python

reasoning-survey / Awesome-Reasoning-Foundation-Models

✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models

reasoning multimodal foundation-models llm reasoning-agent llm-reasoning reasoning-language-models

Updated Jun 16, 2025

yinizhilian / ICLR2025-Papers-with-Code

A collection of ICLR papers and open-source projects over the years, covering ICLR2021, ICLR2022, ICLR2023, ICLR2024, and ICLR2025.

python machine-learning transformer gpt nlp-machine-learning nlp-keywords-extraction iclr2021 paperwithcode iclr2022 llms iclr2023 llm-agent llm-training gemmini llm-framework iclr2024 llm-reasoning llama3 deep-learning-paper

Updated Mar 14, 2025

Gen-Verse / dLLM-RL

[ICLR 2026] Official code for TraceRL: Revolutionizing post-training for Diffusion LLMs, powering the SOTA TraDo series.

reinforcement-learning-algorithms code-generation large-language-models rlhf llm-reasoning mathmatical-reasoning diffusion-language-models

Updated Jan 28, 2026
Python

Gen-Verse / Open-AgentRL

RLAnything & DemyAgent: General and scalable agentic RL algorithms across terminal, GUI, SWE, and tool-call settings

reinforcement-learning ppo multi-agent-reinforcement-learning rlhf llm-agent llm-reasoning entropy-method gui-agent grpo coding-agent agent-rl

Updated Feb 27, 2026
Python

IAAR-Shanghai / Awesome-Attention-Heads

An awesome repository & A comprehensive survey on interpretability of LLM attention heads.

awesome survey transformer gpt attention-mechanism research-paper circuit-analysis interpretability cognitive-neuroscience visualization-tools large-language-models llm chain-of-thought llm-reasoning machine-psychology attention-head-mining

Updated Mar 2, 2025
TeX

DebarghaG / proofofthought

Proof of thought: LLM-based reasoning using Z3 theorem proving with multiple backend support (SMT2 and JSON DSL)

z3 automated-reasoning trustworthy-ai llm llm-inference llm-reasoning

Updated Apr 2, 2026
Python

inclusionAI / Ling

Ling is a MoE LLM provided and open-sourced by InclusionAI.

machine-learning rl moe llm llm-reasoning

Updated May 14, 2025
Python

mangopy / SearchLM

Official code for the NeurIPS 2025 paper "Iterative Self-Incentivization Empowers Large Language Models as Agentic Searchers"

rag large-language-models retrieval-augmented-generation llm-reasoning

Updated Jan 14, 2026
Python

Peiyang-Song / Awesome-LLM-Reasoning-Failures

Repo for "Large Language Model Reasoning Failures"

failure-analysis logical-reasoning physical-reasoning llm llm-reasoning formal-reasoning cognitive-reasoning embodied-reasoning informal-reasoning

Updated Feb 17, 2026

YangLing0818 / SuperCorrect-llm

[ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction

reflection self-correction dpo llm llm-reasoning

Updated Mar 23, 2025
Python

pearls-lab / meow-tea-taro

A Practitioner's Guide to M(eow)ti Turn Agentic ReinfOrcement learning

reinforcement-learning post-training llms llm-reasoning agentic-ai

Updated Jan 16, 2026
Python

pittisl / PhyT2V

Official code repo for the CVPR 2025 paper PhyT2V: LLM-Guided Iterative Self-Refinement for Physics-Grounded Text-to-Video Generation

video-generation diffusion-models prompt-tuning llm-reasoning cvpr2025

Updated Jul 31, 2025
Python

Trae1ounG / BuPO

[arxiv: 2512.19673] Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies

rl interpretability llms llm-reasoning verl

Updated Feb 6, 2026
Python

falonss703 / Awesome-Uncertainty-based-Reinforcement-Learning

🔥🔥🔥 Latest Papers and Code on Uncertainty-based RL

unsupervised-learning uncertainty-analysis rainforcement-learning llm-reasoning mllm-reasoning

Updated Aug 24, 2025

vstorm-co / pydantic-ai-todo

Task Planning and Tracking toolset for Pydantic AI agents, enabling hierarchical task management with subtasks, PostgreSQL storage for multi-tenancy, and an event system for webhooks and callbacks.

python todo ai gemini toolset autonomous-agents ai-agents task-management pydantic llm chatgpt anthropic llm-reasoning ai-agent-framework pydantic-ai deepagents pydantic-deep

Updated Mar 31, 2026
Python

MozerWang / AMPO

[ICLR 2026] Adaptive Social Learning via Mode Policy Optimization for Language Agents

agent large-language-models reasoning-agent llm-reasoning reasoning-language-models long-cot sotopia

Updated Feb 2, 2026
Python
