RENT (Reinforcement Learning via Entropy Minimization) is an unsupervised method for training reasoning LLMs.
Updated Oct 31, 2025 - Python
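To make the idea of an entropy-based training signal concrete, here is a minimal sketch. It is not taken from the repository. It assumes the reward is the negative mean entropy of the model's per-token output distributions, and the function names (`softmax`, `token_entropy`, `negative_entropy_reward`) are hypothetical, chosen for illustration only.

```python
import math

def softmax(logits):
    """Convert raw logits into a probability distribution (numerically stable)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def token_entropy(logits):
    """Shannon entropy (in nats) of the distribution implied by the logits."""
    probs = softmax(logits)
    return -sum(p * math.log(p) for p in probs if p > 0)

def negative_entropy_reward(per_token_logits):
    """Assumed reward: negative mean entropy over the generated tokens.

    A more confident (lower-entropy) response receives a higher reward,
    so no ground-truth labels are needed.
    """
    entropies = [token_entropy(logits) for logits in per_token_logits]
    return -sum(entropies) / len(entropies)

# A peaked distribution is rewarded more than a uniform one.
confident = negative_entropy_reward([[10.0, 0.0, 0.0]])
uncertain = negative_entropy_reward([[0.0, 0.0, 0.0]])
print(confident > uncertain)
```

In an actual training loop this scalar would be passed to a policy-gradient optimizer in place of a label-based reward; that part is omitted here.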
Byte-pair encoding (BPE) algorithm used in GPT tokenization, based on the paper "Language Models are Unsupervised Multitask Learners"
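The core of byte-pair encoding is a loop that repeatedly merges the most frequent adjacent symbol pair. The sketch below illustrates that loop. It is not the repository's code, and it works on characters for readability, whereas GPT-2 applies the same idea to bytes. The helper names (`most_frequent_pair`, `merge_pair`, `learn_bpe`) are hypothetical.

```python
from collections import Counter

def most_frequent_pair(words):
    """Return the most frequent adjacent symbol pair, weighted by word count."""
    pairs = Counter()
    for symbols, freq in words.items():
        for a, b in zip(symbols, symbols[1:]):
            pairs[(a, b)] += freq
    return max(pairs, key=pairs.get) if pairs else None

def merge_pair(words, pair):
    """Replace every occurrence of `pair` with a single merged symbol."""
    merged = {}
    for symbols, freq in words.items():
        out, i = [], 0
        while i < len(symbols):
            if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == pair:
                out.append(symbols[i] + symbols[i + 1])
                i += 2
            else:
                out.append(symbols[i])
                i += 1
        key = tuple(out)
        merged[key] = merged.get(key, 0) + freq
    return merged

def learn_bpe(corpus, num_merges):
    """Learn up to `num_merges` merge rules from a whitespace-split corpus."""
    words = dict(Counter(tuple(w) for w in corpus.split()))
    merges = []
    for _ in range(num_merges):
        pair = most_frequent_pair(words)
        if pair is None:  # every word is already a single symbol
            break
        merges.append(pair)
        words = merge_pair(words, pair)
    return merges, words

merges, words = learn_bpe("low low low lower lowest", 2)
print(merges)
```

Ties between equally frequent pairs are broken here by first occurrence, which is a simplification; real tokenizers define their own tie-breaking and store the merge list as the vocabulary.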
Joke-telling AI assistant developed using Python and large language models (LLMs).
A high-performance, enterprise-grade universal ebook translation toolkit supporting any ebook format and any language pair, featuring multiple translation engines, REST API with HTTP/3 support, and real-time WebSocket events.