i love getting my hands dirty by training deep neural networks and optimizing them. lately, i’ve been focusing on reinforcement learning. portfolio: click here
Pinned Loading
-
reinforcement-learning-agents
reinforcement-learning-agents Publica collection of advanced reinforcement learning (rl) agents and implementations, including dqn, actor-critic, ppo, dpo, and more. provides reference code, algorithmic insights, and setups for resea…
-
Transformer-from-scratch.
Transformer-from-scratch. Publicattention is all you need — pytorch implementation of the original transformer architecture for english to nepali neural machine translation (nmt), achieving around 27 bleu score.
-
openai-gpt-oss
openai-gpt-oss PublicA PyTorch reimplementation of OpenAI’s GPT OSS model. Designed for research, experimentation, and learning, featuring MoE layers, mixed-precision training, and modular components for easy customiza…
Jupyter Notebook 1
-
model-reincarnated
model-reincarnated Publica collection of re-implementations of renowned artificial intelligence models and architectures from foundational research papers.
Jupyter Notebook 1
-
-
LLaMA-2-from-Scratch
LLaMA-2-from-Scratch Publicllama-2 from scratch — a clean, educational pytorch implementation of the llama-2 transformer architecture. features grouped query attention (gqa), rotary position embeddings (rope), kv caching, an…
Python 1
If the problem persists, check the GitHub status page or contact support.
