ajheshbasnet/README.md

i love getting my hands dirty by training deep neural networks and optimizing them. lately, i’ve been focusing on reinforcement learning. portfolio: click here

Pinned

  1. reinforcement-learning-agents (Public)

    a collection of advanced reinforcement learning (rl) agents and implementations, including dqn, actor-critic, ppo, dpo, and more. provides reference code, algorithmic insights, and setups for resea…

    Jupyter Notebook 19 1

  2. Transformer-from-scratch. (Public)

    attention is all you need: a pytorch implementation of the original transformer architecture for english-to-nepali neural machine translation (nmt), achieving a bleu score of around 27.

    Python 3 1

  3. openai-gpt-oss (Public)

    A PyTorch reimplementation of OpenAI’s GPT OSS model. Designed for research, experimentation, and learning, featuring MoE layers, mixed-precision training, and modular components for easy customiza…

    Jupyter Notebook 1

  4. model-reincarnated (Public)

    a collection of re-implementations of renowned artificial intelligence models and architectures from foundational research papers.

    Jupyter Notebook 1

  5. ViT-CIFAR10-Classifier (Public)

    Jupyter Notebook 3 1

  6. LLaMA-2-from-Scratch (Public)

    llama-2 from scratch — a clean, educational pytorch implementation of the llama-2 transformer architecture. features grouped query attention (gqa), rotary position embeddings (rope), kv caching, an…

    Python 1