Skip to content
View Shumatsurontek's full-sized avatar
💭
Spelunking
💭
Spelunking

Block or report Shumatsurontek

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Shumatsurontek/README.md

Arthur — AI/LLM Engineer

Building production LLM systems at Tiime | MS Electrical Engineering, INSA Lyon

I design and ship agent architectures, retrieval pipelines, and evaluation frameworks for real-world accounting automation.

What I work on

Agents        Langgraph multi-agent workflows, dynamic model routing, lazy skill loading
Retrieval     Hybrid search (BM25 + vector), Cohere reranking, Bayesian confidence scoring
Evaluation    LLM-as-judge, golden datasets, batch comparison across model families
Infrastructure  Prompt caching (Bedrock), SingleFlight embedding dedup, OpenSearch indexing

Selected research & experiments

Project What
neo-deep-agent-lab Deep agent SQL lab with Modal serverless deployment
neo-neo-sn62 Bittensor SN62 — decentralized AI subnet mining & validation
steering-research Where, how, and why activation steering works on instruction-tuned LLMs
Gptq-Babai-Quantization Visualization of GPTQ lattice reduction for weight quantization
sparse-moe-torch Sparse Mixture-of-Experts routing visualization

Technical interests

  • Mechanistic interpretability — activation steering, feature circuits, sparse probing
  • Quantization — GPTQ, Babai lattice reduction, mixed-precision strategies
  • Mixture of Experts — routing policies, load balancing, capacity factors
  • Information Retrieval — BM25/vector fusion, learned sparse representations, reciprocal rank fusion
  • Reinforcement Learning — policy gradients, RLHF/DPO alignment, reward modeling

Stack

Python FastAPI Langgraph PostgreSQL pgvector OpenSearch AWS Bedrock Cohere Celery SQLAlchemy


Bordeaux, France

Pinned Loading

  1. steering-research steering-research Public

    A deep dive into where, how, and why activation steering works (or doesn't) on instruction-tuned LLMs

    Python 2

  2. neo-deep-agent-lab neo-deep-agent-lab Public

    deepagent-sql-lab with modal implementation

    Python

  3. local-agent local-agent Public

    Python 1