building autonomous systems at Vesper Dynamics. RL, edge deployment, alignment research.
San Jose β San Francisco Β· colonel1223.net
most of my time right now goes to:
- real-time ML inference on edge hardware (quantization, pruning, latency optimization)
- hierarchical RL policies for embodied agents
- figuring out why alignment guarantees break at scale (formal models)
some things I've built:
- learned-reranker β hybrid retrieval + neural re-ranking, +36% NDCG@10 |
- conformal-multimodal β distribution-free uncertainty quantification |
- CHIMERA β 847K traces showing hallucination is information-theoretic |
- agentic-rag-diagnostics β closed-loop retrieval agent |
python c++ pytorch rl edge deployment
ζ₯ζ¬θͺγθ©±γγΎγ :-)