NPU-accelerated AI • Edge & Android inference • GenAI enablement & optimization
I'm an AI Software Developer at Intel working on the OpenVINO toolkit, specializing in NPU-accelerated AI and edge/Android deployment using Intel AI technologies. My work focuses on enabling and optimizing cutting-edge AI models for Intel hardware platforms.
Key Contributions & Achievements:
- 🚀 Enabled and optimized state-of-the-art models including Gemini for Intel NPUs with quantization, performance tuning, and accuracy validation
- 🔧 Enabled OpenVINO support for LiteRT models, bridging Intel AI technologies with TensorFlow Lite
- 📱 Contributed to Intel Android AI stack and Android Neural Network HAL operator implementations
- 🚗 Built Android Driver Monitoring System (DMS) application leveraging OpenVINO CV models for real-time inference
- 🎯 Designed AI Dispatcher module using OpenVINO + gRPC for scalable object detection and face detection services
- 🗣️ Research background in NLP and Reinforcement Learning, including Dialogue State Tracking for Hindi dialogue systems
Primary Languages:
Technologies & Frameworks:
- Distributed AI inference system using OpenVINO + gRPC
- Android Neural Network HAL implementation for Intel platforms
- Chrome AI integration with Gemini Nano for on-device GenAI
- 💼 LinkedIn: ratnesh-kumar-rai-2a4004139
- 📧 Work Email: ratnesh.kumar.rai@intel.com
- 📧 Personal Email: ratn.dav2@gmail.com

