一款简单易用和高性能的端侧AI部署框架 | An Easy-to-Use and High-Performance Edge AI Deployment Framework
-
Updated
Nov 1, 2025 - C++
一款简单易用和高性能的端侧AI部署框架 | An Easy-to-Use and High-Performance Edge AI Deployment Framework
A mock Azure OpenAI API for seamless testing and development, supporting both streaming and non-streaming responses. Easily emulate OpenAI completions with token-based streaming in a local or Dockerized environment.
Mechanistic analysis of a GPT-2–like model exploring the compositionality gap in transformers. Using Logit Lens and Causal Tracing, the study identifies and mitigates a deep-layer bottleneck via dataset enhancement to improve logical reasoning.
Comprehensive guide to FastAPI, Pydantic, and SQLAlchemy for AI engineers. Learn API design, validation, and ORM workflows with practical examples and setup 🐙
Multi-agentic researcher (RAG)
A Streamlit-based spam classifier that predicts whether a message is spam or not spam using machine learning.
Compare PyTorch vs Triton inference latency with CLI tools, benchmarks, and performance plots.
Add a description, image, and links to the ai-deployment topic page so that developers can more easily learn about it.
To associate your repository with the ai-deployment topic, visit your repo's landing page and select "manage topics."