feat: Add Text Embeddings Inference (TEI) provider support #60
Merged
jayscambler merged 5 commits into main on Jul 2, 2025
- Fix pylance dependency (was incorrectly 'lance' in v0.1.1)
- Fix 38 failing integration tests with API corrections
- Add 'member_of' to valid relationship types for collections
- Fix custom metadata string validation issues
- Implement Lance v0.30.0 vector search bug workaround
- Fix UUID property access and len() usage on datasets
- Improve error messages with field context and helpful hints

feat: Add new features for better developer experience
- Add full-text search index creation (create_fts_index method)
- Add UUID override support at creation time
- Add auto-indexing option for full-text search
- Enhance create_scalar_index with index type support
- Reorganize tests into unit/ and integration/ structure

docs: Update documentation and changelog
- Add comprehensive CHANGELOG entry for v0.1.2
- Add migration guide (docs/migration/api-changes-v012.md)
- Add API improvements roadmap (docs/roadmap/api-improvements-v02.md)
- Update API reference documentation

BREAKING CHANGE: Replaced LlamaIndex text splitter with semantic-text-splitter
- Add TEIProvider class for high-performance embedding inference
- Support both local and remote TEI server instances
- Add httpx as optional dependency for lightweight HTTP client
- Update factory function to support provider_type='tei'
- Add comprehensive documentation and examples
- Include unit tests with mocks for TEI functionality
- Support for authentication, retries, and health checks

TEI provides optimized inference for 100+ open-source models with:
- Flash Attention and dynamic batching
- GPU/CPU hardware acceleration
- Production-ready monitoring and metrics
- Self-hosted deployment for data privacy

Implements CFOS-46
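The "retries and health checks" item above can be pictured with a small sketch. It assumes a TEI server that answers GET /health with HTTP 200 when ready; the function names, the localhost:8080 address, and the exponential backoff schedule are illustrative assumptions, not the code added in this commit (which uses httpx rather than the standard library).

```python
import time
import urllib.error
import urllib.request


def tei_is_healthy(base_url="http://localhost:8080", timeout=5.0):
    """Return True if the server's /health route answers with HTTP 200.

    base_url is an assumed local address; adjust it for your deployment.
    """
    url = base_url.rstrip("/") + "/health"
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.status == 200
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, or an HTTP error status.
        return False


def wait_until_healthy(probe=tei_is_healthy, attempts=5, base_delay=0.5,
                       sleep=time.sleep):
    """Call probe() up to `attempts` times with exponential backoff.

    The delay doubles after each failed attempt (base_delay, 2x, 4x, ...).
    `probe` and `sleep` are injectable so the loop can be tested offline.
    """
    for attempt in range(attempts):
        if probe():
            return True
        if attempt < attempts - 1:
            sleep(base_delay * 2 ** attempt)
    return False
```

Calling `wait_until_healthy()` before the first embedding request is one way to avoid failing fast while a freshly started container is still loading its model.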
- Add comprehensive TEI setup guide covering hardware requirements, installation methods, and troubleshooting
- Include Docker, Docker Compose, and Kubernetes deployment examples
- Add security considerations and performance tuning tips
- Document NumPy 2.x compatibility issues with PyArrow
- Link from main embedding providers doc to setup guide
- Upgrade PyArrow from 14.0.2 to >=17.0.0 for better NumPy compatibility
- Pin NumPy to 1.x series (numpy>=1.24,<2) to avoid NumPy 2.x issues
- Resolves 'numpy.core.multiarray failed to import' errors
- Fixes test environment and development workflows
…embeddings-inference-tei-support-to-embeddings
Summary
Adds support for Hugging Face's Text Embeddings Inference (TEI) server as an embedding provider in ContextFrame, providing high-performance, self-hosted embeddings for 100+ open-source models.
Changes
- TEIProvider class implementing the EmbeddingProvider interface
- Factory function support for provider_type="tei"
Features
Example Usage
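As a rough illustration of what a TEI-backed provider does under the hood, the sketch below calls a TEI server's /embed route directly using only the Python standard library. It is not ContextFrame's TEIProvider API: the function names, the localhost:8080 address, and the optional bearer token are assumptions for illustration, and the PR itself uses httpx as the HTTP client.

```python
import json
import urllib.request


def build_embed_request(texts, base_url="http://localhost:8080", api_key=None):
    """Build (but do not send) a POST request for the /embed route.

    texts: iterable of strings to embed.
    api_key: optional token, sent as a Bearer header if the server
             was started with authentication enabled (assumed setup).
    """
    headers = {"Content-Type": "application/json"}
    if api_key:
        headers["Authorization"] = f"Bearer {api_key}"
    body = json.dumps({"inputs": list(texts)}).encode("utf-8")
    return urllib.request.Request(
        base_url.rstrip("/") + "/embed",
        data=body,
        headers=headers,
        method="POST",
    )


def embed(texts, base_url="http://localhost:8080", api_key=None, timeout=30.0):
    """Send the request and return one embedding (list of floats) per text."""
    req = build_embed_request(texts, base_url, api_key)
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Requires a TEI server listening on the assumed address.
    vectors = embed(["What is ContextFrame?", "Self-hosted embeddings"])
    print(len(vectors), "embeddings of dimension", len(vectors[0]))
```

Splitting request construction from sending keeps the payload logic testable without a running server, which mirrors the mock-based testing approach described in this PR.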
Benefits
Testing
The implementation includes comprehensive unit tests using mocks. Note: The current test suite has NumPy compatibility issues that are unrelated to this PR and will be addressed separately.
Related
Docker Setup