add classify command, dedup --local, update embed default to v5#1

Merged: hanxiao merged 1 commit into main from add-classify-and-improvements on Mar 14, 2026

@hanxiao hanxiao commented Mar 14, 2026

  • Add jina classify command (labels + score, stdin pipe support)
  • Add --local mode for dedup using local embeddings
  • Update embed default model from v3 to v5-text-small
  • Add pipe integration tests
  • Update README

- change embed default model from jina-embeddings-v3 to jina-embeddings-v5-text-small
- add classify subcommand: POST /v1/classify with --labels, stdin support
- add --local flag to dedup using local embeddings + cosine similarity
- extract _deduplicate_from_embeddings to share between API and local dedup
- add classify and pipe integration tests
- update README with classify docs, dedup --local, model reference
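The PR does not show how the shared deduplication helper works, so the following is only a minimal sketch of one common approach: greedy deduplication over precomputed embeddings using cosine similarity. The function name `dedup_by_cosine`, the `threshold` parameter, and its default of 0.9 are hypothetical and are not taken from this repository; the actual `_deduplicate_from_embeddings` may differ.

```python
import math
from typing import List, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def dedup_by_cosine(
    items: Sequence[str],
    embeddings: Sequence[Sequence[float]],
    threshold: float = 0.9,  # hypothetical default, not from the PR
) -> List[str]:
    """Keep an item only if it is less similar than `threshold` to every item kept so far.

    Works the same way whether the embeddings came from a remote API or a
    local model, which is why the logic can be shared between both modes.
    """
    if len(items) != len(embeddings):
        raise ValueError("items and embeddings must have the same length")

    kept: List[int] = []  # indices of items retained so far, in input order
    for i, emb in enumerate(embeddings):
        is_duplicate = any(
            cosine_similarity(emb, embeddings[j]) >= threshold for j in kept
        )
        if not is_duplicate:
            kept.append(i)
    return [items[i] for i in kept]


# Toy 2-D vectors standing in for real embeddings.
texts = ["hello world", "hello, world!", "goodbye"]
vectors = [[1.0, 0.0], [0.99, 0.05], [0.0, 1.0]]
unique = dedup_by_cosine(texts, vectors)
print(unique)
```

The greedy pass keeps the first occurrence of each near-duplicate group and compares each candidate against the kept items only, which is quadratic in the worst case but simple enough for CLI-sized inputs.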

Co-Authored-By: Claude <noreply@anthropic.com>
@hanxiao hanxiao merged commit c4c2c36 into main Mar 14, 2026
1 check passed
@hanxiao hanxiao deleted the add-classify-and-improvements branch March 14, 2026 10:25