Fuzzy matching and more functionality for spaCy.
- 
            Updated
            
Jul 6, 2024  - Python
 
Fuzzy matching and more functionality for spaCy.
DuckDB Community Extension adding RapidFuzz algorithms for search, deduplication, and record linkage.
Fast Batch String Matching in Python (Levenshtein, Jaro-Winkler, Hamming) with Zero Cache Misses - made for Python, written in C++
Fast Scalable Dedupe - Fuzzy Matching With Opensearch + nmslib + Rapidfuzz
Guts of FantasyNameSearch.com
A simple and efficient spelling correction system that uses Python's rapidfuzz library to find and correct misspelled sentences by matching them with the closest correct ones from a given dataset.
🟡Processing
The repository is a duplicate of the local folder which contains codes created by Yuanzhan Gao (yg8ch@virginia.edu) to conduct scaled fuzzy matching procedure on EIDL and PPP dataset. Please see the README file for more information.
Phoenix II Discord Bot
Performs OCR on a list of images using Tesseract and performs fuzzy string matching with a given list of strings.
DEMO: extract media tags with Spotify API to relational Docker backend
Cleaned and transformed Netflix dataset using Python (Pandas, RapidFuzz) for visual analysis in Power BI.
Intuitive way of using fuzz matching in pandas
Binary fuzzy matching in all file types [fzf (pre-filter)/rapidfuzz (finds the best result)]
Add a description, image, and links to the rapidfuzz topic page so that developers can more easily learn about it.
To associate your repository with the rapidfuzz topic, visit your repo's landing page and select "manage topics."