fuzzywuzzy
Fuzzy String Matching in Python
This is the archived original home of a Python fuzzy string matching library. The project has been renamed to TheFuzz and moved to a new GitHub location, all active development happens there.
This repository was a Python library for fuzzy string matching, which is the ability to compare two pieces of text and find a similarity score even when they are not identical. This is useful for tasks like matching a user's search term to a list of items when typos are present, or finding which entry in a database most closely matches an incoming record that might be spelled slightly differently.
The repository has been renamed and moved. It is now called TheFuzz and lives at a different GitHub address. The README is short and only explains this transition: version 0.19.0 of TheFuzz corresponds to version 0.18.0 of the original project, with the main difference being the name change throughout the code. New issues or pull requests should be submitted to the TheFuzz repository rather than here.
There is no further documentation in this repository about how the library works or what functions it provided. Anyone looking to use or contribute to the project should follow the link in the README to the current TheFuzz repository.
Where it fits
- Find the closest matching string from a list even when there are typos or slight spelling differences.
- Match incoming database records to existing entries where field values may be spelled or abbreviated differently.