51 Text Processing packages and projects
-
Lark
8.0 6.0 PythonLark is a parsing toolkit for Python, built with a focus on ergonomics, performance and modularity. -
TextDistance
7.0 4.1 Python📐 Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage. -
msgspec
6.9 8.8 PythonDISCONTINUED. A fast serialization and validation library, with builtin support for JSON, MessagePack, YAML, and TOML [Moved to: https://github.com/msgspec/msgspec] -
jellyfish
6.0 4.7 Jupyter Notebook🪼 a python library for doing approximate and phonetic matching of strings. -
Data Profiler
5.4 5.8 PythonWhat's in your data? Extract schema, statistics and entities from datasets -
python-user-agents
5.4 0.0 L4 PythonA Python library that provides an easy way to identify devices like mobile phones, tablets and their capabilities by parsing (browser) user agent strings. -
Levenshtein
5.0 0.0 L1 CThe Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity -
Construct
4.7 2.7 PythonConstruct: Declarative data structures for python that allow symmetric parsing and building -
python-nameparser
4.2 3.3 L2 PythonA simple Python module for parsing human names into their individual components -
AnyAscii
3.2 3.7 KotlinUnicode to ASCII transliteration - C Elixir Go Java JS Julia PHP Python Ruby Rust Shell .NET -
json-streamer
2.7 5.9 PythonA fast streaming JSON parser for Python that generates SAX-like events using yajl -
Efficient keyword mining with regular expressions
2.1 4.9 PythonEfficient string matching with regular expressions -
LLMWorkbook
0.7 8.1 PythonEffortlessly harness the power of LLMs on Excel and DataFrames—seamless, smart, and efficient! -
GoBeautifulSoup
0.3 3.3 PythonGoBeautifulSoup is a high-performance HTML/XML parsing library that provides a 100% compatible API with BeautifulSoup4, but powered by Go for dramatically improved performance. It's designed as a drop-in replacement for BeautifulSoup4 with significant speed improvements. -
Prompt Optimizer
0.3 8.1 PythonAutomated prompt optimization using mentor-agent architecture. Generate and refine prompts from labeled data. -
iban-tools
0.3 7.9 PythonComprehensive IBAN & BIC toolkit for Python — validate, parse, format, generate, and extract IBANs from text/PDF.
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
Promo
www.saashub.com
* Code Quality Rankings and insights are calculated and provided by Lumnify.
They vary from L1 to L5 with "L5" being the highest.