Selected Tags
Click on a tag to remove itMore Tags
Click on a tag to add it and filter downPDF packages
Showing projects tagged as Specific Formats Processing and PDF
-
PyPDF2
8.8 9.5 L2 PythonA pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files -
PyMuPDF
8.6 9.7 PythonPyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents. -
Kreuzberg
8.4 10.0 RustA polyglot document intelligence framework with a Rust core. Extract text, metadata, images, and structured information from PDFs, Office documents, images, and 97+ formats. Available for Rust, Python, Ruby, Java, Go, PHP, Elixir, C#, R, C, TypeScript (Node/Bun/Wasm/Deno)- or use via CLI, REST API, or MCP server. -
PDFMiner
8.3 0.0 L3 PythonDISCONTINUED. Python PDF Parser (Not actively maintained). Check out pdfminer.six. -
pdftabextract
6.4 0.0 L3 PythonA set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents. -
plutoprint
4.5 9.2 PythonA Python Library for Generating PDFs and Images from HTML, powered by PlutoBook -
Meltano Singer SDK
2.8 9.8 PythonWrite 70% less code by using the SDK to build custom extractors and loaders that adhere to the Singer standard: https://sdk.meltano.com
* Code Quality Rankings and insights are calculated and provided by Lumnify.
They vary from L1 to L5 with "L5" being the highest.