Package Usage: pypi: pdfminer.six
PDF parser and analyzer
27 versions
Latest release: over 1 year ago
162 dependent packages
5,502,185 downloads last month
View more package details: https://packages.ecosystem.code.gouv.fr/registries/pypi.org/packages/pdfminer.six
Dependent Repos 7
etalab-ia/pdf_api Fork of Rob192/pdf_api
Size: 1.65 MB - Last synced: 1 day ago - Pushed: over 3 years ago

etalab/programme10pourcent-socratext
Size: 2.26 MB - Last synced: 6 days ago - Pushed: over 1 year ago

etalab-ia/ami-ia-dgs
Dépôt de code pour le projet AMI IA 2 de la DGS.Size: 38.1 MB - Last synced: 1 day ago - Pushed: over 2 years ago

aphp/edspdf
EDS-PDF is a generic, pure-Python framework for text extraction from PDF documents. It provides the machinery to use rule- or machine-learning-based approaches to classify text blocs between body and meta-data.Size: 8.93 MB - Last synced: 6 days ago - Pushed: 3 months ago


cedar/connection-studio
ConnectionStudio integrates highly heterogeneous data into graphs, enriched with extracted entities. Studio users can discover the entities in their data, navigate across connections between datasets, explore and query the data in many ways. The Studio currently supports: CSV, JSON, XML, RDF, text, property graphs, all Office formats, and PDF datasets. For more information, see: https://connectionstudio.inria.fr The scientific publications behind the platform: https://team.inria.fr/cedar/connectionlens/Last synced: 7 months ago
