Topic: "extraction"
aphp/edspdf
EDS-PDF is a generic, pure-Python framework for text extraction from PDF documents. It provides the machinery to use rule- or machine-learning-based approaches to classify text blocs between body and meta-data.
Language: Python - Size: 8.93 MB - Last synced at: 5 days ago - Pushed at: 3 months ago - Stars: 47 - Forks: 7

healthdatahub/boas/hdh/extract_metadata
Extract-Metadata permet l'extraction de métadonnées à partir de bases de données tabulaires (au format .csv) et d’imagerie (au format DICOM). Les métadonnées extraites sont stockées au format JSON.
Last synced at: 5 days ago - Stars: 0 - Forks: 0