Package Usage: maven: org.apache.tika:tika-parsers
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
61 versions
Latest release: about 1 year ago
310 dependent packages
View more package details: https://packages.ecosystem.code.gouv.fr/registries/repo1.maven.org/packages/org.apache.tika:tika-parsers
Dependent Repos 6
hatvp-open/agora
'AGORA' : répertoire numérique des représentants d'intérêtsLast synced: 6 days ago - Pushed: 11 months ago

lutece-platform/lutece-tech-plugin-parser
This plugin exposes a service for parsing streams using tika and pdfbox project.Size: 11.7 KB - Last synced: about 17 hours ago - Pushed: over 2 years ago

BRGM/daobs Fork of ClusterGIS-CAP/daobs
Data analyzer and observerSize: 10.6 MB - Last synced: about 4 hours ago - Pushed: about 7 years ago

ProgrammeVitam/sedatools
Bibliothèque SEDA et exemples d'usageSize: 69.5 MB - Last synced: 6 days ago - Pushed: 7 months ago

ProgrammeVitam/mailextract 📦
!!!! Version dépréciée car intégré désormais dans sedatools !!! Outil d'extraction de boite de messagerie (imap(s), thunderbird, outlook .pst, .msg...) générant une hiérarchies de fichiers correspondant aux ArchiveUnits à créer (format pivot avec generator-seda)Size: 3.61 MB - Last synced: 6 days ago - Pushed: about 6 years ago

Bibliome/alvisnlp
ALvisNLP corpus processing engineSize: 51.9 MB - Last synced: 6 days ago - Pushed: 6 months ago
