GitHub / Envinorma / pdf_ocr_app
A ready-to-deploy Dash application for parsing PDF files with Tesseract
JSON API: https://data.code.gouv.fr/api/v1/hosts/GitHub/repositories/Envinorma%2Fpdf_ocr_app
étoiles: 4
forks: 2
issues ouvertes: 0
licence: mit
langage: Python
taille: 403 ko
dépendances analysées:
30
date de création: il y a environ 4 ans
date de mise à jour: il y a 4 mois
enregistré: il y a presque 4 ans
dernière synchronisation: il y a 6 jours
- actions/checkout v1 composite
- actions/setup-python v1 composite
- codecov/codecov-action v1 composite
requirements-dev.txt
pypi
- black ==20.8b1 development
- codecov >=2.1.4 development
- flake8 ==3.8.4 development
- ipython ==7.19.0 development
- mypy ==0.800 development
- pre-commit ==2.9.3 development
- pylint ==2.6.0 development
- pytest ==6.2.1 development
- pytest-cov >=2.9.0 development
- pytest-mypy ==0.8.0 development
- pytest-raises >=0.11 development
- pytest-runner >=5.2 development
requirements.txt
pypi
- Unidecode ==1.0.23
- alto-xml ==0.0.3
- beautifulsoup4 ==4.8.2
- dash ==1.17.0
- dash-bootstrap-components ==0.11.1
- gunicorn ==20.0.4
- lxml ==4.6.2
- opencv-python-headless ==4.5.1.48
- pdf2image ==1.14.0
- pytesseract ==0.3.7
- requests ==2.25.1
- requests-oauthlib ==1.3.0
- scipy ==1.6.1
- tesseract-ocr-utils ==0.0.4
- textdistance ==4.2.1