mirror of
https://github.com/nextcloud/files_fulltextsearch_tesseract.git
synced 2025-10-26 15:05:31 +01:00
No description
|
|
||
|---|---|---|
| .github/workflows | ||
| appinfo | ||
| js | ||
| lib | ||
| LICENSES | ||
| templates | ||
| .gitignore | ||
| .scrutinizer.yml | ||
| AUTHORS.md | ||
| CHANGELOG.md | ||
| composer.json | ||
| composer.lock | ||
| LICENSE | ||
| Makefile | ||
| README.md | ||
| REUSE.toml | ||
files_fulltextsearch_tesseract
OCR your documents before index
Installation / Setup
-
install Tesseract
-
download language files from: https://github.com/tesseract-ocr/tessdata
-
copy language files into /usr/share/tessdata/ (or /usr/share/tesseract-ocr/tessdata/, depends on our distribution)
-
configure this app in the Full text search Admin panel
-
report bugs
more
devblog about PDF and OCR: https://daita.github.io/files-fulltextsearch-tesseract-ocr-pdf/