Andy Scherzinger 636dc03b69
Merge pull request #66 from nextcloud/automated/update-workflows/default
chore: update workflows from templates
2025-03-03 18:45:23 +01:00

files_fulltextsearch_tesseract

OCR your documents before indexing

Installation / Setup

  • install Tesseract

  • download language files from: https://github.com/tesseract-ocr/tessdata

  • copy language files into /usr/share/tessdata/ (or /usr/share/tesseract-ocr/tessdata/, depending on your distribution)

  • configure this app in the Full text search Admin panel

  • report bugs
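
Before configuring the app, it can help to confirm that the language files ended up in one of the directories listed above. The following is a minimal sketch, not part of this app: the helper name `find_language_file` is hypothetical, and it assumes the files keep the `<lang>.traineddata` naming used in the tessdata repository (for example `eng.traineddata`).

```python
from pathlib import Path
from typing import Iterable, Optional

# Directories named in the setup steps; which one applies depends on the distribution.
TESSDATA_DIRS = (
    Path("/usr/share/tessdata"),
    Path("/usr/share/tesseract-ocr/tessdata"),
)


def find_language_file(lang: str, dirs: Iterable[Path] = TESSDATA_DIRS) -> Optional[Path]:
    """Return the first <lang>.traineddata found in dirs, or None if absent.

    Hypothetical helper for illustration; it only checks that the file exists,
    not that Tesseract can actually load it.
    """
    for directory in dirs:
        candidate = Path(directory) / f"{lang}.traineddata"
        if candidate.is_file():
            return candidate
    return None


if __name__ == "__main__":
    # Example language codes; replace with the ones you downloaded.
    for lang in ("eng", "fra"):
        found = find_language_file(lang)
        print(f"{lang}: {found if found else 'not installed'}")
```

If a language is reported as not installed, the file is probably in a different directory than the two checked here, or was not copied at all.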

more

devblog about PDF and OCR: https://daita.github.io/files-fulltextsearch-tesseract-ocr-pdf/