
Slovenian OCR

Posted: Sun Feb 10, 2019 8:17 pm
by jevgl
Slovenian OCR support would be much appreciated.

Re: Slovenian OCR

Posted: Mon Feb 11, 2019 5:39 am
by rosarior
A Slovenian package is included in Ubuntu. Use:

sudo apt-get install tesseract-ocr-slv

to install it. Then select Slovenian as the language when you upload a document. You can also change the language of an existing document by using the "Edit document properties" action from the document view dropdown.
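
To confirm that Tesseract can see the new language data before uploading documents, the installed languages can be listed. This is a minimal sketch, assuming the tesseract binary is on the PATH; has_lang is a hypothetical helper name used only for illustration and is not part of Tesseract or Mayan.

```shell
# Hypothetical helper: succeeds if the language code given as $1
# appears as a whole line in the listing read from standard input.
has_lang() {
    grep -qx "$1"
}

# Older Tesseract releases print the language list to stderr,
# so both streams are merged before checking.
if tesseract --list-langs 2>&1 | has_lang slv; then
    echo "Slovenian language data is installed"
else
    echo "Slovenian language data not found" >&2
fi
```

If the second message appears, the package is probably not installed on the machine (or in the container) that actually runs the OCR.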

If you are using the Docker image, pass the Tesseract language package you wish to install via the MAYAN_APT_INSTALLS environment variable. Example:

-e MAYAN_APT_INSTALLS='tesseract-ocr-slv'
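
For context, a complete command might look like the following. This is only a sketch: the image name, the tag, the container name, the port mapping and the volume are assumptions that are not stated in this thread, so check them against the documentation for your version. Only the MAYAN_APT_INSTALLS option comes from the example above.

```shell
# Assumed values: image name and tag, container name, port mapping, volume.
# The -e MAYAN_APT_INSTALLS option is the one taken from this thread.
docker run -d \
    --name mayan-edms \
    -p 80:8000 \
    -v mayan_data:/var/lib/mayan \
    -e MAYAN_APT_INSTALLS='tesseract-ocr-slv' \
    mayanedms/mayanedms:latest
```

The package is installed when the container is created, so an existing container would likely need to be recreated with the new variable rather than just restarted.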

Documentation chapter on installing OCR languages: ... cr-backend