Search found 17 matches

by bwakkie
Thu Sep 10, 2020 7:55 pm
Forum: Issues or Errors
Topic: difference between content and OCR of a document?
Replies: 3
Views: 226

Re: difference between content and OCR of a document?

Thanks for the explanation. I see the advantage for ocr-ing even when there is content. So when there is no content and there is ocr do I just leave it like that or should a workflow at one point move the ocr data to the content section?
by bwakkie
Thu Sep 10, 2020 7:50 pm
Forum: General topics
Topic: How to include metadata from external file
Replies: 2
Views: 2875

Re: How to include metadata from external file

Hi Michael, That's good to hear in import tool will be integrated at one point! In the end I placed all document in the watch folder (-+50Gb) and all where absorbed as Default Document Type. I am still very much a noob at workflow setup, but my though was to indeed create the matadata fields that wh...
by bwakkie
Tue Sep 08, 2020 12:28 pm
Forum: General topics
Topic: How to include metadata from external file
Replies: 2
Views: 2875

How to include metadata from external file

Hi all, I'm moving 9000 documents from FileMaker to Mayan EDMS. I have extracted all metadata from FileMaker in tsv format. Per line a number refers to the corresponding pdf file (3456.pdf 500.pdf 9234.pdf etc). The pdf's where stored on a regular filesystem. How do I import this metadata to the cor...
by bwakkie
Sat Aug 29, 2020 10:46 am
Forum: Deployments
Topic: Apache2 mod_wgsi not working
Replies: 3
Views: 1848

Re: Apache2 mod_wgsi not working

Solved it almost I just needed to link the static folder to the DocumentRoot: My DocumentRoot withing the apache config directs to: DocumentRoot /var/www/wakkie.org/htdocs inside the htdocs folder I pointe the static folder to the static folder within the mayan environment: # cd /var/www/mydomain.co...
by bwakkie
Thu Aug 27, 2020 3:34 pm
Forum: Deployments
Topic: cannot upload any document in centos8
Replies: 1
Views: 519

Re: cannot upload any document in centos8

Hi,

The documentation you refer to is not meant for installing docker images.
Here you need to go to: https://docs.mayan-edms.com/parts/insta ... tml#docker

The direct deployment documentation is meant for installation in python virtual environments

regards,
Bastiaan
by bwakkie
Thu Aug 27, 2020 2:07 pm
Forum: Issues or Errors
Topic: difference between content and OCR of a document?
Replies: 3
Views: 226

difference between content and OCR of a document?

Hi,

I am a bit confused about the difference between content and OCR. In my view if there is a content I do not need to OCR. But having them both is strange to me.

regards,
Bastiaan
by bwakkie
Thu Aug 27, 2020 1:53 pm
Forum: Feature requests
Topic: Remove duplicates tool
Replies: 4
Views: 1137

Re: Remove duplicates tool

I would suggest to merge the metadata of the document with the most metadata and then remove newer duplicates. When I have duplicates with different filenames they are not show together in the overview which makes it also difficult to decide which to keep manually. [https://gitlab.com/mayan-edms/may...
by bwakkie
Fri Jun 12, 2020 10:31 am
Forum: Issues or Errors
Topic: How to implement custom postprocess to de-hyphenation after OCR took place?
Replies: 1
Views: 474

How to implement custom postprocess to de-hyphenation after OCR took place?

Hi, To improve the text quality after OCR has finished how does mayan-edms process hyphened words? How would I implement a custom automated post-process that would simply run the following regex (vim)... :%s:\v([a-z])-\n([a-z]):\1\2: ... to remove hyphen where a sentence ends with a lowercase letter...
by bwakkie
Thu Jun 11, 2020 7:37 am
Forum: Issues or Errors
Topic: Leaving files in 'Watch Folder'
Replies: 12
Views: 4352

Re: Leaving files in 'Watch Folder'

Just a note I bounced in today why the watch_folder didn't work:
The redis password (as I generated one) included a ']'. This will block redis from running and therefore cerely.