Disable Plone Archetypes index / convert doc / pdf files

If I rebuild my directory in plone, I will get a lot of such information:

2010-02-18T11:26:09 INFO Archetypes Error while trying to convert file contents to 'text/plain' in <Field file(file:rw)>.getIndexable() of <ATFile at /site/test1/test.doc>: Unable to find binary "wvHtml" in /sbin:/usr/sbin:/usr/local/sbin:/usr/local/bin:/usr/bin:/bin:/usr/X11R6/bin:/usr/games:/usr/lib/jvm/jre/bin 

This happens with .doc and .pdf files.

I do not want to convert documents or PDF files. How can I completely disable it?

+4
source share
3 answers

You can disable (delete) conversions in the ZMI tool / portal_transforms. If Plone does not find the conversion from Word to plain text or PDF to plain text, it will not try to index the contents of the file. Go to ZMI / portal_transforms and delete the word_to_html element.

+4
source

It tries to index your doc / pdf files by first converting them to HTML files. Just install wv (ware). The message will disappear and all your DOC and PDF files will be indexed and appear in the search results.

On Ubuntu / Debian: sudo apt-get install wv

Is there a reason you don't want to index PDF documents or Word documents?

+2
source

I do not want to convert documents or PDF files. How can I disable it completely? [So in the original]

Well, you are not currently indexing them, so nothing will change.

The above message is an INFO message, this is not an error. If you do not want to see them in your event log, you can change the level of the event log in your assembly configuration or zope.conf

from documents:

 event-log-level Set the level of the console output for the event log. Level may be any of CRITICAL, ERROR, WARN, INFO, DEBUG, or ALL. Defaults to INFO. 
+1
source

Source: https://habr.com/ru/post/1301693/


All Articles