Python-based document metadata analyzer?

Does anyone know a good document metadata parser in Python for Unix-like systems? In Java, Apache Tika is excellent.

No COM ... please :)

Thanks.

+3
4 answers

hachoir_metadata works great with Excel documents: http://bitbucket.org/haypo/hachoir/wiki/Home

0

You do not need Jython to use Tika. You can call Java from Python using JCC. You can find decent instructions for this here.

JCC setuptools, . c7 Ubuntu 10.04.

python stdout Tika.
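One way to use Tika from Python without JCC or Jython is to run the Tika command-line application as a subprocess and read its standard output. A minimal sketch, assuming Java is installed and a Tika application jar is available: the path `tika-app.jar` and both function names are hypothetical, and the parser assumes the `--metadata` output is one `key: value` pair per line.

```python
import subprocess

# Hypothetical location of the Tika application jar; adjust for your system.
TIKA_JAR = "tika-app.jar"


def parse_tika_metadata(text):
    """Turn 'key: value' lines into a dict, skipping lines that do not fit."""
    metadata = {}
    for line in text.splitlines():
        # Split on the first ': ' so keys such as 'dcterms:created' stay whole.
        key, sep, value = line.partition(": ")
        if not sep:
            continue  # blank line or something that is not a key/value pair
        metadata[key.strip()] = value.strip()
    return metadata


def tika_metadata(path, jar=TIKA_JAR):
    """Run the Tika CLI on one file and return its metadata as a dict."""
    result = subprocess.run(
        ["java", "-jar", jar, "--metadata", path],
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if Tika exits with an error
    )
    return parse_tika_metadata(result.stdout)
```

Calling `tika_metadata("report.xls")` would start a new JVM for each file, which is slow for large batches, but it keeps the Python side free of any Java bindings.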

+3

, Jython, tika.

+1

. , ( OpenOffice ), XLS . Tika Python, .

+1

Source: https://habr.com/ru/post/1732238/

