Does anyone know a good parser for document metadata in Python for Unix-like systems? In Java, Apache Tika is excellent.
No COM ... please :)
Thanks.
hachoir_metadata works great with Excel documents: http://bitbucket.org/haypo/hachoir/wiki/Home
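For what it's worth, here is a minimal sketch of how that might look. It assumes the current `hachoir` package from PyPI (Python 3), where the modules are `hachoir.parser` and `hachoir.metadata` rather than the older `hachoir_parser` / `hachoir_metadata` names. The helper name `extract_metadata_lines` and the file name in the usage note are just placeholders I made up.

```python
def extract_metadata_lines(path):
    """Return hachoir's plain-text metadata lines for a file, or [] if none."""
    # Imported lazily so the module still loads when hachoir is not installed.
    # Assumption: the modern "hachoir" package (pip install hachoir).
    from hachoir.parser import createParser
    from hachoir.metadata import extractMetadata

    parser = createParser(path)
    if parser is None:
        # hachoir could not recognise the file format.
        return []
    with parser:
        metadata = extractMetadata(parser)
    if metadata is None:
        return []
    # exportPlaintext() gives human-readable lines; it may return None when empty.
    return metadata.exportPlaintext() or []
```

Usage would be something like `for line in extract_metadata_lines("report.xls"): print(line)`, where `report.xls` is a hypothetical file.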
You do not need to use Jython to use Tika. You can call Java from Python using JCC. You can find decent instructions for this here.
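If building JCC is more trouble than it is worth, a different route (not JCC, and not something the answer above describes) is to run the Tika command-line jar as a subprocess and read its stdout. The sketch below assumes Java is on the PATH and that you have downloaded a `tika-app` jar; the path `tika-app.jar` and both function names are placeholders. It relies on the `--metadata` option of the Tika app, which prints one `key: value` pair per line.

```python
import subprocess

# Assumption: path to a downloaded tika-app jar; adjust to your setup.
TIKA_JAR = "tika-app.jar"


def parse_tika_metadata(text):
    """Turn Tika's 'key: value' output lines into a dict."""
    metadata = {}
    for line in text.splitlines():
        # Split on the first ': ' only, so keys such as 'dc:title' stay intact.
        key, sep, value = line.partition(": ")
        if sep:
            metadata[key.strip()] = value.strip()
    return metadata


def tika_metadata(path, jar=TIKA_JAR):
    """Run the Tika app on a file and return its metadata as a dict."""
    result = subprocess.run(
        ["java", "-jar", jar, "--metadata", path],
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if Tika exits with an error
    )
    return parse_tika_metadata(result.stdout)
```

Starting a JVM for every file is slow, so this only really suits small batches; for many documents the JCC approach keeps a single JVM alive inside the Python process.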
JCC setuptools, . c7 Ubuntu 10.04.
python stdout Tika.
, Jython, tika.
. , ( OpenOffice ), XLS . Tika Python, .
Source: https://habr.com/ru/post/1732238/