I am just wondering if anyone knew of good libraries for parsing .doc files (and similar formats, for example .odt) to extract text, and also save formatting information where possible for display on a website.
Being able to do the same for PDF files would be a bonus, but I don't have much for that.
This is for the Rails project, if that helps at all.
Thanks in advance!
source share