At my place of work, we have an outdated document management system, which for various reasons is no longer supported by developers. I was asked to study the extraction of documents contained in this system in order to eventually be imported into a new third-party system.
From tracking and monitoring processes, I determined that document images (mostly tiff files) are stored in the amount of 1.5 GB of files. These files are apparently read from a specific offset, and then written to the tmp file, which is then transferred through the web application to the client and then deleted.
I suppose I'm looking for suggestions on how I can scan these large files containing TIFF images, and ultimately extract and write them to separate files.
source
share