Depends on the quantity and quality of the source images. Managed code sets and image sets will work, but this is not always the best solution if you want to process several million images. For small parties and tight budgets, I agree with previous posters that projects such as Aforge, Paint.NET and other open source libraries will do the trick. Of course, you are on your own, if the results do not improve ... At least this will allow you to put everything you need in one application for low cost.
If you process several hundred thousand images a month, I would suggest you divide the process into a smaller step in the workflow and configure each of them until your cost per image becomes as close to zero as possible. You will find that the OCR results first grow rapidly and then level out sooner than you expected. (I'm not a big fan of OCR, but he has his place)
I am using Recogniform's commercial Windows product to process and clean images before OCR in batch mode using scripts configured for different types of images. If the image does not work with QC or is rejected by the OCR mechanism, it is manually restored using a special .NET application created using the Atalasoft tool. A package handles everything and deals only with what fails.
source share