I'm no expert, but my understanding is that the quality of the tool depends a lot on the specific application, and that a good deal of domain-specific customization and post-processing can be needed to do OCR well. So you may get better advice if you share a bit about what you're trying to do with it.
I've created a series of templates for different types of documents (passports, driver's licenses, receipts, insurance policies). I want to scan a document and have the OCR 1) determine which template it belongs to and 2) extract the relevant information to fill out the "form" (aka template), so the user doesn't have to.
I've already defined the fields that should be extracted for each template, e.g., for a passport:
- number
- name
- expiration
- country of issue
- place to attach a picture
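
To make the two steps concrete, here is a rough sketch of what I have in mind, assuming the OCR engine has already turned the scan into plain text. Everything in it is hypothetical: the `TEMPLATES` dict, the keyword lists, the regexes, and the `classify` / `extract` helpers are placeholders I made up for illustration, not patterns tested on real documents. The picture field is left out because it would need image cropping rather than text extraction.

```python
import re

# Hypothetical templates. The keywords and regexes are illustrative guesses,
# not patterns validated against real documents.
TEMPLATES = {
    "passport": {
        "keywords": ["passport", "nationality", "date of expiry"],
        "fields": {
            "number": r"passport no\.?\s*[:\-]?\s*([A-Z0-9]{6,9})",
            "name": r"name\s*[:\-]?\s*([A-Za-z' \-]+)",
            "expiration": r"date of expiry\s*[:\-]?\s*(\d{2}[./ -]\d{2}[./ -]\d{4})",
            "country of issue": r"country of issue\s*[:\-]?\s*([A-Za-z ]+)",
        },
    },
    "receipt": {
        "keywords": ["receipt", "subtotal", "total", "change"],
        "fields": {
            "total": r"\btotal\s*[:\-]?\s*\$?(\d+\.\d{2})",
        },
    },
}


def classify(text):
    """Step 1: pick the template whose keywords appear most often in the OCR text."""
    lowered = text.lower()
    scores = {
        name: sum(kw in lowered for kw in template["keywords"])
        for name, template in TEMPLATES.items()
    }
    best = max(scores, key=scores.get)
    # No keyword matched at all: report "unknown" instead of guessing.
    return best if scores[best] > 0 else None


def extract(text, template_name):
    """Step 2: run each field's regex; fields that do not match stay None."""
    result = {}
    for field, pattern in TEMPLATES[template_name]["fields"].items():
        match = re.search(pattern, text, flags=re.IGNORECASE)
        result[field] = match.group(1).strip() if match else None
    return result


# Made-up OCR output standing in for a scanned passport.
sample = """PASSPORT
Passport No: X1234567
Name: Jane Doe
Nationality: Utopian
Country of Issue: Utopia
Date of Expiry: 01/02/2030
"""

template = classify(sample)
fields = extract(sample, template) if template else {}
print(template, fields)
```

Fields that come back as `None` could simply be left blank in the form for the user to fill in by hand. Real OCR output will be noisier than this sample, so keyword counting and plain regexes are only a starting point.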