I have some 20+year old documents (originally typed with a typewriter) that were copied with an old photocopier or low DPI scanner (mid to late 1990s?) and eventually output in PDF form. The PDF is basically "pictures" of the text, which has varying quality due to the quality of the scanner used back then. I would like to use an OCR tool to read these PDF files and produce a character-based output, such as MS Publisher or Word. I highly doubt that I'll have access to the original paper documents.
I've got about 250 pages to replicate, along with some graphic pages and signature pages which will just be screen-printed and produced as JPEGS.
I could retype the whole thing, and I'm actually prepared to do that. But if there is an OCR way that could ensure accurate verbatim reproduction, that would be my preference; even if it required a bit of post-conversion reformatting.
I've also thought about using one of the Dragon products to dictate the text verbally. That would help relieve the wear-and-tear on hands and fingers, but I don't know if ultimately that would result in more work rather than less.
I use Windows 10 on two computers. If it's a software solution, it needs to have licensing for at least two computers. My budget for this project is a couple hundred dollars, and my desired time-to-completion is leisurely; a couple months or more would be fine.
Suggestions welcome!