Need to extract the text of a image file (OCR)
I have a set of images in JPG format. I have Microsoft Office installed. I need to extract the text off the JPG files. How do I do this?
I have a set of images in JPG format. I have Microsoft Office installed. I need to extract the text off the JPG files. How do I do this?
You will need to use a third party software for OCR. Do you have a scanner by any chance? Scanner software usually comes with a OCR software as a free bundle.
I do not want to install a third party software. Is there a way to do it with Microsoft Word?
Do you have the Office suite of products installed? Â Can you check if you have a Microsoft Office Tool called "Microsoft Office Document Imaging"? You can give it a shot with it.
Yes, I do have Microsoft Office installed and I can see the tool called "Microsoft Office Document Imaging". I opened it up. Unfortunately it only supports MDI and TIFF formats. My file types are JPG!
Microsoft Office Document Imaging only supports MDI and TIFF. There is a long shot that you can try. That is to convert the JPG to TIFF. You can follow these steps:
This "might" work.
You can also use Microsoft Office Picture Manager tool for the file conversion. It is better than using paint. It is a bundled product that comes when you install Microsoft Office suite of products.
I used the Microsoft Office Picture Manager to convert the files to TIFF. Then I used Microsoft Office Document Imaging tool to perform OCR on the TIFF image.  I got some text onto Microsoft Word. The extracted text requires fine tuning but at least I got a majority of the text via OCR. Thank you Math Girl and Whiz Boy for the support and advise!
Glad to be of assistance! Remember, for OCR you need to have the base images in TIFF. This is the industry accepted format for saving scanned images. It is also the base format for OCR text. JPG does a grainy effect when it compresses the file. TIFF does not introduce gray matter into the image. Therefore TIFF is the best format for OCR.