PDF OCR X Community Edition for Mac converts PDFs and images into text or searchable PDF documents. It works with both PDFs generated from a document and scanned PDFs. The application's basic interface couldn't be easier to use, and it worked like a charm in our tests; but this version of the app limits you to one-page conversions.
The main interface of PDF OCR X consists of a small, unassuming window where PDF files can be dragged and dropped. Files can also be manually selected from a Finder window via a drop-down menu. Once moved into the program, an additional menu comes up where users can select from a number of input and output settings. These include preferred language, the number of columns, and different text wrapping types. Clicking a well-labeled convert button initiates the process, which completes quickly. The output file is automatically loaded for review and can then be saved or discarded easily. The tested PDF converted to text well, without any transcription errors. The community version of the program only supports one page PDFs, though, which is a frustrating limitation.
PDF OCR X Community Edition for Mac functions well and offers an easy-to-use solution for users looking for an application that can create searchable one-page documents from PDFs or for those who need to quickly extract text from an image.
From Web Lite Solutions:
PDF OCR X is a simple drag-and-drop utility for Mac OS X, that converts your PDFs and images into text or searchable PDF documents. It uses advanced OCR (optical character recognition) technology to extract the text of the PDF (or image) even if that text is contained in an image. This is particularly useful for dealing with PDFs and images that were created via a Scan-to-PDF function in a scanner or photo copier. Supports over 60 languages for OCR. The OCR engine is based on Tesseract. The Community Edition supports single page PDFs (or the first page of multi-page PDFs). For multi-page PDF support you should upgrade to the Enterprise Edition.
What's new in this version:
Version 2.0.23 fixed issues with entitlements on OS X Sierra.
I downloaded the free version and once I realised I was trying to get it to OCR two instead of one page it did the job.
It is not as good as ABBYY on a digital photo of a page I had pdf'd - but for free what do you expect? It does soft returns which I don't thin is an option in ABBYY, certainly not in the DEVONthink plug-in version. FYI there is a portable version, which can be found here:
The free version works for single page PDF only. Seems to work well. The full version is $ 29.99.
This application has been significantly improved in its latest iteration. It has handled almost every one-page .pdf I've thrown at it since downloading earlier today.
Like any OCR application, the results aren't always perfect, but overall the performance has been very impressive. I'd say this is a pretty good deal for freeware, and I could see upgrading to the multi-page version. I've yet to see another application that can do what this one does.
It recognizes a lot of words. If we were judging this as a batting average of hits versus strikeouts, it would be the all-time baseball batting average leader. Unfortunately getting 2/3 or even 9/10 of the words right isn't good enough for OCR.
Can't recognize numbers, even in Times, which is considered to be among the most legible fonts in the world. Only crappily OCR's one page at a time.
Lame. This wasted time I couldn't afford to waste. Thanks for trying, but next time include in the description that this is a prototype that isn't road-ready.
The output is searchable, but if you select text and paste it into something else, you'll find that the text has no spaces separating the words, and all the W's are capitalized. It can't recognize a slash mark to save its life. Also, instead of just converting the text into one font, it produces the text in all types and sizes of fonts. Curable (by pasting as plain text), but seriously, what's up with that?
Adobe is still the gold standard, and nothing else comes close.
If your goal is to produce editable text, then you should choose text output rather than searchable PDF. Searchable PDF puts priority on matching the position and site of the text, not on flow. Text output prioritizes flow and editability.
It's FREE and does a great job. Only lets you do one page but again, it's FREE! Gets 99% of the text!
Nothing really. Not even the $30 upgrade price to the full version! Well worth it!
It does the job it needs to do. It does only do one page but it does a great job! Picks up 99% of all the text and then some. Worth the extra $30 to get multiple page PDF's. Great investment! Highly recommended!