Tech Tip: Extracting Text from Online Manuals

Extract Text from Images & Scanned PDF Manuals Online
Source: Digital Inspiration

If you are on a budget, the built-in OCR engine of Google Search is almost a perfect option for converting scanned PDFs to text - just put all your scanned PDF images onto a public website and wait for Google spiders to convert them into editable digital text.
onlineocr.bmp

Obviously there are two drawbacks associated with the original idea. The PDF conversion process is not real time and second, you need access to a public web server where you can upload the PDF images so that Google bots can find them.

If you aren’t willing to wait that long and need to perform instant OCR without downloading any of the software tools, try OCR Terminal - it’s an online Optical Character Recognition service where you can upload scanned images, multi-page PDF documents or even screenshots and convert them into searchable text documents.

The conversion results, as you can noticed in the screenshot above, are pretty accurate and it also preserves the document formatting and layout. You may download the extracted text as RTF or a Word Document.

Read more at Digital Inspiration.

About this Entry

This page contains a single entry by University of Minnesota Law Library published on March 5, 2009 9:10 AM.

Librarian, Friend to U Libraries, Wins Award was the previous entry in this blog.

Film on Controversial Death Penalty View is the next entry in this blog.

Find recent content on the main index or look in the archives to find all content.