How to create Searchable PDFs with OCR.space


What is a Searchable PDF?

A searchable PDF is a PDF file whose text can be found with the standard "search" function of Adobe Reader and other PDF viewers. The text can also be selected and copied from the PDF. PDF files created from Microsoft Word and similar documents are searchable by nature, because the source document contains text that is replicated in the PDF. A PDF created from a scanned document, however, contains only images of the text, so an OCR process must be applied to recognize the characters within the image. Searchable PDFs are especially useful for accessing content in documents that must be archived with their precise original appearance. Our online service allows you to make searchable PDFs from scans online for free.

Searchable PDF

The screenshot shows a searchable PDF. The PDF contains the original scanned image plus a separate text layer produced by the OCR process. In this example the text layer is defined as invisible, but it can still be selected and searched. Because of these two layers (one image, one text), a searchable PDF is sometimes also called a sandwich PDF.

How to make a Searchable PDF

It is easy: select one of the two "Create searchable PDF" options on the front page. You can choose to keep the text layer invisible or to show it as a visible overlay.

Create Searchable PDF automatically via our Free PDF OCR API

You can also create searchable PDFs directly via our API. In this case the JSON response of the API contains a download link for the searchable PDF. This download link is valid for one hour; afterwards all data is removed from our OCR servers.

Setting isCreateSearchablePdf = true triggers the generation of the searchable PDF. With the free OCR API, the generated PDF contains a small watermark, "Generated by OCR.space", in the lower right corner. With the PRO OCR API, no watermark is added to the created PDF. More details are available in the Searchable PDF API section of the OCR API documentation.
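As a rough illustration of the API call described above, the following Python sketch builds a request with isCreateSearchablePdf set to true and reads the download link from the JSON response. Only isCreateSearchablePdf comes from this page. The endpoint URL, the other field names (apikey, url, isSearchablePdfHideTextLayer), the response key SearchablePDFURL, and the use of a form-encoded POST are assumptions that should be checked against the OCR API documentation. The helper function names are hypothetical.

```python
import json
import urllib.parse
import urllib.request

# Assumed endpoint; confirm it in the OCR API documentation.
API_ENDPOINT = "https://api.ocr.space/parse/image"


def build_payload(api_key, file_url, hide_text_layer=True):
    """Build the form fields for a searchable-PDF request."""
    return {
        "apikey": api_key,                # assumed field name
        "url": file_url,                  # assumed field name
        "isCreateSearchablePdf": "true",  # parameter named on this page
        # Assumed field name: invisible text layer vs. visible overlay.
        "isSearchablePdfHideTextLayer": "true" if hide_text_layer else "false",
    }


def extract_pdf_url(response):
    """Return the download link from a parsed JSON response, or None."""
    # Assumed response key; a missing or empty value yields None.
    return response.get("SearchablePDFURL") or None


def request_searchable_pdf(api_key, file_url, hide_text_layer=True, timeout=60):
    """Send the request and return the searchable-PDF download link."""
    fields = build_payload(api_key, file_url, hide_text_layer)
    data = urllib.parse.urlencode(fields).encode("ascii")
    # Passing data= makes urlopen issue a form-encoded POST request.
    with urllib.request.urlopen(API_ENDPOINT, data=data, timeout=timeout) as resp:
        return extract_pdf_url(json.load(resp))
```

A call such as request_searchable_pdf("YOUR_API_KEY", "https://example.com/scan.pdf") would return the link, or None if the response carries none. Because the link is valid for only one hour, the PDF should be downloaded soon after the call returns.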

Free version adds a small watermark

The free version of the OCR API adds a small watermark at the bottom of each page of the created searchable PDF. The free version is also limited to the first three pages of your input PDF. The PRO PDF plan removes this page limit.

Searchable PDF Watermark - Free version only
The watermark is not added if you use the PRO or PRO PDF OCR API plans.


Do you have an OCR API question? Please email us or visit the OCR API Forum - we love to answer OCR questions.