Create Searchable PDFs
How to create Searchable PDFs with OCR.space
What is a Searchable PDF?
A searchable PDF file is a PDF file that includes
text that can be searched upon using the
standard Adobe Reader “search” functionality.
In addition, the text can be selected and copied
from the PDF. Generally, PDF files created from
Microsoft Office Word and other documents are
by their nature searchable as the source
document contains text which is replicated in the
PDF, but when creating a
PDF from a scanned document it only contains images of the text
and an OCR process needs to be applied to
recognize the characters within the image.
Our online service allows you to make searchable PDFs from scans online for free.
The screenshot shows a searchable PDF. The PDF contains the original scanned image plus a separate text layer produced from the OCR process. In this example is the text layer defined as invisible, but can still be selected and searched upon. For this reason (two layers, one image, one text), a searchable PDF is sometimes also called sandwich PDF.
How to make a Searchable PDF
Very easy: Just select one of the two create searchable PDF options on the front page. You can either choose to have the text layer invisible or as visible overlay.
Create Searchable PDF automatically via our Free PDF OCR API
You can also create searchable PDFs directly via our API: In this case the JSON response of the API contains a download link for the the searchable PDF. This download link is valid for one hour, afterwards all data is removed from our OCR servers.
isCreateSearchablePdf = true triggers the generation of the searchable PDF. With the free OCR API,
the generated PDF contains a little watermark "Generated by OCR.space" in the lower right corner.
With the PRO OCR API, no watermark is added to the created PDF. More details are available in the Searchable PDF API section of the OCR API documentation