This updates brings the abilitiy to generate searchable PDF to OCR Engine2. Searchable PDF are sometimes called Sandwich PDF because they contain two layers: The original PDF and a second layer with the OCR’ed text.
Our popular on-premise OCR software received a major update: OCR Engine 2 is now integrated in the offline version of the OCR.Space OCR API.
We updated our OCR Engine 2 with full PDF OCR support, including auto-rotation and receipt OCR. And while we are at it, we also improved the processing speed of the engine 2.
The February 2020 update adds WebP to the list of supported image files for image ocr. The supported image formats are PNG, JPG, GIF, TIFF and WebP now. And of course PDF for document OCR. But wait… what is WebP?
We launched a new OCR Engine that brings improved numeric and alphanumeric OCR and special character OCR.
We implemented the second OCR Engine to give you access to a second OCR algorithm. It is better than the default engine (now called “engine1”) in many cases for Latin character languages (e. g. English OCR, French OCR, German OCR,…). So we recommend that you try engine1 first (since it is faster!), but if the OCR results are not perfect for your use case, please try the same document with engine2. You can use the new OCR engine with our free online OCR service on the front page, and with the OCR API.
To test the new OCR engine right away, use the free online OCR feature on our front page. You can switch between both OCR engines and compare the result.