How to use ocr function for pdf files?

The pdf format is not recognized in MATLAB. I want to perform ocr on the pdf files. Also, I am not being able to convert the pdf into any image format in MATLAB. Please help. I'm using MATLAB on Windows.

Answers (2)

You can use "extractFileText()" to extract text from pdf,docs or word.But you have to download Text Analytics Toolbox for this function to use.
To know more visit
You will need to call an external program such as ghostscript to render the PDF into images.
ghostscript was included as part of the MATLAB release until recently (somewhere around R2014b or R2015b), but if you have a new release you will need to find and install the program yourself.

Categories

Find more on Convert Image Type in Help Center and File Exchange

Asked:

on 27 May 2016

Answered:

on 17 Mar 2019

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!