Extracting the text from a PDF file

Is it possible to extract the text from a PDF file using a MATLAB script?
I need to parse the PDF and extract particular text from it.
Is there any way to do it?

Accepted Answer

Stephen23 on 9 Jul 2015
Edited: Stephen23 on 9 Jul 2015
"Is there any way to do it?"
Of course: in principle, any data with a known specification can be parsed by MATLAB.
Is there an easy way of reading a PDF into MATLAB?
Not really, because PDFs are not sequentially organized text, although they might look like that when they are displayed or printed. The file stores positioned drawing instructions, so the order of text in the file need not match the reading order on the page. This topic has also been covered before on this forum, and a simple search will bring up several very informative discussions.
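To illustrate why file order and reading order can differ, here is a minimal sketch using only base MATLAB functions. The content stream below is a made-up, uncompressed example, not taken from any real file, and the variable names are hypothetical. Real PDFs usually compress their streams and use font-specific encodings, so this is not a general PDF reader, only a demonstration of the ordering problem.

```matlab
% Hypothetical, uncompressed PDF content stream (assumption for illustration).
% The two text fragments are stored in the opposite order to the order in
% which they appear on the page: "world" is drawn first, but lower down.
stream = ['BT /F1 12 Tf 1 0 0 1 72 700 Tm (world) Tj ET' char(10) ...
          'BT /F1 12 Tf 1 0 0 1 72 720 Tm (Hello) Tj ET'];

% Capture the x/y translation of each text matrix (Tm) and the string
% shown by the Tj operator. This pattern only handles the simple case above.
tok = regexp(stream, '1 0 0 1 (\d+) (\d+) Tm \(([^)]*)\) Tj', 'tokens');

% Collect positions as an N-by-2 numeric matrix and fragments as a cell row.
xy    = cellfun(@(t) [str2double(t{1}), str2double(t{2})], tok, ...
                'UniformOutput', false);
xy    = vertcat(xy{:});
frags = cellfun(@(t) t{3}, tok, 'UniformOutput', false);

% Order in which the text is stored in the file.
fileOrder = strjoin(frags, ' ');

% Reading order: top of the page first (larger y), then left to right.
[~, idx]  = sortrows([-xy(:,2), xy(:,1)]);
pageOrder = strjoin(frags(idx(:).'), ' ');

disp(fileOrder)
disp(pageOrder)
```

Here the file order is "world Hello" while the page order is "Hello world", which is why simply reading the bytes of a PDF from start to end does not give usable text.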

