VB.NET: Extract Text From PDF Files

Last week I was working on an application that had to generate thumbnails from PDF files and extract their text, for example to identify keywords.

 
I thought the snippet might be useful to some of you, so here is the code that extracts the text content of a PDF file.
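What follows is a minimal sketch of one common way to do this, not necessarily the exact snippet from the application. It assumes a full Adobe Acrobat install (not just the free Reader), which registers the `AcroExch.PDDoc` COM class, and it uses late binding, so `Option Strict` must be off. The module name `PdfTextExtractor`, the function name `ExtractText`, and the file path in `Main` are hypothetical and only there for illustration.

```vbnet
Option Strict Off ' late binding is required for the COM calls below

Imports System
Imports System.Runtime.InteropServices
Imports System.Text
Imports Microsoft.VisualBasic

Module PdfTextExtractor

    ' Returns the words of every page in the PDF at pdfPath, one line per page.
    ' Assumption: Adobe Acrobat is installed and exposes AcroExch.PDDoc.
    Function ExtractText(ByVal pdfPath As String) As String
        Dim result As New StringBuilder()
        Dim pdDoc As Object = CreateObject("AcroExch.PDDoc")

        Try
            If Not CBool(pdDoc.Open(pdfPath)) Then
                Throw New IO.IOException("Acrobat could not open " & pdfPath)
            End If

            ' The JavaScript bridge gives access to Acrobat's word-level text API.
            Dim jso As Object = pdDoc.GetJSObject()
            Dim pageCount As Integer = CInt(pdDoc.GetNumPages())

            For pageIndex As Integer = 0 To pageCount - 1
                Dim wordCount As Integer = CInt(jso.getPageNumWords(pageIndex))
                For wordIndex As Integer = 0 To wordCount - 1
                    ' True strips punctuation and whitespace from each word,
                    ' so a single space is added back as a separator.
                    result.Append(CStr(jso.getPageNthWord(pageIndex, wordIndex, True)))
                    result.Append(" ")
                Next
                result.AppendLine()
            Next

            pdDoc.Close()
        Finally
            ' Release the COM object so Acrobat can shut down cleanly.
            Marshal.ReleaseComObject(pdDoc)
        End Try

        Return result.ToString()
    End Function

    Sub Main()
        ' Hypothetical path; point this at a real PDF on your machine.
        Console.WriteLine(ExtractText("C:\temp\sample.pdf"))
    End Sub

End Module
```

Because words come back one at a time with punctuation stripped, the output suits keyword extraction better than faithful text reproduction. Passing `False` as the last argument to `getPageNthWord` keeps the punctuation and whitespace, in which case the extra space should be dropped.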
 
I will post the thumbnail code in my next tip.
 
Regards
Shobhit
 
For this code to run, you need Adobe Acrobat installed on your machine.
I have tested it only with Adobe Acrobat Professional 6.0, where it works as expected.