VB.NET: Extract Text From PDF Files

Last week I was working on an application which was supposed to generate thumbnails from PDF files and extract the text out of it for figuring out the keywords etc.

Well, I thought that the snippet might be of some help to some of you. So here is the code that extracts text contents out of PDF files.

Will post the thumbnail code in my next tip.

Regards

Shobhit

For this code to run, you need Adobe Acrobat installed on your machine.

I have tested this code on Adobe Acrobat Professional 6.0 and it works perfectly on that.

Comments

Plain text