Grab PDF content and save to Excel

  • Thread starter: Grey

Grey

I need to grab the content of a PDF. The PDF has a standard format with
data inside. Is there any way to 1) grab the table content from the PDF,
and then 2) put the data into Excel?
 
From my prior research it would appear that you will need some third-party
library, especially if the data is not always structured consistently. I am
currently working on a project which requires me to extract text (no tables
per se, but it does have structure) and have just started to evaluate this
library: http://www.pdftron.com/. Each library I try on a single PDF
extracts the same data, but laid out slightly differently. For example:

Order #
XX000411209
07/02/2007
Ship Date

Order #
XX000411209 07/02/2007
Ship Date

Slight differences like these mean your code may need to change. Anyway, I
don't have one specific product to recommend, but I wanted to alert you to
this situation in case you shift from one library to another.
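
One way to soften that problem is to parse the extracted text by tokens rather than by lines, so the same code copes with both layouts shown above. What follows is only a rough sketch in Python, not tied to any particular PDF library: it assumes you already have the extracted text as a string, and the names parse_order and write_csv are made up for illustration. It also writes CSV (which Excel opens directly) instead of a native workbook, purely to keep the example free of third-party dependencies.

```python
import csv
import io
import re

# Assumption: dates in the extracted text look like 07/02/2007.
DATE_RE = re.compile(r"^\d{2}/\d{2}/\d{4}$")


def parse_order(text):
    """Return (order_number, date) from extracted text, ignoring line breaks."""
    tokens = text.split()  # splits on any whitespace, newlines included
    order_number = None
    date = None
    for i, tok in enumerate(tokens):
        # The token right after the "Order #" label is taken as the number.
        if tokens[i:i + 2] == ["Order", "#"] and i + 2 < len(tokens):
            order_number = tokens[i + 2]
        # The first token shaped like a date is taken as the date.
        elif date is None and DATE_RE.match(tok):
            date = tok
    return order_number, date


def write_csv(rows, fileobj):
    """Write a header plus one row per order to an open text file object."""
    writer = csv.writer(fileobj)
    writer.writerow(["Order #", "Date"])
    writer.writerows(rows)


# The two layouts from the post, as two different libraries might return them.
layout_a = "Order #\nXX000411209\n07/02/2007\nShip Date\n"
layout_b = "Order #\nXX000411209 07/02/2007\nShip Date\n"

rows = [parse_order(layout_a), parse_order(layout_b)]

# Written to an in-memory buffer here; with a real file, open it with
# newline="" as the csv module documentation recommends.
buffer = io.StringIO()
write_csv(rows, buffer)
csv_text = buffer.getvalue()
```

Both layouts yield the same pair of values, so switching extraction libraries would not force a change in this part of the code. Real documents will likely need more careful rules than "first thing that looks like a date".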

Lastly, CodePlex has the start of a PDF library. At this point it doesn't
provide extraction of content, but it is worth checking out for other
functionality with PDF documents.
 