From my prior research, it appears you will need some third-party library,
especially if the data is not always structured consistently. I am currently
working on a project which requires me to extract text (no tables per se,
but the text does have structure) and have just started to evaluate the
library at http://www.pdftron.com/. Each library I try on the same PDF
extracts the data slightly differently, for example:
Order #
XX000411209
07/02/2007
Ship Date

Order #
XX000411209 07/02/2007
Ship Date
These slight differences mean your parsing code has to change along with the
library. Anyway, I don't have one specific product to recommend, but I
wanted to alert you to this situation in case you switch from one library
to another.
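
One way to soften this is to make the parsing step indifferent to where
the line breaks fall. Below is a rough Python sketch, not tied to any
particular PDF library: it assumes you already have the extracted text as a
string, and the function name parse_order and both regular expressions are
my own guesses based on the sample above (two letters plus nine digits for
the order number, MM/DD/YYYY-style digits for the date). Adjust them to
your real data.

```python
import re

# Assumed patterns, inferred only from the sample output above.
ORDER_RE = re.compile(r"\b[A-Z]{2}\d{9}\b")
DATE_RE = re.compile(r"\b\d{2}/\d{2}/\d{4}\b")


def parse_order(text):
    """Pull the order number and date out of extracted text.

    All runs of whitespace (including newlines) are collapsed to single
    spaces first, so it does not matter whether a library puts the values
    on one line or on separate lines.
    """
    flat = " ".join(text.split())
    order = ORDER_RE.search(flat)
    date = DATE_RE.search(flat)
    return {
        "order": order.group(0) if order else None,
        "date": date.group(0) if date else None,
    }


# The two layouts shown above, as two libraries might return them.
layout_a = "Order #\nXX000411209\n07/02/2007\nShip Date"
layout_b = "Order #\nXX000411209 07/02/2007\nShip Date"

print(parse_order(layout_a) == parse_order(layout_b))
```

This only helps when the values themselves are recognisable by pattern; if
a library reorders the text or merges columns, you will still need
library-specific handling.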
Lastly, CodePlex has the start of a PDF library which at this point doesn't
provide content extraction, but it is worth checking out for other
functionality with PDF documents.