Reading a PDF file

  • Thread starter: akshar108 via DotNetMonster.com

akshar108 via DotNetMonster.com

I want to read a PDF file and extract data from it in .NET without using any
component. If that is possible, please tell me how.
 
akshar108 said:
I want to read a PDF file and extract data from it in .NET without using any
component. If that is possible, please tell me how.

Yes, it's possible. Just read the file and extract the information you want.

The file format specification is publicly available from Adobe.
 
akshar108 said:
Can you tell me the technique, or show me some code, for doing that please?

Depending on whether the file format is binary or textual, you can use a
BinaryReader or StreamReader to read it.
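
As a rough illustration of that advice, here is a minimal sketch that opens a file with a BinaryReader and checks its header. A PDF starts with an ASCII comment such as %PDF-1.4 but contains binary data further on, so BinaryReader is probably the safer of the two. The names PdfHeader and ReadVersion are made up for this example, and it only identifies the version; it does not extract anything yet.

```csharp
using System;
using System.IO;
using System.Text;

static class PdfHeader
{
    // Returns the version from a "%PDF-x.y" header, or null if the
    // stream does not start with that marker.
    public static string ReadVersion(Stream stream)
    {
        using (var reader = new BinaryReader(stream, Encoding.ASCII, true))
        {
            byte[] head = reader.ReadBytes(16);   // the header fits well within 16 bytes
            string text = Encoding.ASCII.GetString(head);
            if (!text.StartsWith("%PDF-", StringComparison.Ordinal))
                return null;

            // The version runs up to the first whitespace character.
            int end = text.IndexOfAny(new[] { '\r', '\n', ' ' });
            if (end < 0) end = text.Length;
            return text.Substring(5, end - 5);
        }
    }

    static void Main(string[] args)
    {
        if (args.Length != 1)
        {
            Console.WriteLine("usage: PdfHeader <file.pdf>");
            return;
        }
        using (var file = File.OpenRead(args[0]))
        {
            Console.WriteLine(ReadVersion(file) ?? "not a PDF");
        }
    }
}
```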
 
Can you tell me the technique, or show me some code, for doing that please?

Göran is being slightly tongue-in-cheek here...

I've already told you how to extract data from a PDF file by using the
Aspose PDF component - that's what it's for...

If you don't want to do that, then that's your choice.

However, as Göran says, you will then have to obtain a copy of the PDF file
format from Adobe.

There are various versions of the PDF file format, all of which are
different.

Then you will have to write your own routines for opening a PDF file,
working out which version it is, and then separating all the various
individual parts until you find what you're looking for.
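
To give an idea of what "separating the parts" starts with, here is a hedged sketch of one early step. A PDF file normally ends with the keyword startxref, the byte offset of the cross-reference table, and an %%EOF marker; the cross-reference table is what tells you where each object in the file lives. The class name PdfTrailer, the method FindXrefOffset and the choice to search only the last 1024 bytes are assumptions for this example, and everything after this step (parsing the table, the objects and the streams) is still left to write.

```csharp
using System;
using System.IO;
using System.Text;

static class PdfTrailer
{
    // Reads the tail of the file and returns the byte offset that follows
    // the last "startxref" keyword, or -1 if it cannot be found.
    public static long FindXrefOffset(Stream stream)
    {
        int tailSize = (int)Math.Min(1024, stream.Length);
        stream.Seek(-tailSize, SeekOrigin.End);

        byte[] tail = new byte[tailSize];
        int read = 0;
        while (read < tailSize)
        {
            int n = stream.Read(tail, read, tailSize - read);
            if (n == 0) break;
            read += n;
        }

        string text = Encoding.ASCII.GetString(tail, 0, read);
        int pos = text.LastIndexOf("startxref", StringComparison.Ordinal);
        if (pos < 0) return -1;

        // The offset is the decimal number that follows the keyword.
        string rest = text.Substring(pos + "startxref".Length).TrimStart();
        int len = 0;
        while (len < rest.Length && char.IsDigit(rest[len])) len++;

        long offset;
        return long.TryParse(rest.Substring(0, len), out offset) ? offset : -1;
    }

    static void Main(string[] args)
    {
        if (args.Length != 1)
        {
            Console.WriteLine("usage: PdfTrailer <file.pdf>");
            return;
        }
        using (var file = File.OpenRead(args[0]))
        {
            Console.WriteLine("xref offset: " + FindXrefOffset(file));
        }
    }
}
```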

That's what Aspose did...

Of course, they have a large team of developers.

It might take you a month or even longer, depending on your skill level.

Alternatively, just use the Aspose component...
 
Do you have an MSN Messenger account so we can chat directly? And please give
me any code to read a PDF file and extract text.

Göran Andersson said:
Can you tell me the technique, or show me some code, for doing that please?
[quoted text clipped - 3 lines]
Depending on whether the file format is binary or textual, you can use a
BinaryReader or StreamReader to read it.
 
There is no simple code to read a PDF file.

A PDF file is built on the PostScript imaging model: each page's content is
a small program of PostScript-like operators. To read the file you write an
interpreter for those operators (it's a pretty simple stack machine, but
there is a large set of graphics operators). There is some data around the
page programs, but it's simple to parse them out. You then run each program
through the interpreter you wrote to build the pages.
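
To give a feel for the stack-machine idea, here is a very small sketch. It assumes you already have a page's content as decoded text; in real files the content is usually compressed, and text can use font-specific encodings, neither of which is handled here. Operands are pushed on a stack, and when the Tj (show text) operator comes along the string on top of the stack is collected. The names ContentStreamSketch, Tokenize and ExtractText are made up for this example, and every other operator is simply ignored.

```csharp
using System;
using System.Collections.Generic;
using System.Globalization;
using System.Text;

static class ContentStreamSketch
{
    // Splits the input into tokens. A literal string "(...)" becomes one
    // token that keeps its leading '(' so it can be recognised later.
    // Simplification: a backslash just copies the next character as-is.
    static IEnumerable<string> Tokenize(string src)
    {
        int i = 0;
        while (i < src.Length)
        {
            char c = src[i];
            if (char.IsWhiteSpace(c)) { i++; continue; }

            var sb = new StringBuilder();
            if (c == '(')
            {
                int depth = 0;
                do
                {
                    c = src[i++];
                    if (c == '\\' && i < src.Length) { sb.Append(src[i++]); continue; }
                    if (c == '(') depth++;
                    if (c == ')') depth--;
                    if (depth > 0) sb.Append(c);
                } while (depth > 0 && i < src.Length);
            }
            else
            {
                while (i < src.Length && !char.IsWhiteSpace(src[i]) && src[i] != '(')
                    sb.Append(src[i++]);
            }
            yield return sb.ToString();
        }
    }

    // Pushes operands, and collects the string operand of each Tj operator.
    public static string ExtractText(string contentStream)
    {
        var operands = new Stack<string>();
        var text = new StringBuilder();
        foreach (string token in Tokenize(contentStream))
        {
            double ignored;
            bool isOperand = token[0] == '(' || token[0] == '/' ||
                double.TryParse(token, NumberStyles.Float,
                                CultureInfo.InvariantCulture, out ignored);
            if (isOperand) { operands.Push(token); continue; }

            // Anything else is treated as an operator that consumes the stack.
            if (token == "Tj" && operands.Count > 0 && operands.Peek()[0] == '(')
                text.Append(operands.Peek().Substring(1));
            operands.Clear();
        }
        return text.ToString();
    }

    static void Main()
    {
        Console.WriteLine(ExtractText("BT /F1 12 Tf 72 712 Td (Hello, world) Tj ET"));
    }
}
```

A real interpreter also has to handle hex strings, arrays (the TJ operator), the graphics state and fonts, which is where most of the work is.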



-- bruce (sqlwork.com)
 