analyzing website: how to

  • Thread starter: Guest

Guest

Hi all,

I want to create an application that downloads a user-defined web page and
does some content analysis on it. Basically, it should download the page
(text only) and write it to a buffer.

Does anyone know which classes I can use for this?

Regards
Stijn
 
Stijn,

Basically, you can use HttpWebRequest:
http://msdn.microsoft.com/library/d...ml/frlrfsystemnethttpwebrequestclasstopic.asp

together with MSHTML:
http://msdn.microsoft.com/library/default.asp?url=/workshop/browser/editing/mshtmleditor.asp

However, don't expect it to be easy.

Clean HTML pages hardly exist anymore.
For instance, in an ASPX page a lot is stored in the viewstate, while in
other pages a lot lives in Java, JavaScript, XML, etc. files.
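
To illustrate the point about pages not being clean, here is a minimal sketch of the text-extraction step. It is written in Python with the standard library's html.parser as a stand-in, not with MSHTML, and the names VisibleTextExtractor and extract_text are made up for this example. It assumes that "only the text" means the text nodes outside <script> and <style>; a hidden viewstate field lives in an attribute value, so it never shows up as text.

```python
from html.parser import HTMLParser
from io import StringIO


class VisibleTextExtractor(HTMLParser):
    """Collects text nodes, skipping the contents of <script> and <style>.

    Hypothetical helper for illustration only; not part of any .NET or
    MSHTML API.
    """

    SKIPPED_TAGS = {"script", "style"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.buffer = StringIO()   # the in-memory text buffer
        self._skip_depth = 0       # > 0 while inside a skipped element

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED_TAGS:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-blank text that is not inside script/style.
        text = data.strip()
        if self._skip_depth == 0 and text:
            self.buffer.write(text + "\n")


def extract_text(html):
    """Return the text of an HTML string, one text node per line."""
    parser = VisibleTextExtractor()
    parser.feed(html)
    parser.close()
    return parser.buffer.getvalue()


if __name__ == "__main__":
    page = (
        "<html><head><script>var x = 1;</script></head><body>"
        '<input type="hidden" name="__VIEWSTATE" value="dDwtMTIz" />'
        "<p>Hello <b>world</b></p></body></html>"
    )
    print(extract_text(page))
```

Note that text generated by JavaScript at run time will not be found this way, since nothing here executes scripts.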

However, in my opinion these are the first links to look at.

I hope it helps

Cor
 
HttpWebRequest
HttpWebResponse

Also, you may want to take a look at wget, a Unix utility (I've seen a
Windows port online) that should let you fetch a web page or a whole site.
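
For the download step itself, a rough sketch of "fetch the page and write it to a buffer" could look like the following. It uses Python's urllib.request as a stand-in for the request/response pair above, not the .NET classes themselves. The function names, the User-Agent string, and the UTF-8 fallback are all assumptions made for the example.

```python
import io
import urllib.request


def decode_body(raw, charset=None):
    """Decode response bytes; fall back to UTF-8 if the charset is
    missing or unknown (an assumption, not a rule from any spec)."""
    try:
        return raw.decode(charset or "utf-8", errors="replace")
    except LookupError:
        # The server declared a charset Python does not know.
        return raw.decode("utf-8", errors="replace")


def download_to_buffer(url, timeout=10):
    """Fetch url and return its body in an in-memory text buffer."""
    request = urllib.request.Request(
        url,
        # Hypothetical User-Agent; some servers reject requests without one.
        headers={"User-Agent": "page-analyzer-sketch/0.1"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        charset = response.headers.get_content_charset()
        raw = response.read()
    return io.StringIO(decode_body(raw, charset))


if __name__ == "__main__":
    # example.com is a placeholder; substitute the user-supplied address.
    buffer = download_to_buffer("http://example.com/")
    print(buffer.getvalue()[:200])
```

The buffer holds the raw HTML; stripping the markup down to text is a separate step.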
HTH,
F.O.R.
 