HTML parsing

  • Thread starter Thread starter z. f.
  • Start date Start date
Z

z. f.

Hi,

what is the best way to parse HTML (coming from servers) in order to automate HTML/HTTP interaction.
i know i can use the internet explorer control, but this seems to be the most expensive way, but also it is convenience, since i can put values in input and submit forms etc.

on the other hand i can use webrequest control to get/post pages, but then i have to parse html my self.

is there something in the middle - not to use all resouce internet explorer control uses (can i hide it, so it will not show anything, or play sound), and anyway is there some control that can allow me to navigate the DOM of an html page without all the GUI?

TIA.
 
SgmlReader or HtmlAgility (you can find both on gotdotnet). They both work
(with a few minor twists) and allow you to navigate just about any HTML
document like it were wellformed XML ...

Regards
Jesper
Hi,

what is the best way to parse HTML (coming from servers) in order to
automate HTML/HTTP interaction.
i know i can use the internet explorer control, but this seems to be the
most expensive way, but also it is convenience, since i can put values in
input and submit forms etc.

on the other hand i can use webrequest control to get/post pages, but then i
have to parse html my self.

is there something in the middle - not to use all resouce internet explorer
control uses (can i hide it, so it will not show anything, or play sound),
and anyway is there some control that can allow me to navigate the DOM of an
html page without all the GUI?

TIA.
 
Back
Top