M
mp
I just made a quickie app to browse a folder(s) of .htm files
I put a webBrowser control on a form and a previous and next button, and a
save button.
my aim is to quickly browse through the files and if i find a page that has
useful info on it i want to save the "text information" to a text
file(ascii)- that's what the "save" button will be for.
I found this property webBrowser1.DocumentText which returns a string of the
html text of that page.
this string includes all the html tags etc...not just the written" text "
showing up on the page.
what i'm wondering is do i need to write a routine to parse the html tags to
detect what is the actuall "text" showing up on the page (as opposed to
attributes, formatting, etc)...or is there some kind of existing object that
can parse that text and return the actual "words" which appear on the page?
in other words if i look at a page in the browser, i could manually
copy/paste the "text" that shows up on the page.
is there a builtin way to do that programatically or do i need to create my
own html parsing routine?
thanks
mark
I put a webBrowser control on a form and a previous and next button, and a
save button.
my aim is to quickly browse through the files and if i find a page that has
useful info on it i want to save the "text information" to a text
file(ascii)- that's what the "save" button will be for.
I found this property webBrowser1.DocumentText which returns a string of the
html text of that page.
this string includes all the html tags etc...not just the written" text "
showing up on the page.
what i'm wondering is do i need to write a routine to parse the html tags to
detect what is the actuall "text" showing up on the page (as opposed to
attributes, formatting, etc)...or is there some kind of existing object that
can parse that text and return the actual "words" which appear on the page?
in other words if i look at a page in the browser, i could manually
copy/paste the "text" that shows up on the page.
is there a builtin way to do that programatically or do i need to create my
own html parsing routine?
thanks
mark