Converting Word formatted contents to HTML

  • Thread starter Thread starter Craig Petrie
  • Start date Start date
C

Craig Petrie

Hi,
I have a large table in Word 2003 that has formatted text in the cells and
wish to read and convert a cells formatted contents to html output via
vb.net code. The formatting contains the following: bold text, italic text,
superscript & subscript text, bullet points. The output html is then saved
in an XML file as a CDATA section for later use in the application when
formatted output is required.

I have looked up many sites on the web and also searched in user groups but
have not yet seen a way to code the above in vb.net 2003. In my VB.net app I
currently open and read the word table and output all cell contents into an
XML structure OK - I just need to know how to read and convert the formatted
text inside a cell to html format.


Can anyone suggest tools/components or code or examples that would help me
do this.

I am using VB.net 2003 and Office Professional Edition 2003.


cheers,
Craig, New Zealand
 
Hi Graig,

An XML file is not always an dataset.
A dataset can always be made as a XML file to disk or streaming to another
place.

I hope I make this a little bit clear with saying that because maybe it is
cryptic.

To use an XML file you can read it as a XML doc using the loadXML or access
it as a streaming file using the XML reader.

If you want to know more, feel free to ask?

Cor
 
* "Craig Petrie said:
I have a large table in Word 2003 that has formatted text in the cells and
wish to read and convert a cells formatted contents to html output via
vb.net code.

Maybe you can save the document to HTML using Word automation. I would
ask this question in one of the Word VBA/programming groups.
 
Hi Craig,

Simply saving the document in html format will allow the table to be
displayed in a web browser (with a few exceptions)

If you need assistance let me know.
 
Back
Top