Word to HTML Conversion

  • Thread starter Thread starter Juan Romero
  • Start date Start date
J

Juan Romero

Guys,

Has anyone tried to make a program to convert Word Documents to HTML?
Can someone give me some ideas for approaching this?

If you know of a good component out there that I can use for this purpose,
please reply to this message too. If I can't make one, I would like server
component or DLL that I can use programmatically. I have been looking over
the internet but so far I have only found complete packages with interfaces.

Thanks!
 
Hi Juan:

Many people complain about the verbosity of the HTML that word creates, but
you can invoke it directly if you have Word installed on the machine through
the Word Interop library. If you have a word document, it's only one line
of code to save it as HTML...
ActiveDocument.SaveAs FileName:="DOCUMENTNAMEHERE.htm",
FileFormat:=wdFormatHTML, EmbedTrueTypeFonts:=False
 
Thank you for replying.

I guess I forgot to mention that the reason why I am looking to make (or
get) a converter is because Word does not produce 100% (or at least close
to) identical output.

It is extremely important for our clients to get identical output. Another
requisite is that the converter should produce HTML 3.2 compliant code.

If you have any other ideas please let me know.

Thanks!
 
Back
Top