Best way to get rid of Word formating?

J

Jeremy

I receive Word documents from my team, to be posted to our website as .htm.
The word docs come with all kinds of styles and formating that are not
needed and only slow down the delivery of the .htm to the user.

Typically the only formating I need to keep from Word docs are h2, h3, p, i,
and b, and indents/blockquotes.

What I do now, is to reformat the incoming Word doc to Normal (making sure I
am not loosing the italics and bolds), then save the doc as filtered .htm,
then copy the document to a website page. But I find I still need to go
through the web page to remove vast quantities of spans, divs, sometimes
fonts, and the ubiqiutous p.MsoNormal. Then I optimize the html in
FrontPage.

Is there a better way? Thanks!
 
E

E. T. Culling

Copy and paste the docs into Notepad to get rid of all the terrible Word
formatting. Then copy and paste into your page and format properly. With all
the junk cleared away your ready to use regular CSS.
Eleanor
 
T

Thomas A. Rowe

Convert the document to a PDF, import and link to it. Then user can download it.

--
==============================================
Thomas A. Rowe (Microsoft MVP - FrontPage)
WEBMASTER Resources(tm)

FrontPage Resources, WebCircle, MS KB Quick Links, etc.
==============================================
 
J

John

Tom,

I think you have missed the point on this post.

That method is not suitable if the client is sending text you need to use in
a webpage. Copy and paste the text into a TXT (Notepad) file and then
copy/paste into FrontPage.

That cleans all the extraneous code out.

John
 
J

John

If you do it this way, all Word code goes along with it. Far better to
copy/paste into notepad, then copy/paste into FrontPage

John
 
?

=?Windows-1252?Q?Rob_Giordano_\=28aka:_Crash_Gordo

Sounds like a lot of work for you. Is there a reason it has to come through you?


| I receive Word documents from my team, to be posted to our website as ..htm.
| The word docs come with all kinds of styles and formating that are not
| needed and only slow down the delivery of the .htm to the user.
|
| Typically the only formating I need to keep from Word docs are h2, h3, p, i,
| and b, and indents/blockquotes.
|
| What I do now, is to reformat the incoming Word doc to Normal (making sure I
| am not loosing the italics and bolds), then save the doc as filtered ..htm,
| then copy the document to a website page. But I find I still need to go
| through the web page to remove vast quantities of spans, divs, sometimes
| fonts, and the ubiqiutous p.MsoNormal. Then I optimize the html in
| FrontPage.
|
| Is there a better way? Thanks!
|
|
 
T

Thomas A. Rowe

I didn't miss the point! I responded based on Jeremy's question "Is there a better way?"

IMO, the better way is to convert the file to a PDF file, thereby retaining all formatting of the
original document. This will also reduce the amount of time required to get the content on to the
web and available to users, to just a few minutes.
--
==============================================
Thomas A. Rowe (Microsoft MVP - FrontPage)
WEBMASTER Resources(tm)

FrontPage Resources, WebCircle, MS KB Quick Links, etc.
==============================================
 
W

Wally S

I've been doing it for the last nine years. You don't get the Word code. If
you copy it first into Notepad, you will lose all the formatting. The
problem here is that Jeremy wants to keep some of the formatting and get rid
of the rest of it. If he wants to get rid of all the different fonts, he
could select the whole doc and then format it as one font. That will work.

Wally S
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Top