In Word XP (Word 2002) there is an export filter called "Web Page, Filtered".
You reach it through the "Save As" dialog, in the "Save as type" list. It
strips the Office-specific XML and leaves you with internal styles and HTML 4.0.
Word puts all that extra markup in so that the HTML can be brought back into
Word without losing any formatting (round-tripping). Microsoft never intended
the HTML that Word produces to be used for an external audience on the
internet. IE 5.5 and IE 6.0 understand all of the Office-only markup.
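
If you are stuck with files that were already saved as a regular "Web Page",
a small script can remove the most common Office-only constructs after the
fact. The following is only a rough sketch, not anything Word or FrontPage
provides: the function name strip_office_markup is made up for illustration,
and the three patterns (conditional comments, namespaced tags such as <o:p>,
and xmlns attributes) are assumptions about what typical Word output contains.
It does not touch mso- style properties or MsoNormal classes, so check the
result by hand.

```python
import re


def strip_office_markup(html: str) -> str:
    """Remove some common Word-only constructs from an HTML string.

    Hypothetical helper for illustration; regex-based, so it is a
    best-effort cleanup and not a real HTML parser.
    """
    # Drop conditional comments, e.g. <!--[if gte mso 9]> ... <![endif]-->,
    # which is where Word keeps its embedded <xml> blocks.
    html = re.sub(
        r"<!--\[if[^\]]*\]>.*?<!\[endif\]-->",
        "",
        html,
        flags=re.DOTALL | re.IGNORECASE,
    )
    # Drop namespaced tags such as <o:p>, </o:p>, <st1:place>.
    # Only the tags are removed; any text between them is kept.
    html = re.sub(r"</?[a-zA-Z]\w*:[^>]*>", "", html)
    # Drop xmlns declarations, e.g. xmlns:o="urn:schemas-microsoft-com:..."
    html = re.sub(r'\s+xmlns(:\w+)?="[^"]*"', "", html)
    return html


if __name__ == "__main__":
    sample = (
        '<html xmlns:o="urn:schemas-microsoft-com:office:office"><body>'
        "<!--[if gte mso 9]><xml><o:DocumentProperties>"
        "</o:DocumentProperties></xml><![endif]-->"
        "<p>Hello<o:p></o:p></p></body></html>"
    )
    print(strip_office_markup(sample))
```

For new documents, saving as "Web Page, Filtered" in the first place is still
the simpler route.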
Mark
Quote:
> Hi there:
> We are using Word 2002 on Win2K O/S. I have now had a
> chance to work with Word and Front Page 2002 and some
> things that I am seeing that Word does makes me crazy.
> One of my users creates documents from Word 2002 to give to
> me for our web site. However, she is saving documents
> (.doc) to .html. When I go to check these documents out,
> there is a ton of the xml code and smart tagging
> information. I do NOT want that at all. It's bad enough
> that Front Page adds its own code to documents.
> Can someone please tell me ASAP how I can take a simple
> word document, convert it to HTML and NOT have it add that
> extra junk called XML?
> I'd be forever appreciative.
> Cordially,
> Peggy DaValt
> State of Wisconsin
> Dept of Regulation & Licensing
> IT Section