Eksempel inde fra wordcleaner:
Example of WordCleaner in Action
As an example if you create a document in Word which has the following line of text:
This is a test
If you then save this as a html document using words inbuilt save as webpage feature, this is the source html of the Word HTML file that is produced:
<html xmlns:o="urn:schemas-microsoft-com:office:office"
xmlns:w="urn:schemas-microsoft-com:office:word"
xmlns="
http://www.w3.org/TR/REC-html40"><head>
< meta http-equiv=Content-Type content="text/html; charset=windows-1252">
<meta name=ProgId content=Word.Document>
<meta name=Generator content="Microsoft Word 10">
<meta name=Originator content="Microsoft Word 10">
<link rel=File-List href="This%20is%20a%20test_files/filelist.xml">
<title>This is a test</title>
<!--[if gte mso 9]><xml>
<o:DocumentProperties>
<o:Author>Brian </o:Author>
<o:LastAuthor>Brian </o:LastAuthor>
<o:Revision>1</o:Revision>
<o:TotalTime>0</o:TotalTime>
<o:Created>2003-02-02T19:11:00Z</o:Created>
<o:LastSaved>2003-02-02T19:11:00Z</o:LastSaved>
<o:Pages>1</o:Pages>
<o:Words>2</o:Words>
<o:Characters>13</o:Characters>
<o:Company>mambosoft</o:Company>
<o:Lines>1</o:Lines>
<o:Paragraphs>1</o:Paragraphs>
<o:CharactersWithSpaces>14</o:CharactersWithSpaces>
<o:Version>10.3131</o:Version>
</o:DocumentProperties>
<o:OfficeDocumentSettings>
<o:DoNotRelyOnCSS/>
</o:OfficeDocumentSettings>
< /xml><![endif]--><!--[if gte mso 9]><xml>
<w:WordDocument>
<w:GrammarState>Clean</w:GrammarState>
<w:Compatibility>
<w:BreakWrappedTables/>
<w:SnapToGridInCell/>
<w:WrapTextWithPunct/>
<w:UseAsianBreakRules/>
</w:Compatibility>
<w:BrowserLevel>MicrosoftInternetExplorer4</w:BrowserLevel>
</w:WordDocument>
</xml><![endif]-->
<style>
<!--
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
{mso-style-parent:"";
margin:0cm;
margin-bottom:.0001pt;
mso-pagination:widow-orphan;
font-size:12.0pt;
font-family:"Times New Roman";
mso-fareast-font-family:"Times New Roman";}
@page Section1
{size:595.3pt 841.9pt;
margin:72.0pt 90.0pt 72.0pt 90.0pt;
mso-header-margin:35.4pt;
mso-footer-margin:35.4pt;
mso-paper-source:0;}
div.Section1
{page:Section1;}
-->
</style>
< !--[if gte mso 10]>
<style>
/* Style Definitions */
table.MsoNormalTable
{mso-style-name:"Table Normal";
--------- eksempel slut ---------------
Dette er jo på ingen måde normalt når man koder.
Microsoft skriver da også at det er for at gøre teksten lettere og hurtigere at redigere igen med word.
Men sagens kerne er jo at man gerne vil have fjernet alt dette fra koden når man paster fra word og ind editoren, så det bliver til almindelig html....
Hvis man skal lave en liste/array over alle microsoft's "funky-codes" skal man da smide en 250 GB disk i sin maskine for at kunne rumme alle dem.
Bare se i eksempelet alt det *censored* som ms putter ind.....