Word HTML Cleaner v1.1

Instructions

Or copy and paste HTML here:

Browser view of Word version

For reference purposes.

Browser view of Cleaned version



Clean HTML Code (for copy and paste)

Instructions

This script is designed to clean the HTML that Microsoft Word creates. To use it,

The two boxes below the input box show how the original Word HTML appears in a browser, and how the cleaned version appears. If the clean version is satisfactory, copy the HTML in the bottom box, labeled Clean HTML Code, and paste it into an HTML file using a text editor.

You can edit the HTML in Clean HTML Code if you wish. Click Update Clean Version View to update the view on the right. Caution: if you reload the page, your edits will be lost.

If the Word file is large, cleaning may take a minute. Please be aware that this cleaner cannot fix everything that is wrong with Word's HTML. The cleaner does not actually know how the document looks in Word. It simply removes all the extra tags and attributes that Word adds. Some editing will probably be required after the conversion is complete.

Note: empty elements are removed, except for empty paragraph and table cells

This cleaner has absolutely no warranty, not even the implied warranty of merchantability or fitness for a particular purpose.

The original version of this code is copyright 2005 by Connor McKay. This version is substantially revised by Chris Riesbeck.

Back to top