Sillybean

November 22, 2005

Automator script for Word HTML cleanup

So, the other Steph and I were kvetching earlier about the lousy Word HTML we have to clean up all day… and I remembered something. A long time ago, I’d tried to use AppleScript to make the Word Unmunger’s batch mode easier to use. At the time, AppleScript defeated me… but now it’s Automator, and it’s a lot better.

Voilà… the Word Unmunger Automator script. You’ll need to grab the Unmunger itself, of course, and edit the workflow to match your path to the script. (I had renamed mine fix.py because I was constantly typing the file name in Terminal.)
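If you'd rather see the idea outside Automator, here's a rough sketch of what a batch wrapper around the script might look like. It's a guess at the shape of things, not the actual workflow: the `SCRIPT_PATH` location, the `build_commands` and `run_batch` helper names, and especially the assumption that the Unmunger takes a single file path as its argument are all mine, so check how your copy expects to be called before trusting it.

```python
import subprocess
import sys
from pathlib import Path

# Hypothetical location; change this to match your own path to the script.
SCRIPT_PATH = Path.home() / "bin" / "fix.py"


def build_commands(script_path, files):
    """Return one command (as an argument list) per HTML file.

    Assumption: the script accepts a single file path as its only argument.
    """
    html_files = [f for f in files if Path(f).suffix.lower() in (".htm", ".html")]
    # sys.executable is whichever Python runs this wrapper; the script
    # itself may need a different interpreter.
    return [[sys.executable, str(script_path), str(f)] for f in html_files]


def run_batch(script_path, files):
    """Run the script on each file in turn and collect the exit codes."""
    results = {}
    for cmd in build_commands(script_path, files):
        completed = subprocess.run(cmd)
        results[cmd[-1]] = completed.returncode
    return results


if __name__ == "__main__":
    # Files to clean are passed as command-line arguments.
    for path, code in run_batch(SCRIPT_PATH, sys.argv[1:]).items():
        print(f"{path}: exit code {code}")
```

Non-HTML files get skipped quietly, which is handy when you select a whole folder's worth of stuff at once.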

Now if only Dreamweaver’s commands were available to Automator. See, the Unmunger sometimes can’t handle HTML from Word files created on a Mac, and running it through Dreamweaver’s Clean Up Word HTML command first solves the problem.

Oh well. This is still going to make my professional life a lot easier.

