I am using HTMLarea3, and my users paste Word HTML into it. Sometimes it looks OK in HTMLarea, sometimes it does not. Either way, when it gets posted to a site, it looks VERY bad. Dreamweaver has a "Clean up Word HTML" option. Is there any way to do something like that in Perl with regexes? I am not very good with regexes, but has anyone done something like this?

Update: How does that script work with all the # characters in it?

