PerlMonks  

Re: Converting Word97 (or later) exported HTML to valid HTML

by andye (Curate)
on Nov 06, 2001 at 18:00 UTC (#123571)


in reply to Converting Word97 (or later) exported HTML to valid HTML

You're so right - it's really quite horrendous. I've used two solutions for this in the past (neither is Perl, though, sorry):
  • Microsoft themselves have released a utility to do this - presumably available from their website
  • Macromedia Dreamweaver has a specific function to do this
The second of these obviously can't be incorporated in a script, and the first probably can't either. But perhaps you could persuade your users to run their HTML files through the Microsoft utility on their Windows desktops?

hth a little,
andy.

Re: Re: Converting Word97 (or later) exported HTML to valid HTML
by impossiblerobot (Deacon) on Nov 06, 2001 at 20:44 UTC
    I've found a Word filter from Microsoft that is supposed to output cleaner HTML. (I assume this is what you were talking about.)

    I also tend to use Dreamweaver for this task, but it does leave some of the CSS stuff behind, so some cleanup is still required.

    Update: Although I still haven't tested the output, it appears that the MS Word filter can be used from the command line, as a standalone GUI application, or from within Word, and can batch process multiple files.


    Impossible Robot
