Beefy Boxes and Bandwidth Generously Provided by pair Networks
Clear questions and runnable code
get the best and fastest answer
 
PerlMonks  

Re: conversion from doc to html

by wfsp (Abbot)
on Dec 05, 2011 at 15:16 UTC ( #941875=note: print w/ replies, xml ) Need Help??


in reply to conversion from doc to html

Some googling turned up this by Util. ++ to him. This is a cut down version that could get you started.

#!/usr/bin/perl use strict; use warnings; use Win32::OLE; use Win32::OLE::Enum; my $word = Win32::OLE->GetActiveObject('Word.Application'); my $document = $word->ActiveDocument; my $paragraphs = $document->Paragraphs(); my $enumerate = Win32::OLE::Enum->new($paragraphs); while( my $paragraph = $enumerate->Next()) { my $style = $paragraph->{Style}->{NameLocal}; my $text = $paragraph->{Range}->{Text}; $text =~ tr{\n\r}{}d; $text =~ tr{\x0b}{\n}; printf qq{%s -> ***%s***\n}, $style, $text; }
It assumes a document is open in Word. My simple document parsed as
Heading 1 -> ***Heading 1 text*** Heading 2 -> ***Heading 2 text*** Normal -> ***Normal***
For producing HTML I would consider something like HTML::Element.


Comment on Re: conversion from doc to html
Select or Download Code

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://941875]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others scrutinizing the Monastery: (6)
As of 2014-12-27 06:04 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    Is guessing a good strategy for surviving in the IT business?





    Results (176 votes), past polls