Hello wise ones.
I have a script that opens a Word Document, saves it as a txt document. This code works fine. Does anyone know of a way to capture the text from the word document into an array without saving it to a txt document first? Below is the working code and as always I am open to any and all criticism both good and bad.
Thanks in advance.
use Win32::OLE;
use constant wdCRLF => 0;
use constant wdFormatText => 2;
use constant wdOpenFormatAuto => 0;
$doc = "c:\\temp\\test.doc";
$txtdoc = "$ENV{TEMP}\\reportmacro.txt";
$Win32::OLE::Warn = 3;
my $wd_object = (Win32::OLE->GetActiveObject('Word.Application') ||
Win32::OLE->new('Word.Application', 'Quit'));
##### MAKE WORD APP VISIBLE(1), NOT VISIBLE(0) ####
$wd_object -> {Visible} = 1;
$wd_object->Documents->Open({FileName => "$doc", ConfirmConversions
+=> 0, ReadOnly => 0,
AddToRecentFiles => 0, PasswordDocume
+nt => '', PasswordTemplate => '',
Revert => 0, WritePasswordDocument =>
+ '', WritePasswordTemplate => '',
Format => wdOpenFormatAuto, XMLTransf
+orm => ''});
$wd_object->ActiveDocument->SaveAs({FileName => "$txtdoc", FileForma
+t => wdFormatText, LockComments => 0,
password => '', AddToRecentFil
+es => 1, WritePassword => '',
ReadOnlyRecommended => 0, Embe
+dTrueTypeFonts => 0,
SaveNativePictureFormat => 0,
+SaveFormsData => 0,
SaveAsAOCELetter => 0, Encodin
+g => 1252, InsertLineBreaks => 1,
AllowSubstitutions => 0, LineE
+nding => wdCRLF});
$wd_object->ActiveDocument->Close();
-
Are you posting in the right place? Check out Where do I post X? to know for sure.
-
Posts may use any of the Perl Monks Approved HTML tags. Currently these include the following:
<code> <a> <b> <big>
<blockquote> <br /> <dd>
<dl> <dt> <em> <font>
<h1> <h2> <h3> <h4>
<h5> <h6> <hr /> <i>
<li> <nbsp> <ol> <p>
<small> <strike> <strong>
<sub> <sup> <table>
<td> <th> <tr> <tt>
<u> <ul>
-
Snippets of code should be wrapped in
<code> tags not
<pre> tags. In fact, <pre>
tags should generally be avoided. If they must
be used, extreme care should be
taken to ensure that their contents do not
have long lines (<70 chars), in order to prevent
horizontal scrolling (and possible janitor
intervention).
-
Want more info? How to link
or How to display code and escape characters
are good places to start.