It'd help if you could provide a better example of the HTML. You could try something like this:
#!/usr/bin/perl
use strict;
use warnings;
use feature 'say';
use Mojo::DOM;
my $html = '<div class="JMWMJ"><div class="toI8Rb OSrXXb usbThf">Sam N
+amett, MD - Physician - Interventional Orthopedics ...Exosomes are na
+novesicles (30-200 nm) found in extracellular space of various cell t
+ypes, and in biofluids; having diverse functions including intracellu
+lar ...</div></div>
<div class="JMWMJ"><div class="toI8Rb OSrXXb usbThf">Dr. Bombay - Phys
+ician - witch doctor ...canned laughter ...</div></div>
<div class="JMWMJ"><div class="toI8Rb OSrXXb usbThf">Dr. Who - time lo
+rd - previously good TV show ...</div></div>';
my $dom = Mojo::DOM->new( $html );
for my $entry ( $dom->find('div.JMWMJ')->each ){
say $entry->all_text;
}
Outputting:
Sam Namett, MD - Physician - Interventional Orthopedics ...Exosomes ar
+e nanovesicles (30-200 nm) found in extracellular space of various ce
+ll types, and in biofluids; having diverse functions including intrac
+ellular ...
Dr. Bombay - Physician - witch doctor ...canned laughter ...
Dr. Who - time lord - previously good TV show ...
Posting a more complete example (or example URL) would be beneficial. Super Search for more Mojo::Dom goodness.
-
Are you posting in the right place? Check out Where do I post X? to know for sure.
-
Posts may use any of the Perl Monks Approved HTML tags. Currently these include the following:
<code> <a> <b> <big>
<blockquote> <br /> <dd>
<dl> <dt> <em> <font>
<h1> <h2> <h3> <h4>
<h5> <h6> <hr /> <i>
<li> <nbsp> <ol> <p>
<small> <strike> <strong>
<sub> <sup> <table>
<td> <th> <tr> <tt>
<u> <ul>
-
Snippets of code should be wrapped in
<code> tags not
<pre> tags. In fact, <pre>
tags should generally be avoided. If they must
be used, extreme care should be
taken to ensure that their contents do not
have long lines (<70 chars), in order to prevent
horizontal scrolling (and possible janitor
intervention).
-
Want more info? How to link
or How to display code and escape characters
are good places to start.
|