The section headers in your data are all upper case ("ITEM"), but your regex matches both "ITEM" and "item", so it can stop at an ordinary word in the body text. The following captures only the ITEM I: BUSINESS text from the data you provided:
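use File::Slurp qw/read_file/;

my $text = read_file 'a5927574.txt';

# No /i modifier, so only upper-case "ITEM" headers match; /s lets . span newlines.
my ($businessItemText) =
    $text =~ /(ITEM [\dA-Z]+?[: -]+BUSINESS.+?)ITEM [\dA-Z]+?[: -]+/s;

# Print the captured section (the output is shown below).
print $businessItemText;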
ITEM I: BUSINESS
Littlefield Corporation develops, owns and operates charitable bingo
owns and operates an event rental company. In our Entertainment div
operate 37 charitable bingo halls in Texas, Alabama, Florida and South
are with Littlefield Hospitality and twelve (12) are at corporate he
in Austin, Texas. Littlefield Entertainment consists of sixteen (16) full time
employees and nineteen (19) part time employees. Littlefield H
consists of thirty-two (32) full time employees and one part time empl
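To see why case sensitivity matters here, a minimal sketch follows. It runs the same pattern with and without the /i modifier. The sample text is made up for illustration and is not taken from your file, and the variable names are hypothetical.

```perl
use strict;
use warnings;

# Hypothetical sample text, not taken from the original filing.
my $text = join "\n",
    'ITEM I: BUSINESS',
    'We operate bingo halls. Each item sold is tracked.',
    'ITEM 2: PROPERTIES',
    'We lease office space.',
    '';

# Case-sensitive: only an upper-case "ITEM" header can end the capture.
my ($case_sensitive) =
    $text =~ /(ITEM [\dA-Z]+?[: -]+BUSINESS.+?)ITEM [\dA-Z]+?[: -]+/s;

# Case-insensitive (/i): the lower-case "item sold " in the body also
# satisfies the terminating part of the pattern, so the capture stops early.
my ($case_insensitive) =
    $text =~ /(ITEM [\dA-Z]+?[: -]+BUSINESS.+?)ITEM [\dA-Z]+?[: -]+/si;

print "Without /i:\n$case_sensitive\n";
print "With /i:\n$case_insensitive\n";
```

Without /i the capture runs up to the "ITEM 2" header. With /i it ends just before "item sold", which is the kind of early cut-off you were seeing.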
Hope this helps!