...colleagues who tend to do a lot of elaborate string manipulation using only built-in string
functions—a common anti-pattern I've observed
Curiously, I more frequently see the converse anti-pattern, namely over-using
For example, I've often seen Perl beginners essaying:
if ($nodename =~ /$mynode/)
when they should have been using:
if ($nodename eq $mynode)
Apart from the obvious problem of embedded names (e.g. "freddy" v "fred"),
I've lost count of the number of times I've asked a Perl rookie to
consider what happens if $mynode
contains regex metacharacters
So I suggest you mention \Q
and quotemeta in your course.
I faced a similar Perl training problem a few years back and sent out an email with
seven specific word puzzle problems against the Unix "words" file (e.g. /usr/dict/words).
I offered some sort of prize for the winner IIRC.
All could be solved as one liners using: perl -ne 'your-program-here' words
or via a longer program, if you prefer.
For example: find all palindromes; find the longest word in the dictionary;
find all words that contain a particular letter four or more times;
find all words that start with "e", have "n" as their second last letter,
and are greater than seven characters in length;
find all words that are of even length and contain an even number of each and
every distinct vowel in the word. There are an endless number of interesting
word puzzles available.
You can further ask them to produce both regex and non-regex solutions,
to compare and contrast which approach is more appropriate for each problem.
It is not hard to invent problems where the regex solution is
vastly superior to the non-regex one, which may help convince them of
the power of regex.
Update: See also:
Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
Read Where should I post X? if you're not absolutely sure you're posting in the right place.
Please read these before you post! —
Posts may use any of the Perl Monks Approved HTML tags:
You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
- a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
Link using PerlMonks shortcuts! What shortcuts can I use for linking?
See Writeup Formatting Tips and other pages linked from there for more info.
| & || & |
| < || < |
| > || > |
| [ || [ |
| ] || ] ||