#> How do I write a pattern for removing roman numerals? The first 10 is #> enough. #Well, the first ten roman numerals are: # I, II, III, IV, V, VI, VII, VIII, IX, X # Just put those in a regex. s/\b(I|II|...)\b//g; # would remove roman numerals, provided they aren't touching any word # characters.