http://www.perlmonks.org?node_id=811315


in reply to Unicode regular expressions

This is small, but no one has mentioned it. The OP wrote:

Additionally ignoring the unicode characters the following works:
/^[\w\ \-]+$/
Because of the "\w", this will match underscores (_); I don't think that is what is intended by:
...regular letters, hyphens, unicode letters, numbers, spaces, newlines (\n or \r\n) but no punctuation of any sort.