http://www.perlmonks.org?node_id=935405


in reply to Regex Matching Unicode and Regex Classes

The default Unicode semantics just check the Unicode properties of a code point. "ä" is U+00E4 LATIN SMALL LETTER A WITH DIAERESIS, which is classified as a letter, so \w matches it.
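
A minimal sketch of that, assuming Perl 5.14 or later so that the /u and /a regex modifiers are available. The variable names are made up for illustration, and the modifiers are used here only to make the two behaviours explicit; they are not something the original question is known to have used.

```perl
use strict;
use warnings;

# U+00E4 LATIN SMALL LETTER A WITH DIAERESIS, written as an escape
# so the example does not depend on the source file's encoding.
my $char = "\x{E4}";

# /u asks for Unicode semantics: \w consults the code point's properties.
my $unicode_match = ($char =~ /^\w$/u) ? 1 : 0;

# /a restricts \w to ASCII word characters, so the same character fails.
my $ascii_match = ($char =~ /^\w$/a) ? 1 : 0;

# \p{L} tests the "Letter" general category directly.
my $is_letter = ($char =~ /^\p{L}$/) ? 1 : 0;

printf "U+%04X: \\w with /u = %d, \\w with /a = %d, \\p{L} = %d\n",
    ord($char), $unicode_match, $ascii_match, $is_letter;
```

Because the code point carries the Letter property, both the /u match and the \p{L} match succeed, while the /a match fails since "ä" is outside ASCII.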