http://www.perlmonks.org?node_id=1210696


in reply to Re^2: String match in Chinese character
in thread String match in Chinese character

then escape the unicode in the regex:

use warnings;
use strict;
use utf8;
my $string = '看【厂家直销 儿童加绒加厚打底裤 中小童冬季】Ib';

binmode STDOUT, ':encoding(UTF-8)';
while ($string =~ /\x{3010}(.*?)\x{3011}/g) {
    print "Match: $string\n";
}

(I left the use utf8 so that I could easily include the same string that choroba did. However you get the string is fine)

Replies are listed 'Best First'.
Re^4: String match in Chinese character
by 1nickt (Canon) on Mar 12, 2018 at 01:42 UTC

    Or use the name of the character if you don't have the codes on the tip of your tongue:

    use warnings;
    use strict;
    use utf8;
    use charnames ':full';
    
    my $string = '看【厂家直销 儿童加绒加厚打底裤 中小童冬季】Ib';
    
    binmode STDOUT, ':encoding(UTF-8)';
    print "Match: $string\n" while $string =~ /
        \N{LEFT BLACK LENTICULAR BRACKET} (.*?) \N{RIGHT BLACK LENTICULAR BRACKET}
    /gx;
    


    The way forward always starts with a minimal test.