Find Number in String then Ignore Characters proceeding

BenPen95 has asked for the wisdom of the Perl Monks concerning the following question:

Replies are listed 'Best First'.
Re: Find Number in String then Ignore Characters proceeding by Eliya (Vicar) on May 29, 2012 at 21:05 UTC
`$str =~ s/[+-](\d+)(??{".{$1}"})//g;` [download] (Unfortunately, the straightforward attempt `$str =~ s/[+-](\d+).{\1}//g;` doesn't work.) See (??{ code }). Upd: changed `(??{"\\w{$1}"})` to `(??{".{$1}"})`, just in case you need to remove any character, not just alphanumeric.	[reply] [d/l] [select]
Re^2: Find Number in String then Ignore Characters proceeding by BenPen95 (Initiate) on May 29, 2012 at 21:29 UTC
Thank you, This was my first time posting and you guys are awesome! I was trying the straight forward way after the first responce. How come the straight forward way doesn't work? And how come \\w rather than \w works?	[reply]
Re^3: Find Number in String then Ignore Characters proceeding by AnomalousMonk (Archbishop) on May 29, 2012 at 22:51 UTC
(Unfortunately, the straightforward attempt `$str =~ s/[+-](\d+).{\1}//g;` doesn't work.) How come the straight forward way doesn't work? Because the regex compiler will attempt to compile the entire `[+-](\d+).{\1}` regex (the 'search' regex of the substitution) at compile time, but the `\1` backreference of the `.{\1}` counted quantifier sub-expression is not known until run time, when something may be captured that it can actually refer back to. OTOH, the `(??{".{$1}"})` 'postponed' extended pattern is specifically designed to both compile and run at run-time. BTW: The use of `$^N` is, IMHO, 'safer' than the use of `$1` in the sub-expression `(??{ ".{$1}" })` (making it `(??{ ".{$^N}" })` instead) because `$^N` equals the contents of the most recently closed capturing group and will not change (semantically) if the relative positional relationship between that capture group and the use of `$^N` does not change; whereas adding another capture group anywhere before the `(\d+)` group will change the semantics of `$1` because capture group counting will change.	[reply] [d/l] [select]
Re^4: Find Number in String then Ignore Characters proceeding by Eliya (Vicar) on May 29, 2012 at 23:44 UTC
Re^3: Find Number in String then Ignore Characters proceeding by Eliya (Vicar) on May 29, 2012 at 21:40 UTC
And how come \\w rather than \w works? The double backslash is just because it's in a double-quoted string, so a literal `\w` remains in the runtime constructed regex pattern fragment.	[reply] [d/l]
Re: Find Number in String then Ignore Characters proceeding by aaron_baugher (Curate) on May 29, 2012 at 21:33 UTC
If you know that the characters in question will always be uppercase letters (or some other particular character set that doesn't include the next + or -), it's fairly easy: capture the digits and letters that follow a + or -, and use `substr` to drop the correct number of letters off the beginning: `#!/usr/bin/env perl use Modern::Perl; my $str = ".,a..A,,C..+4ACGTG.,-2TG,,...,a"; $str =~ s/[+-](\d+)(\w+)/substr $2, $1/ge; say $str;` [download] Aaron B. Available for small or large Perl jobs; see my home node.	[reply] [d/l] [select]
Re: Find Number in String then Ignore Characters proceeding by snape (Pilgrim) on May 29, 2012 at 20:32 UTC
This should work. Change the "number of characters" as per your need.You need to modify the code as per your need `#!/usr/bin/perl use strict; use warnings; my $str = ".,a..A,,C..+4ACGTG.,-2TG,,...,a"; $str =~ s/[+-]?\d\w{4}//g; $str =~ s/[+-]?\d\w+//g; print $str;` [download] Update 1: Eliya's method works awesomely well. I learnt something today. Thanks Eliya Update 2: After several tries, I got this regex and it should also work `#!/usr/bin/perl use strict; use warnings; my $str = ".,a..A,,C..+4ACGTG.,-2TG,,...,a"; $str =~ s/[+\|-](\d)(\w)/(substr $2, $1)/ge; print $str;` [download]	[reply] [d/l] [select]
Re^2: Find Number in String then Ignore Characters proceeding by BenPen95 (Initiate) on May 29, 2012 at 21:01 UTC
The number after the + or - is the number of characters I would like to remove. If `..,+3AGCT.,.` it should remove `+3AGC` but leaves the `..,T.,.`	[reply] [d/l] [select]
Re: Find Number in String then Ignore Characters proceeding by temporal (Pilgrim) on May 29, 2012 at 21:33 UTC
A little more legible (if not as elegant) version of what the previous post is doing: `#! perl my $str = '.,a..A,,C..+4ACGTG.,-2TG,,...,a.'; print replace($str); sub replace { my $str = shift; if ($str =~ m/([+-])(\d*)/) { $str =~ s/\Q$1\E$2.{$2}//; return replace($str); } return $str; }` [download] ^{Strange things are afoot at the Circle-K.}	[reply] [d/l]


Pathologically Eclectic Rubbish Lister
	PerlMonks