match the urls

Anonymous Monk has asked for the wisdom of the Perl Monks concerning the following question:

Please help in writing the regex. match the url below

https://www.abc.com/ap/ap/top-news/76/nTk3M/
https://www.abc.com/videos/news/eye-catching/vmxJr
https://www.abc.com/ap/top/45/state-political-news/fgdfgd
[download]

and not match

https://www.abc.com/ap/ap/top-news/
https://www.abc.com/
[download]

There should be some value after three "/"'s after the domain name and should match

Comment on match the urls Select or Download Code

Replies are listed 'Best First'.
Re: match the urls by tobyink (Canon) on Jan 11, 2013 at 09:38 UTC
`use 5.010; use strict; use warnings; my @urls = qw< https://www.abc.com/ap/ap/top-news/76/nTk3M/ https://www.abc.com/videos/news/eye-catching/vmxJr https://www.abc.com/ap/top/45/state-political-news/fgdfgd https://www.abc.com/ap/ap/top-news/ https://www.abc.com/ >; for (@urls) { m{ ^ https://www\.abc\.com/ # correct stem (?: .+? / ){3} # non-greedy string then slash x 3 .+ # at least one other character }x ? say("MATCH: $_") : say("NOT: $_") }` [download] `perl -E'sub Monkey::do{say$_,for@_,do{($monkey=[caller(0)]->[3])=~s{::}{ }and$monkey}}"Monkey say"->Monkey::do'`	[reply] [d/l]
Re: match the urls by Anonymous Monk on Jan 11, 2013 at 09:25 UTC
No thanks, see Single regex, perlintro/perlrequick and write some yourself :)	[reply]
Re: match the urls by sen (Hermit) on Jan 11, 2013 at 14:35 UTC
`#!/usr/bin/perl use strict; use warnings; my @array = ('https://www.abc.com/ap/ap/top-news/76/nTk3M/','https://w +ww.abc.com/videos/news/eye-catching/vmxJr','https://www.abc.com/ap/to +p/45/state-politic al-news/fgdfgd', 'https://www.abc.com/ap/ap/top-news/', 'https://www.a +bc.com/'); my @b = grep {$_ =~ /\w+\:\/\/\w+\.\w+\.\w+\/(\w+\-?\w+?\/){3,}./ } @a +rray; print "@b";` [download]	[reply] [d/l]

Back to Seekers of Perl Wisdom