Scrappy Module

sankarg has asked for the wisdom of the Perl Monks concerning the following question:

Dear Monks,

Is there any information on the Scrappy module? I have just started using it and got results for some sites, but it mostly does not work for some kinds of tags in the content. Please help me with the Scrappy module, and share any recent news about it as well.

Thanks, Sankar G.

Re: Scrappy Module by marto (Cardinal) on May 12, 2011 at 11:22 UTC
Have you looked at Scrappy? What's your actual question? What don't you think is working? Some tags? Read and understand "How do I post a question effectively?".
Re^2: Scrappy Module by sankarg (Initiate) on May 12, 2011 at 11:35 UTC
Thanks for your immediate reply, marto. I have already worked with the Scrappy module, and I am able to get the content when scraping a website. My question concerns this example from the latest version of Scrappy: `use Scrappy; my $scraper = Scrappy->new; $scraper->crawl('http://search.cpan.org/recent', '/recent' => { '#cpansearch li a' => sub { print $_[1]->{href}, "\n"; } } );` Here the URL is 'http://search.cpan.org/recent', which means we need to give the '/recent' tag. It works only for this CPAN site and does not work for other sites. That is my question: how should we use the tags to scrape a website? I hope that is understandable.
Re^3: Scrappy Module by marto (Cardinal) on May 12, 2011 at 11:48 UTC
I've never used this module, but if you look at the source of the URL in your example along with the documentation (see the `crawl` method), you'll see that unless other sites contain the same '/recent' link and an element with an id of 'cpansearch' etc., it's not going to work. In other words, you need to write your own code to work with your own sites. Other parsing modules are available; see WWW::Mechanize::Firefox among others.
Re: Scrappy Module by Anonymous Monk on May 12, 2011 at 11:38 UTC
Dear Sankar, maybe translate.google.com can help you ask your question effectively by providing a better translation. "How do I post a question effectively?" explains how to make your question easy for us to understand (code, input, actual output, wanted output, how it's different). Since Scrappy is based on Web::Scraper/XPath, I would start with Web::Scraper (especially the examples in http://search.cpan.org/dist/Web-Scraper/MANIFEST ), and Re: Help With Online Table Scraper, Re^2: Help With Online Table Scraper. Then I would generate a sample application using scrappy and read the source ...
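As a starting point with Web::Scraper, here is a minimal sketch rather than a definitive recipe. The HTML fragment, the `#releases` id, and the `links` key are all made up for illustration. The point is that the CSS selector has to match the markup of the page you are actually scraping, which is why a selector written for one site does nothing on another.

```perl
use strict;
use warnings;
use Web::Scraper;

# Hypothetical page fragment standing in for the site you want to scrape.
# In real use you would pass URI->new($url) to scrape() instead.
my $html = <<'HTML';
<html><body>
<ul id="releases">
  <li><a href="/dist/Foo-1.0/">Foo-1.0</a></li>
  <li><a href="/dist/Bar-2.1/">Bar-2.1</a></li>
</ul>
</body></html>
HTML

# The selector '#releases li a' is specific to the fragment above;
# inspect your own page's source and adjust it accordingly.
my $links = scraper {
    process '#releases li a',
        'links[]' => { href => '@href', text => 'TEXT' };
};

# scrape() returns a hashref keyed by the names used in process().
my $result = $links->scrape($html);

for my $link ( @{ $result->{links} || [] } ) {
    print "$link->{text}: $link->{href}\n";
}
```

If the selector matches nothing on your page, `$result->{links}` is simply absent, so an empty result usually means the selector needs adjusting rather than that the module is broken.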
