Beefy Boxes and Bandwidth Generously Provided by pair Networks
more useful options
 
PerlMonks  

Related module links on search.cpan.org

by ferreira (Chaplain)
on Apr 11, 2008 at 13:00 UTC ( [id://679670]=monkdiscuss: print w/replies, xml ) Need Help??

Graham Barr announced at http://log.perl.org:

Documentation pages on search.cpan.org now contain a list of related modules. For an example see LWP::UserAgent

This list is determined by data created by Perlmonks.org about which modules are discussed together.

Where's that data search.cpan is pulling? How do we contribute to it? Some of the modules listed in SEE ALSO sections of the documentation are immediate candidates for these links.

Update: as pointed by Corion, that information is probably being extracted from cpan module search with a XML display type and filling up the field module and selecting "Other Modules". That information is automated, relating the appearance of modules referenced in the same article. This automation is good because it is automatically maintained, but it is dull sometimes relating, for example, LWP::UserAgent and Data::Dumper.

Replies are listed 'Best First'.
Re: Related module links on search.cpan.org
by Corion (Patriarch) on Apr 11, 2008 at 13:24 UTC

    The link on search.cpan.org also has a backlink to cpan module search, so I guess what search.cpan.org uses is the RSS provided by that node. I'm not sure about the load implications of all of this - if search.cpan.org produces lots of "related modules" queries for perlmonks.org, we might have to cache these results (if they aren't cached already).

      should have looked at META.yml instead :)
Re: Related module links on search.cpan.org
by ack (Deacon) on Apr 11, 2008 at 17:30 UTC

    I've seen this recently on CPAN; it's actually kind of neat and actually pretty helpful overall. Though, as you noted:

    ...information is automated, relating the appearance of modules referenced in the same article. This automation is good because it is automatically maintained, but it is dull sometimes relating, for example, LWP::UserAgent and Data::Dumper...

    It is a lot like the Amazon.com strategy of the Other people who looked at this item also looked at.... And although the CPAN version just says Related Modules, I have generally just used in the spirit of the Amazon.com strategy.

    There is certainly room for improvement, IMHO, but is still a nice addition to CPAN.

    ack Albuquerque, NM

      Yes, it would be helpful to normalize the scores so that Data::Dumper isn't the module considered most closely "related" to just about every module on CPAN just because it is the most mentioned on PerlMonks (and so is the "most mentioned in conjuction with X" for most values of "X").

      So it'd be cool to be able to list modules "related to X" sorted by how much more likely they are to be mentioned when "X" is mentioned. I think even the math for that is quite simple. Then Data::Dumper could even have a negative score in relation to some module. :)

      - tye        

        In a nutshell:

        • We have a list of matching modules sorted by "tf"
        • We need a list of matching modules sorted by the weighted "tf/idf"

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: monkdiscuss [id://679670]
Approved by Arunbear
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others sharing their wisdom with the Monastery: (9)
As of 2024-03-28 12:25 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found