Beefy Boxes and Bandwidth Generously Provided by pair Networks
Syntactic Confectionery Delight
 
PerlMonks  

Re: Machine learning

by mce (Curate)
on Feb 27, 2004 at 14:47 UTC ( #332259=note: print w/ replies, xml ) Need Help??


in reply to Machine learning

Hi,
Have a look at PerlFect.

This is a nice free perl search engine.

It might give you some ideas.


---------------------------
Dr. Mark Ceulemans
Senior Consultant
BMC, Belgium


Comment on Re: Machine learning
Re: Re: Machine learning
by sri (Vicar) on Feb 27, 2004 at 15:30 UTC
    Thanks for your tip, but that's not really what i wanted.

    I already reached the state of PerlFect, i even implemented massive storage clustering.

    The key feature of the search engine should be categorization of results and learning from the users input, changed keywords, wants more than first ten results...
    The whole concept of the user interactivity goes far too wide to explain here.

    What i am seeking is simply a good learning algorithm! ;)
      Automatic categorisation is the panacea of Knowledge Management and it is something that a great many people are working on with a view to making some serious financial gain. The auto-cat software is therfore expensive but the blurb on the vendor's websites may be of interest.

      I use search engine software from Verity who implement machine assisted categorisation in a workbench tool such that the output keyword net can be applied to content as it is indexed. This works well in a corporate environment where content doesn't change that much and you just want to locate it in a defined categorisation structure. Verity also have a 'social network' product that allows people to see locate subject matter experts. I haven't worked with this bit yet but the demo looked cool.

      I have also looked at Autonomy who popularised Baysian techniques for clustering results. Their search engine works really well for newsfeeds where the clustering is generally unknown and fluid. The search results can appear really random until the internals have caught up with a new cluster of information. I am told that the BBC News website uses this technique to create the 'related stories' links on it's website.

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://332259]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others meditating upon the Monastery: (4)
As of 2014-07-12 14:16 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    When choosing user names for websites, I prefer to use:








    Results (240 votes), past polls