Beefy Boxes and Bandwidth Generously Provided by pair Networks
Do you know where your variables are?
 
PerlMonks  

fastest searchengine for perl

by marcelser (Initiate)
on Aug 08, 2006 at 13:12 UTC ( [id://566150]=perlquestion: print w/replies, xml ) Need Help??

marcelser has asked for the wisdom of the Perl Monks concerning the following question:

Hi Folks,

I'm looking for a really fast search engine for perl (or with perl interface).

I was already trying plucene but plucene is very slow. At the moment I'm working on tests for kinosearch but I think it would be too slow too. The CLucene package is outdated and doesn't work anymore, (does anybody know of an update, cause I think clucene would be very fast?)

Does anybody know of any other fast search engines I did not come across?

At the moment I'm playing with the idea of calling the Java Lucene with Java::Import but as my Java knowledge is absolutely zero it would be quite difficult to write the wrapper classes and compile the stuff. Has anyone does something like this yet?

Best regards,

Marc

Replies are listed 'Best First'.
Re: fastest searchengine for perl
by jeteve (Pilgrim) on Aug 08, 2006 at 13:34 UTC

    You should contact tbusch at cpan dot org . He's about to release a cpan distrib simply called Lucene.pl It's a XS interface to clucene and it's almost complete .

    -- Nice photos of naked perl sources here !

Re: fastest searchengine for perl
by perrin (Chancellor) on Aug 08, 2006 at 14:44 UTC
    If you want a very fast and easy-to-use search with a good and well-maintained Perl API, I recommend SWISH-E.
      I already took a look at Swish-E but Swish-E has two serious problems:

      1.) not utf8 capable
      2.) cannot delete documents from index!

      Therefore it's not really an option
        There's not much need to delete documents from the index when rebuilding it over gigs of data takes only a few seconds. The UTF8 thing has no easy solution though.

        UPDATE: Maybe you'd be interested in KinoSearch.

Re: fastest searchengine for perl
by derby (Abbot) on Aug 08, 2006 at 14:06 UTC

    At the moment I'm playing with the idea of calling the Java Lucene with Java::Import but as my Java knowledge is absolutely zero it would be quite difficult to write the wrapper classes and compile the stuff. Has anyone does something like this yet?

    Kinda ... doing Lucene stuff (or any Java) within perl is going to give you a run-time penalty - which may or may not be a problem for you. A common solution is to implement a Lucene Search service (via tomcat) and then in your perl, use LWP to query the service.

    -derby
Re: fastest searchengine for perl
by LTjake (Prior) on Aug 09, 2006 at 01:28 UTC

    You can try as derby says and use lucene as a web service -- specifically, try our Lucene Web Service servlet.

    Check the repository for a perl client. We're using it in production with great success.

    --
    "Go up to the next female stranger you see and tell her that her "body is a wonderland."
    My hypothesis is that she’ll be too busy laughing at you to even bother slapping you.
    " (src)

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: perlquestion [id://566150]
Approved by marto
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others wandering the Monastery: (6)
As of 2024-04-18 08:52 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found