Beefy Boxes and Bandwidth Generously Provided by pair Networks
Pathologically Eclectic Rubbish Lister
 
PerlMonks  

Re: Re: Re: Complicated searches in a very large text file.

by ant9000 (Monk)
on Jun 30, 2003 at 17:42 UTC ( [id://270236]=note: print w/replies, xml ) Need Help??


in reply to Re: Re: Complicated searches in a very large text file.
in thread Complicated searches in a very large text file.

Tons of people already pointed out Swish-E: that's the way I'd go myself, considering it already does all the searches you are looking for (and more) and that you can access its index files directly from Perl, via its Perl interface.
  • Comment on Re: Re: Re: Complicated searches in a very large text file.

Replies are listed 'Best First'.
Re: Re: Re: Re: Complicated searches in a very large text file.
by waswas-fng (Curate) on Jun 30, 2003 at 18:45 UTC
    From the Swish docs,
    The wildcard (*) is available, however it can only be used at the end +of a word: otherwise is is considerd a normal character (i.e. can be +searched for if included in the WordCharacters directive). swish-e -w "librarian" -f myIndex this query only retrieves files which contain the given word. On the other hand: swish-e -w "librarian*" -f myIndex retrieves ``librarians'', ``librarianship'', etc. along with ``librari +an''.
    So how would you go about having a search string such as "test and apple" be able to match on a document conatining "an apple is good to eat when they come from the grabaltester reigon of madeupcountry."

    -Waswas

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://270236]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others chanting in the Monastery: (5)
As of 2024-04-24 19:58 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found