Anyone fancy taking a look at this in the context of OpenGuides to see
whether it holds the same benefits for us?
Cheers,
Dominic.
----- Forwarded message from Michael Chaney <mdchaney(a)michaelchaney.com> -----
X-Spam-Status: No, score=-2.5 required=5.0 tests=BAYES_00,FORGED_RCVD_HELO
autolearn=ham version=3.1.3
Date: Thu, 31 Aug 2006 20:54:08 -0500
From: Michael Chaney <mdchaney(a)michaelchaney.com>
To: plucene(a)kasei.com
Subject: Re: [Plucene] New CPAN module Lucene-0.01
Organization: Michael Chaney Consulting Corporation
User-Agent: Mutt/1.5.9i
On Wed, Aug 09, 2006 at 06:41:22PM +0200, Thomas Busch wrote:
Hi all,
I just released the CPAN module Lucene-0.01 which is a perl
API to the C++ port of the Lucene search engine.
Have a look:
http://search.cpan.org/~tbusch/Lucene-0.01/lib/Lucene.pm
I wanted to do a little writeup on this when I had a chance.
Unfortunately, the machine that I set it up on is down so I can't
include code just yet, but let me say this: if you are using Plucene,
run, and I mean run, to this.
I have some pretty complex code to do indexing and searching. I spent
about two hours total recoding it to work with this Lucene version. The
speed difference cannot be overstated. Building an index went from
taking 5.5 hours to 13 minutes (this is about 330,000 items). That's a
speed difference of about 25 times. For lookups, it usually takes about
.1 second, as opposed to 4-5 seconds. This is an even larger speedup in
proportionality.
Changing it boiled down to renaming a couple of things. Obviously,
Plucene -> Lucene. Beyond that, index::writer is index::indexwriter or
something like that. The changes were simple.
Missing functionality that I would like: I had something that would
read the keys from a Plucene index, and it's not there. I can work
around it, probably another 30 minutes of work. The weighted-query
parser is missing, too, but I could probably hack that in given time.
It's not present on the CLucene base, so maybe it'll be added there.
Many thanks to Thomas Busch for this, as well as the CLucene authors.
This thing is extremely fast and very reliable. It pretty much worked
the first time for me. Give it a try if you're using Plucene; you will
be glad you did.
Michael
--
Michael Darrin Chaney
mdchaney(a)michaelchaney.com
http://www.michaelchaney.com/
_______________________________________________
Plucene mailing list
Plucene(a)kasei.com
http://www.kasei.com/mailman/listinfo/plucene
----- End forwarded message -----
--
Dominic Hargreaves |
http://www.larted.org.uk/~dom/
PGP key 5178E2A5 from the.earth.li (keyserver,web,email)