Re: Re: Optimizing indexes with multiple processors?

2005-06-10 Thread Ben van Klinken
This raises an interesting point, and it's an issue that I think I dealt with in CLucene. I modified the way the CLucene tokenstream works, with some large performance increases. I changed the tokenstream interface from Token next(); to boolean next(Token t); then the document writ
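
A minimal C++ sketch of the interface change described in this message, assuming the point of `next(Token t)` is that the caller owns a single Token which the stream refills on every call. The snippet above is cut off before it says so, so treat that as a guess. `Token`, `WhitespaceTokenStream` and `countTokens` are hypothetical names for illustration, not CLucene's actual classes, and the Token is passed by reference here because the original signature does not show how CLucene passes it.

```cpp
#include <cstddef>
#include <string>
#include <utility>

// Hypothetical token type: the text plus its offsets in the input.
struct Token {
    std::string text;
    std::size_t start = 0;
    std::size_t end = 0;
};

// Hypothetical stream that splits on single spaces. Instead of
// returning a new Token per call (Token next()), it fills a Token
// supplied by the caller and reports whether one was produced.
class WhitespaceTokenStream {
public:
    explicit WhitespaceTokenStream(std::string input)
        : input_(std::move(input)) {}

    bool next(Token& t) {
        // Skip any run of spaces before the next token.
        while (pos_ < input_.size() && input_[pos_] == ' ') ++pos_;
        if (pos_ >= input_.size()) return false;  // end of input

        const std::size_t start = pos_;
        while (pos_ < input_.size() && input_[pos_] != ' ') ++pos_;

        // Overwrite the caller's Token; its string buffer can be reused.
        t.text.assign(input_, start, pos_ - start);
        t.start = start;
        t.end = pos_;
        return true;
    }

private:
    std::string input_;
    std::size_t pos_ = 0;
};

// The consumer allocates one Token and reuses it for the whole stream.
std::size_t countTokens(WhitespaceTokenStream& ts) {
    Token t;
    std::size_t n = 0;
    while (ts.next(t)) ++n;
    return n;
}
```

With this shape the consuming loop creates no Token per call, which is one plausible source of the speedup mentioned above.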

CLucene - ASF

2005-06-11 Thread Ben van Klinken
Hi, I have been talking with Otis about bringing CLucene into the ASF incubation program. We need a sponsor and mentor for this transition, and we would like to ask whether anybody is interested. CLucene is a port of the Lucene search engine to C++. Most of the Lucene 1.4.3 features are n

Re: new spanish analyzer

2006-01-10 Thread Ben van Klinken
Also, the snowball stemmer is available in the contributions of clucene. ben On 1/10/06, Steven Rowe <[EMAIL PROTECTED]> wrote: > Hola José, > > Did you know that Java Lucene already has a contributed Snowball-based > stemmer/analyzer, very similar to yours? > > http://svn.apache.org/viewcvs.cgi/

VInt's as prefix. Was: bytecount as prefix

2006-05-11 Thread Ben van Klinken
Hi, I'm the author of CLucene (a C++ port of Lucene). I've been following the 'using byte count as prefix' discussion and I think this discussion sort of ties into something we are trying to achieve. We are trying to optimise the way the index writing works, and we also want to be able to index

Re: VInt's as prefix. Was: bytecount as prefix

2006-05-11 Thread Ben van Klinken
What we really need is the ability to add "leading zeroes" to a VInt. I really like this idea! A VInt can then be written with a static length. Then in clucene we can implement our stream optimisations without any changes to the code logic. What's the chance of this making it into Lucene 2.0? L
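
A sketch of what a VInt with "leading zeroes" could look like, assuming the usual Lucene VInt layout: seven payload bits per byte, low-order group first, high bit set on every byte except the last. `writePaddedVInt` and `readVInt` are hypothetical helpers written for this illustration, not Lucene or CLucene functions. Because the low-order group comes first, the zero groups are leading in the numeric sense but sit at the end of the byte sequence.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helper: append `value` as a VInt that occupies exactly
// `width` bytes (1..5). Returns false, writing nothing, if the value
// does not fit in that many 7-bit groups.
bool writePaddedVInt(std::vector<std::uint8_t>& out,
                     std::uint32_t value, std::size_t width) {
    if (width == 0 || width > 5) return false;
    if (width < 5 && (value >> (7 * width)) != 0) return false;

    // Every byte but the last carries the continuation bit, even when
    // its payload is zero. Those zero groups are the padding.
    for (std::size_t i = 0; i + 1 < width; ++i) {
        out.push_back(static_cast<std::uint8_t>((value & 0x7F) | 0x80));
        value >>= 7;
    }
    out.push_back(static_cast<std::uint8_t>(value & 0x7F));
    return true;
}

// Plain VInt reader with no knowledge of padding. It assumes `in`
// holds a complete VInt starting at `pos` and advances `pos` past it.
std::uint32_t readVInt(const std::vector<std::uint8_t>& in,
                       std::size_t& pos) {
    std::uint8_t b = in[pos++];
    std::uint32_t value = b & 0x7F;
    for (int shift = 7; (b & 0x80) != 0; shift += 7) {
        b = in[pos++];
        value |= static_cast<std::uint32_t>(b & 0x7F) << shift;
    }
    return value;
}
```

For example, `writePaddedVInt(buf, 5, 3)` appends the bytes 0x85 0x80 0x00, and the unmodified reader still decodes them as 5. A writer could therefore reserve a fixed-width slot and fill in the real value later without the read side changing.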

Re: VInt's as prefix. Was: bytecount as prefix

2006-05-11 Thread Ben van Klinken
ncompatible, it would be a great help for us. ben On 5/11/06, Doug Cutting <[EMAIL PROTECTED]> wrote: Ben van Klinken wrote: > What's the chance of this making it into Lucene 2.0? Let me know if > there's anything i can do to get this into Lucene 2. Lucene 2.0 is all but out

Re: CLucene incubation - call for a mentor

2006-10-15 Thread Ben van Klinken
Hi, Lucy is certainly an interesting project. There is a great advantage of having a C language lucene which doesn't conform as closely to the java lucene api (as CLucene does). I imagine there are many optimisations available. But CLucene does offer something to developers, among other things: a