Dear CLuceners

It's been a very long time since I've done anything on CLucene, and I
hope I'm not stepping on anyone's toes if I jump in here now. I just
wanted to keep you up with a few things I've been doing lately.

1. I've done some work on CLucene and released it as a stable version.
It's basically the master branch that everyone has been working on,
plus a whole lot of patches:
    * lots of platform fixes.
      - Running on suse's compile farm:
https://build.opensuse.org/package/show?package=clucene-core&project=home%3Austramooner
      - Packaged on Ubuntu's compile farm:
https://launchpad.net/~ustramooner/+archive/ppa/+packages
    * compiles with clang++
    * fixed various deadlocks and memory leaks
    * lots of pedantic warning fixes

   Please send me comments on how that branch is for you. I'm fairly
confident in so far as that the tests pass on a whole lot of platforms
which were failing earlier. But still... :)

2. I've been working on a new version of CLucene with Alan Wright.
This new version is called Lucene++, is compatible with Java Lucene
3.0.3 and is heavily built with boost (full use of shared_ptr, and a
lot of other areas), and thus avoids many of the problems that users
have with memory handling and bugs relating to threading, etc. For a
number of reasons, I've decided not to try and merge the two projects.
For one, I think that CLucene still has a place (in fact I'm also
doing some work on CLucene - read below): namely for projects who
don't want or can't handle the overhead that Lucene++'s heavy use of
boost creates - such as on mobiles.
   - see http://www.github.com/ustramooner/LucenePlusPlus
1a. I've created python bindings around Lucene++
   - see http://www.github.com/ustramooner/python-lucenepp

4. I've created some bindings around Strigi (a desktop search engine
on kde). I've wrapped the libstreams aspect of the engine, which
exposes meta data extraction in a stream way (i.e. you can extract
meta data from files deep in an archive without extracting each file.

I'm hoping that between these two bindings (python-lucene++ and
python-streamanalyzer) a really innovative python based desktop search
engine can be created. I believe that 90% of the hard work and
performance of a desktop search engine is tied up in the areas that
these libraries do best. I really believe desktop search has become
stale and we REALLY lack something good still (I'm still using grep
and locate!!!!). Perhaps these libraries will allow someone who isn't
caught up in the nitty grity of implementing meta extraction and
indexing to come in and blow us all away...
   - see http://www.github.com/ustramooner/python-streamanalyzer


Anyway, that's been my busy week, and no I'm off to Moreton Island to
do some snorkelling :)

Cheers
Ben

------------------------------------------------------------------------------
Colocation vs. Managed Hosting
A question and answer guide to determining the best fit
for your organization - today and in the future.
http://p.sf.net/sfu/internap-sfd2d
_______________________________________________
CLucene-developers mailing list
CLucene-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/clucene-developers

Reply via email to