Hi all,
I've just ran a comparison between Apache Lucene and CLucene.
My machine is Intel Pentium Dual Core, 1Gb ram, OS is Slackware 13.0.
My dataset was only one text file with 200,000 lines, each line has 
format of
"[id] <Tab> [content]"
content is a string which has about 250-300 unicode words.

The result is quite strange, since CLucene is much slower than Apache 
Lucene:
CLucene took 617s, and index size is 566mb while Apache Lucene took only 
122s and generate 125mb of index data.

Anybody did the same comparison as I did and have a similar result ? I 
expected CLucene to be much faster than Apache Lucene anyway.

PS:
I paste here the indexing code, it's quite simple and straight forward:
Apache Lucene: http://paste.pocoo.org/show/198146/
CLucene: http://paste.pocoo.org/show/198147/


------------------------------------------------------------------------------
Download Intel&#174; Parallel Studio Eval
Try the new software tools for yourself. Speed compiling, find bugs
proactively, and fine-tune applications for parallel performance.
See why Intel Parallel Studio got high marks during beta.
http://p.sf.net/sfu/intel-sw-dev
_______________________________________________
CLucene-developers mailing list
CLucene-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/clucene-developers

Reply via email to