RE: NLucene up to date ?
It would be great to see NLucene continue. I've been using 1.2b2 and have a few bug fixes (some critical) which I'd be delighted to contribute. Alas, the SourceForge project hasn't shown a lot of vitality of late, but the authors have suggested a new version is coming RSN...

I know there has been considerable debate about whether to keep with the translation to C# or just to re-host the Java Lucene 1.3 using VS.NET's Java compiler...

-Eric

-----Original Message-----
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]
Sent: Thursday, July 31, 2003 8:43 AM
To: [EMAIL PROTECTED]
Subject: NLucene up to date ?

Hi all,

http://sourceforge.net/projects/nlucene/ has a version numbered 1.2b2. Does anyone know if this source is still being maintained to be closer to the Java developments? Was this an external project to Apache Jakarta?

I (we) have just successfully released a search engine using a C# implementation of Lucene. Code had to be brought up to date in line with recent Java builds, and enhanced with additional features (e.g. field sorting, term position score factoring, etc).

Any other C# users who would like to see NLucene kept in line with the Java version? Maybe I'm just being lazy with having to maintain my own version of Lucene =). Surely there are others out there who are C# users and follow the mailing lists (I remember a Brian somewhere!) but seldom post.

Brendon

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
Re: NLucene up to date ?
Yes, given the lack of updates to the C# version, I thought users would be maintaining their own versions in line with current developments. I too had to add those items you mentioned.

What I would like to see is all these 'implementations' consolidated and maintained regularly, as the Java version is. I am not sure how widely known Lucene is in the .NET community; my guess is it isn't. A tried and tested Lucene .NET version will definitely help it reach other audiences.

I am pleased to hear Pasha is (re)taking up the reins.

Brendon

[EMAIL PROTECTED] wrote:
> I talked to one of the maintainers of NLucene and he said that he was planning on releasing a 1.2 version (not beta apparently) in two months. That was back in June and I haven't heard or seen anything since then, so I can't really say if it is still being actively developed.
>
> Sounds like you are doing the same thing I am doing, which is adding functionality that you need on your own. I've also added a few things to NLucene like multifield queries and the default boolean operator setting.
>
> Brian
RE: NLucene up to date ? Lucene.Net is up to date.
Excellent news. Will you be keeping the source up to date with the Java developments? Can't wait to get my hands on the source. Yes, that damn bit shift operator (unsigned?) always worried me =)

Just by the way, would the .NET version have a similar style sandbox area where users can submit small add-on type functionality? For example, field sorting. I would love to share the code for use and comment, as this seems to be a common request.

Big thanks Pasha,
Brendon

[EMAIL PROTECTED] wrote:
> Hi,
>
> > I talked to one of the maintainers of NLucene and he said that he was planning on releasing a 1.2 version (not beta apparently) in two months. That was back in June and I haven't heard or seen anything since then, so I can't really say if it is still being actively developed. Sounds like you are doing the same thing I am doing, which is adding functionality that you need on your own. I've also added a few things to NLucene like multifield queries and the default boolean operator setting.
>
> By the way, I hope that Lucene.Net 1.3rc1 will be available from http://sourceforge.net/ this week. Lucene.Net is ready, but SourceForge is not :)
>
> Lucene.Net is a complete, up-to-date port of Lucene 1.3rc1 and includes samples and demos (a web demo too). A few differences between nLucene and Lucene.Net:
>
> 1. Version of Lucene: Lucene.Net is 1.3rc1; nLucene is 1.2.
> 2. Java code compatibility: Lucene.Net only changes the naming notation (like IndexWriter); nLucene implements some methods as attributes, and others.
> 3. Demos: Lucene.Net contains all of the Lucene demos and tests, including the web demos. nLucene does not.
> 4. .NET Framework 1.1 and VS 2003 compatible.
> 5. (For internal developers only): correct implementation of java operator :)
>
> Pasha
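Editorial aside on the "bit shift operator (unsigned?)" remark: Brendon's comment suggests that the operator in question is Java's unsigned right shift, `>>>`, which C# did not have at the time. That reading is an assumption, since the operator itself is not named in the quoted message. The minimal Java sketch below shows why it matters in a port. It uses a 7-bits-per-byte variable-length int encoding of the kind Lucene uses for VInts; the class and method names are illustrative and are not taken from either project.

```java
import java.io.ByteArrayOutputStream;

// Illustrative only: names are hypothetical, not from Lucene, NLucene or Lucene.Net.
class UnsignedShiftDemo {

    // Encode an int as a variable-length byte sequence: 7 payload bits per byte,
    // with the high bit set on every byte except the last one.
    static byte[] writeVInt(int i) {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        while ((i & ~0x7F) != 0) {
            out.write((i & 0x7F) | 0x80);
            // Unsigned shift zero-fills from the left, so the loop also
            // terminates for negative values. A signed shift (>>) would
            // keep refilling with 1-bits and never reach zero.
            i >>>= 7;
        }
        out.write(i);
        return out.toByteArray();
    }

    public static void main(String[] args) {
        int n = -8;
        System.out.println(n >> 1);   // signed shift keeps the sign bit: -4
        System.out.println(n >>> 1);  // unsigned shift zero-fills: 2147483644
        System.out.println(writeVInt(300).length); // 2
        System.out.println(writeVInt(-1).length);  // 5
    }
}
```

In a C# port of that era, the usual workaround would be to cast to `uint` before shifting and cast back, which is easy to get subtly wrong when translating by hand.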
Re: NLucene up to date ?
I talked to one of the maintainers of NLucene and he said that he was planning on releasing a 1.2 version (not beta apparently) in two months. That was back in June and I haven't heard or seen anything since then, so I can't really say if it is still being actively developed.

Sounds like you are doing the same thing I am doing, which is adding functionality that you need on your own. I've also added a few things to NLucene like multifield queries and the default boolean operator setting.

Brian
RE: NLucene up to date ?
No additional classes have been created. The functionality was simply implemented via new properties and method overloading, so the original signatures remain intact.

As far as supporting future versions, I cannot say, as I will no longer be using it at work. Keeping the C# version in line with Java would have to be done in my own time, so no guarantees.

Taking the 1.2b2 source, I only brought in the fixes, enhancements, etc. that affected how I was using Lucene. I keep up with the nightly builds on a regular basis and update the C# source where appropriate, so any bugs should have been rectified.

Brendon

[EMAIL PROTECTED] wrote:
> Hi,
>
> > I (we) have just successfully released a search engine using a C# implementation of Lucene. Code had to be brought up to date in line with recent Java builds, and enhanced with additional features (e.g. field sorting, term position score factoring, etc).
>
> Are the additions hard-coded, or are they new classes? Are you going to support new versions of Lucene?
>
> Pasha
>
> P.S. nLucene is based on Lucene 1.2, with old bugs, and is not supported.
NLucene up to date ?
Hi all,

http://sourceforge.net/projects/nlucene/ has a version numbered 1.2b2. Does anyone know if this source is still being maintained to be closer to the Java developments? Was this an external project to Apache Jakarta?

I (we) have just successfully released a search engine using a C# implementation of Lucene. Code had to be brought up to date in line with recent Java builds, and enhanced with additional features (e.g. field sorting, term position score factoring, etc).

Any other C# users who would like to see NLucene kept in line with the Java version? Maybe I'm just being lazy with having to maintain my own version of Lucene =). Surely there are others out there who are C# users and follow the mailing lists (I remember a Brian somewhere!) but seldom post.

Brendon
Re: NLucene up to date ?
I'd love to see quality implementations of the Lucene API in other languages that are up to date with the latest Java codebase. I'm embarking on a Ruby port, which I'm hosting at rubyforge.org. There is a Python version called Lupy.

A related question I have: what about performance comparisons between the different language implementations? Will Java be the fastest? Is there a test suite already available that can demonstrate the performance characteristics of a particular implementation? I'd love to see the numbers and see if even the Java version can be beat.

Erik
Re: NLucene up to date ?
Do these implementations maintain file compatibility with the Java version?

Scott

Erik Hatcher wrote:
> I'd love to see quality implementations of the Lucene API in other languages that are up to date with the latest Java codebase. I'm embarking on a Ruby port, which I'm hosting at rubyforge.org. There is a Python version called Lupy.
>
> A related question I have: what about performance comparisons between the different language implementations? Will Java be the fastest? Is there a test suite already available that can demonstrate the performance characteristics of a particular implementation? I'd love to see the numbers and see if even the Java version can be beat.
>
> Erik
Re: NLucene up to date ?
Replies to Erik and Scott inline.

[EMAIL PROTECTED] wrote:
> Do these implementations maintain file compatibility with the Java version?
>
> Scott

Yes and no; some explanation will help. The field ordering functionality requires additional files to be created at index time if the Document.Field property indicates so. At search time, the entire contents of the 'field sorting' files are read in. As the IndexReader is shared across all client calls (for a pre-defined period of time, as the index has been implemented 'incremental' style), this cost is only incurred once. Code-wise, the technique follows the pattern used for writing and reading the normalisation bytes, the difference being that an int is written instead. Yes, there is a memory usage hit, but the performance and functionality offered offset this.

All other file formats remain identical. I have coded LuceNET (!) so that it gracefully continues if the index segments do not have these additional 'sorting' files (whose naming convention is like that of the normalisation files).

Erik Hatcher wrote:
> I'd love to see quality implementations of the Lucene API in other languages that are up to date with the latest Java codebase. I'm embarking on a Ruby port, which I'm hosting at rubyforge.org. There is a Python version called Lupy.
>
> A related question I have: what about performance comparisons between the different language implementations? Will Java be the fastest? Is there a test suite already available that can demonstrate the performance characteristics of a particular implementation? I'd love to see the numbers and see if even the Java version can be beat.
>
> Erik

Performance-wise, queries typically run in hundredths of a second. Including term position in the scoring impacted the timings, as expected. Indexing takes time, but then this wasn't really part of the design goals. As for comparing to the Java implementation in terms of performance, I haven't tried, as this workplace is an MS shop. Java vs C# all over? Just kidding =)
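Editorial aside on the 'field sorting' files described in the reply above: the following is a minimal Java sketch of the general idea only (one int per document written at index time, the whole file read into memory once at search time, and a missing file tolerated). It is not Brendon's code and does not reflect the real Lucene or NLucene file formats. The class name, method names and the `.s0` file extension are all assumptions made for illustration.

```java
import java.io.BufferedInputStream;
import java.io.BufferedOutputStream;
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;

// Hypothetical helper: not part of Lucene, NLucene or Lucene.Net.
class SortValueFile {

    // Index time: write one int per document, in document-number order.
    static void write(Path file, int[] valuesByDoc) throws IOException {
        try (DataOutputStream out = new DataOutputStream(
                new BufferedOutputStream(Files.newOutputStream(file)))) {
            for (int v : valuesByDoc) {
                out.writeInt(v);
            }
        }
    }

    // Search time: read the whole file into memory once. Returns null when
    // the segment has no sort file, so callers can carry on without sorting.
    static int[] read(Path file, int maxDoc) throws IOException {
        if (!Files.exists(file)) {
            return null;
        }
        int[] values = new int[maxDoc];
        try (DataInputStream in = new DataInputStream(
                new BufferedInputStream(Files.newInputStream(file)))) {
            for (int doc = 0; doc < maxDoc; doc++) {
                values[doc] = in.readInt();
            }
        }
        return values;
    }

    public static void main(String[] args) throws IOException {
        Path dir = Files.createTempDirectory("sortdemo");
        Path file = dir.resolve("_1.s0"); // assumed naming, for illustration only
        write(file, new int[] {30, 10, 20});

        int[] loaded = read(file, 3);
        System.out.println(loaded[1]); // sort value for document 1

        // A segment without a sort file is handled gracefully.
        System.out.println(read(dir.resolve("_2.s0"), 3) == null);
    }
}
```

The memory cost mentioned in the reply follows directly from this layout: four bytes per document per sortable field, held for as long as the shared reader stays open.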
Re: NLucene up to date ?
On Thursday, July 31, 2003, at 11:22 AM, Scott Ganyo wrote:
> Do these implementations maintain file compatibility with the Java version?

Lupy claims to. I don't know about NLucene, but it should. Any port of Lucene to another language should (out of respect and common sense) maintain index file compatibility. Any implementation that doesn't, well, we just won't list it on the resources page :)

Erik