Re: mg4j - Managing Gigabyte for Java

2004-09-16 Thread Otis Gospodnetic
Hi Anson, It's not quite correct to comparing MG4J and Lucene directly. Lucene is a toolkit whose primary goal is to let you create an index and search it, while MG4J is really a library of Java classes that people implementing an IR library (such as Lucene, for example) may find useful. You can

RE: mg4j - Managing Gigabyte for Java

2004-09-16 Thread Robert Engels
elopers List Subject: Re: mg4j - Managing Gigabyte for Java Antonio Gulli wrote: > Just a question: my personal experience with a commercial engine i > partly developed is the the "continuation bit" (aka altavista solution) > is a good and efficient solution w.r.t gamma code

Re: mg4j - Managing Gigabyte for Java

2004-09-16 Thread Doug Cutting
Antonio Gulli wrote: Just a question: my personal experience with a commercial engine i partly developed is the the "continuation bit" (aka altavista solution) is a good and efficient solution w.r.t gamma code, delta code and other codes used for variable lenght int rappresentation (see MG). G

Re: mg4j - Managing Gigabyte for Java

2004-09-16 Thread Antonio Gulli
David Spencer wrote: Anson Lau wrote: Hi All, Has anyone seen the project MG4J (Managing Gigabyte for Java) http://mg4j.dsi.unimi.it/ ? Anybody knows enough about both Lucene and MG4J to comment on how the two compares? I've wondered if Lucene does comparable (key/index) compression to what the

Re: mg4j - Managing Gigabyte for Java

2004-09-16 Thread David Spencer
Anson Lau wrote: Hi All, Has anyone seen the project MG4J (Managing Gigabyte for Java) http://mg4j.dsi.unimi.it/ ? Anybody knows enough about both Lucene and MG4J to comment on how the two compares? I've wondered if Lucene does comparable (key/index) compression to what the related book (Managing