question on file formats

2006-01-11 Thread Aditya Liviandi
For .frq In the fileformats documentation online it has the example > For example, the TermFreqs for a term which occurs once in document seven and three times in document eleven would be the following sequence of VInts: 15,22,3 (all decimals) > shouldn't it be 15,8,3 (all

Re: Question about FieldInfos

2006-01-11 Thread Yonik Seeley
On 1/10/06, Robert Kirchgessner <[EMAIL PROTECTED]> wrote: > Is there some reason not to store all field attributes in one place (*.fnm) ? [...] > Some of them are stored as a one byte-bit mask > in the field infos file (*.fnm), [...] > Other attributes are stored as a bit mask in the data file (*.

[jira] Commented: (LUCENE-486) Core Test should not have dependencies on the Demo code

2006-01-11 Thread Grant Ingersoll (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-486?page=comments#action_12362488 ] Grant Ingersoll commented on LUCENE-486: >The more stuff that's regularly compiled & tested, the better. Couldn't agree more. >We don't want the demo or contrib code

Re: [jira] Created: (LUCENE-487) Database as a lucene index target

2006-01-11 Thread Amir Kibbar
Robert, It seems that way. It's just that I tried to find something that does that and just couldn't, so I've decided to write something myself. I will certainly take a look now at compass framework's solution. Amir On 1/11/06, Robert Engels <[EMAIL PROTECTED]> wrote: > > You are better off usi

RE: [jira] Created: (LUCENE-487) Database as a lucene index target

2006-01-11 Thread Robert Engels
You are better off using the one that has already been contributed then. It uses JDBC and breaks the file into blocks. Much more efficient. Sorry to say but your solution/code is inferior to what already exists. -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] Behalf Of

Re: [jira] Created: (LUCENE-487) Database as a lucene index target

2006-01-11 Thread Amir Kibbar
Robert, My solution is the latter. If it is possible to do it using blob seek, I will attempt to do it next. Amir On 1/11/06, Robert Engels <[EMAIL PROTECTED]> wrote: > > Since no code has been posted, I'll just ask the question... > > Does your implementation use the Blob "seek" functions when

[jira] Updated: (LUCENE-487) Database as a lucene index target

2006-01-11 Thread Amir Kibbar (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-487?page=all ] Amir Kibbar updated LUCENE-487: --- Attachment: files.zip > Database as a lucene index target > - > > Key: LUCENE-487 > URL: http://issues.apache.or

[jira] Commented: (LUCENE-486) Core Test should not have dependencies on the Demo code

2006-01-11 Thread Doug Cutting (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-486?page=comments#action_12362484 ] Doug Cutting commented on LUCENE-486: - The more stuff that's regularly compiled & tested, the better. We don't want the demo or contrib code to become neglected. Removin

Re: Analyzers, perfect hash, ICU

2006-01-11 Thread Ken Krugler
Hi all, I'm working on the analyzer for the slovanic latin languages (cs,sk) w/o stemming at first. I would like to ask you: The StopWord analyzer uses often HashSet implementation, but the the Stopwords are not changed often (if ever) from shipped in the java code. Do you think that is the

RE: [jira] Created: (LUCENE-487) Database as a lucene index target

2006-01-11 Thread Robert Engels
Since no code has been posted, I'll just ask the question... Does your implementation use the Blob "seek" functions when reading and writing, or does it read/write the blob in its entirety. If it is the latter, your solution is only acceptable for the smallest of Lucene indexes. If it is the fo

[jira] Commented: (LUCENE-486) Core Test should not have dependencies on the Demo code

2006-01-11 Thread Grant Ingersoll (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-486?page=comments#action_12362483 ] Grant Ingersoll commented on LUCENE-486: Doug, The intellij thing is just a symptom. I think the bigger issues are the consequences of the demo being changed and tha

[jira] Created: (LUCENE-487) Database as a lucene index target

2006-01-11 Thread Amir Kibbar (JIRA)
Database as a lucene index target - Key: LUCENE-487 URL: http://issues.apache.org/jira/browse/LUCENE-487 Project: Lucene - Java Type: New Feature Components: Store Versions: 1.9 Environment: MySql (version 4.1 an up), Orac

Re: A Database as a Lucene Index Target

2006-01-11 Thread Nicolas Belisle
Hi, I didn't receive the attachment. Maybe you can contribute your files to JIRA : http://issues.apache.org/jira/browse/LUCENE Have you checked the compass framework: http://www.compassframework.org ? They also developped a JDBC Directory implementation: http://static.compassframework.org/doc

[jira] Commented: (LUCENE-486) Core Test should not have dependencies on the Demo code

2006-01-11 Thread Doug Cutting (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-486?page=comments#action_12362472 ] Doug Cutting commented on LUCENE-486: - Grant, the test code is not in the "main" source tree, it's in the "test" source tree, which depends on "core" and "demo". I still

Re: [jira] Commented: (LUCENE-486) Core Test should not have dependencies on the Demo code

2006-01-11 Thread DM Smith
I have a principle that I code by: The Principle of Least Surprise - Write code in such a way that it minimize surprises. It is surprising to me that test cases would have a dependency on demo code. IMHO, the dependency should be removed. Yesterday I installed lucene from jpackage using y

Analyzers, perfect hash, ICU

2006-01-11 Thread Karel Tejnora
Hi all, I'm working on the analyzer for the slovanic latin languages (cs,sk) w/o stemming at first. I would like to ask you: The StopWord analyzer uses often HashSet implementation, but the the Stopwords are not changed often (if ever) from shipped in the java code. Do you think that is the

RE: A Database as a Lucene Index Target

2006-01-11 Thread David Freireich
Amir: I'm interested in the type of work you are doing with Lucene. I own an extremely focused talent agency, Core Search Group. We represent the top infrastructure software engineers in the world to companies that must have the best talent possible to realize their goals. Most of our cli

A Database as a Lucene Index Target

2006-01-11 Thread Amir Kibbar
Hi,I hope that this mailing list is the right place for things like that, if not I apologize in advance.I've written an extension for the Directory object called DBDirectory, that allows you to read and write a Lucene index to a database instead of a file system. This is done using blobs. Each blob

[jira] Commented: (LUCENE-486) Core Test should not have dependencies on the Demo code

2006-01-11 Thread Erik Hatcher (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-486?page=comments#action_12362429 ] Erik Hatcher commented on LUCENE-486: - I concur with Grant on this - the dependency from test to demo has caused me annoyance as well. I'm in favor of a fix to it, but h

Re: BooleanQuery: static setMaxClauseCount(int)?

2006-01-11 Thread Paul Elschot
On Wednesday 11 January 2006 03:54, Chris Hostetter wrote: > > I thought the purpose of this method was for applications to specify the > largest possible BooleanQuery that could be created in their application > (either progromaticaly, via QueryParser, or as a result of rewriting a > non-primativ