[jira] Updated: (LUCENE-1257) Port to Java5

2009-10-16 Thread Kay Kay (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1257?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Kay updated LUCENE-1257: Attachment: LUCENE-1257_BooleanFilter_Generics.patch * BooleanFilter ( internal data structures conformed

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2009-10-16 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12766832#action_12766832 ] Mark Miller commented on LUCENE-1458: - Almost got an initial rough stab at the sep cod

[jira] Updated: (LUCENE-1987) Remove rest of analysis deprecations (Token, CharacterCache)

2009-10-16 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1987?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-1987: -- Attachment: LUCENE-1987.patch Patch with the first three points. The three deprecated methods

[jira] Created: (LUCENE-1987) Remove rest of analysis deprecations (Token, CharacterCache)

2009-10-16 Thread Uwe Schindler (JIRA)
Remove rest of analysis deprecations (Token, CharacterCache) Key: LUCENE-1987 URL: https://issues.apache.org/jira/browse/LUCENE-1987 Project: Lucene - Java Issue Type: Task

[jira] Resolved: (LUCENE-1976) isCurrent() and getVersion() on an NRT reader are broken

2009-10-16 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1976?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved LUCENE-1976. Resolution: Fixed Fix Version/s: (was: 3.1) 3.0

Re: JDBC access to a Lucene index

2009-10-16 Thread Grant Ingersoll
I'm not aware of any, but you might get more mileage asking on java-user. On Oct 16, 2009, at 3:54 AM, Jukka Zitting wrote: Hi, Some while ago I implemented a simple JDBC to JCR bridge [1] that allows one to query a JCR repository from any JDBC client, most notably various reporting tools.

[jira] Commented: (LUCENE-1257) Port to Java5

2009-10-16 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1257?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12766657#action_12766657 ] Uwe Schindler commented on LUCENE-1257: --- Committed revision: 826035 > Port to Java5

[jira] Commented: (LUCENE-1976) isCurrent() and getVersion() on an NRT reader are broken

2009-10-16 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1976?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12766654#action_12766654 ] Michael McCandless commented on LUCENE-1976: I plan to back-port this to 2.9.x

[jira] Commented: (LUCENE-1985) DisjunctionMaxQuery - Iterator code to for ( A a : container ) construct

2009-10-16 Thread Kay Kay (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1985?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12766652#action_12766652 ] Kay Kay commented on LUCENE-1985: - Thanks Uwe. Added another patch to LUCENE-1257 to get

[jira] Updated: (LUCENE-1257) Port to Java5

2009-10-16 Thread Kay Kay (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1257?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Kay updated LUCENE-1257: Attachment: LUCENE-1257-DisjunctionMaxQuery-more_type_safety.patch * DisjunctionMaxQuery.java - some of th

[jira] Resolved: (LUCENE-1124) short circuit FuzzyQuery.rewrite when input token length is small compared to minSimilarity

2009-10-16 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved LUCENE-1124. Resolution: Fixed > short circuit FuzzyQuery.rewrite when input token length is sm

RE: ant build-contrib fails on trunk?

2009-10-16 Thread Uwe Schindler
Fixed. What an error in contrib/instantiated, found because of generics. :-) One positive effect of generics and type safety. - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de _ From: Robert Muir [mailto:rcm...@gmail.com] Sent: Fr

Re: ant build-contrib fails on trunk?

2009-10-16 Thread Michael McCandless
OK thanks! Mike On Fri, Oct 16, 2009 at 1:09 PM, Uwe Schindler wrote: > I'll fix, this is because of generics and compareTo(). I revert the change. > > - > Uwe Schindler > H.-H.-Meier-Allee 63, D-28213 Bremen > http://www.thetaphi.de > eMail: u...@thetaphi.de > > >> -Original Message

RE: ant build-contrib fails on trunk?

2009-10-16 Thread Uwe Schindler
It was not the generics change, it was a bug in the comparator. There was one getTerm() missing. I'll add it. The compiler found the error because, with generics, the signature didn't match correctly (in 1.4 it was just Object without a generics hint, now it's Object and Term, but InstantiatedTerm does not
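As a rough illustration of how generics can surface this kind of bug at compile time, here is a minimal Java sketch. The classes Term and TermHolder below are hypothetical stand-ins, not the Lucene or contrib/instantiated classes, and the comparator is not the one from the actual fix.

```java
import java.util.Comparator;

public class ComparatorSketch {
    // Hypothetical stand-in for a term type; NOT org.apache.lucene.index.Term.
    public static final class Term implements Comparable<Term> {
        public final String text;
        public Term(String text) { this.text = text; }
        @Override public int compareTo(Term other) { return text.compareTo(other.text); }
    }

    // Hypothetical wrapper that holds a Term but is not itself a Term.
    public static final class TermHolder {
        private final Term term;
        public TermHolder(Term term) { this.term = term; }
        public Term getTerm() { return term; }
    }

    // Because Term is Comparable<Term>, compareTo only accepts a Term.
    // Forgetting getTerm() on either side is rejected by the compiler,
    // whereas an untyped compareTo(Object) would only fail at run time.
    public static final Comparator<TermHolder> BY_TERM = new Comparator<TermHolder>() {
        @Override public int compare(TermHolder a, TermHolder b) {
            // a.getTerm().compareTo(b) would not compile: b is a TermHolder, not a Term.
            return a.getTerm().compareTo(b.getTerm());
        }
    };
}
```

With raw types, the commented-out variant compiles and throws a ClassCastException only when the comparison actually runs, which is why such a mistake can go unnoticed until the code is generified.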

Re: ant build-contrib fails on trunk?

2009-10-16 Thread Robert Muir
yes, not just you On Fri, Oct 16, 2009 at 1:00 PM, Michael McCandless < luc...@mikemccandless.com> wrote: > When I run ant build-contrib on current trunk, I hit this: > > compile-core: >[javac] Compiling 1 source file to > /lucene/tmp2/build/contrib/instantiated/classes/java >[javac] > /l

RE: ant build-contrib fails on trunk?

2009-10-16 Thread Uwe Schindler
I'll fix, this is because of generics and compareTo(). I revert the change. - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de > -Original Message- > From: Michael McCandless [mailto:luc...@mikemccandless.com] > Sent: Friday, October 1

[jira] Updated: (LUCENE-1124) short circuit FuzzyQuery.rewrite when input token length is small compared to minSimilarity

2009-10-16 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-1124: --- Fix Version/s: (was: 2.9) 3.0 2.9.1 > shor

ant build-contrib fails on trunk?

2009-10-16 Thread Michael McCandless
When I run ant build-contrib on current trunk, I hit this: compile-core: [javac] Compiling 1 source file to /lucene/tmp2/build/contrib/instantiated/classes/java [javac] /lucene/tmp2/contrib/instantiated/src/java/org/apache/lucene/store/instantiated/InstantiatedTermDocumentInformation.java

Re: lucene 2.9 sorting algorithm

2009-10-16 Thread Michael McCandless
Oh, no problem... Mike On Fri, Oct 16, 2009 at 12:33 PM, John Wang wrote: > Mike, just a clarification on my first perf report email. > The first section, numHits is incorrectly labeled, it should be 20 instead > of 50. Sorry about the possible confusion. > Thanks > -John > > On Fri, Oct 16, 200

Re: lucene 2.9 sorting algorithm

2009-10-16 Thread John Wang
Mike, just a clarification on my first perf report email. The first section, numHits is incorrectly labeled, it should be 20 instead of 50. Sorry about the possible confusion. Thanks -John On Fri, Oct 16, 2009 at 3:21 AM, Michael McCandless < luc...@mikemccandless.com> wrote: > Thanks John; I'l

[jira] Resolved: (LUCENE-1985) DisjunctionMaxQuery - Iterator code to for ( A a : container ) construct

2009-10-16 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1985?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler resolved LUCENE-1985. --- Resolution: Fixed Fix Version/s: 3.0 Assignee: Uwe Schindler Committed revis

[jira] Reopened: (LUCENE-1124) short circuit FuzzyQuery.rewrite when input token length is small compared to minSimilarity

2009-10-16 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless reopened LUCENE-1124: This fix breaks the case when the exact term is present in the index. > short circuit

[jira] Updated: (LUCENE-1124) short circuit FuzzyQuery.rewrite when input token length is small compared to minSimilarity

2009-10-16 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-1124: --- Attachment: LUCENE-1124.patch Attach patch (based on 2.9) showing the bug, along wit

[jira] Commented: (LUCENE-1984) DisjunctionMaxQuery - Type safety

2009-10-16 Thread Kay Kay (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12766587#action_12766587 ] Kay Kay commented on LUCENE-1984: - As a related patch - LUCENE-1985 added to improve reada

[jira] Updated: (LUCENE-1986) NPE in NearSpansUnordered from PayloadNearQuery

2009-10-16 Thread Peter Keegan (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1986?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Keegan updated LUCENE-1986: - Attachment: TestPayloadNearQuery1.java Unit test that causes NPE > NPE in NearSpansUnordered fr

[jira] Updated: (LUCENE-1985) DisjunctionMaxQuery - Iterator code to for ( A a : container ) construct

2009-10-16 Thread Kay Kay (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1985?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Kay updated LUCENE-1985: Attachment: LUCENE-1985.patch > DisjunctionMaxQuery - Iterator code to for ( A a : container ) construc

[jira] Created: (LUCENE-1986) NPE in NearSpansUnordered from PayloadNearQuery

2009-10-16 Thread Peter Keegan (JIRA)
NPE in NearSpansUnordered from PayloadNearQuery --- Key: LUCENE-1986 URL: https://issues.apache.org/jira/browse/LUCENE-1986 Project: Lucene - Java Issue Type: Bug Components: Search A

[jira] Created: (LUCENE-1985) DisjunctionMaxQuery - Iterator code to for ( A a : container ) construct

2009-10-16 Thread Kay Kay (JIRA)
DisjunctionMaxQuery - Iterator code to for ( A a : container ) construct --- Key: LUCENE-1985 URL: https://issues.apache.org/jira/browse/LUCENE-1985 Project: Lucene - Java
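For readers unfamiliar with the construct named in the issue title, the following is a minimal sketch of the same loop written both ways. It uses a plain list of strings; the class and method names are made up for illustration and are not taken from DisjunctionMaxQuery or the attached patch.

```java
import java.util.Iterator;
import java.util.List;

public class ForEachSketch {
    // Explicit Iterator loop, the style the issue proposes to replace.
    public static int totalLengthWithIterator(List<String> words) {
        int total = 0;
        for (Iterator<String> it = words.iterator(); it.hasNext();) {
            String word = it.next();
            total += word.length();
        }
        return total;
    }

    // Java 5 enhanced for loop: same traversal, no visible iterator.
    // Accepting Iterable<String> shows that any Iterable works here.
    public static int totalLengthWithForEach(Iterable<String> words) {
        int total = 0;
        for (String word : words) {
            total += word.length();
        }
        return total;
    }
}
```

Both methods visit the elements in the same order; the second form is shorter and leaves no iterator variable to misuse.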

[jira] Closed: (LUCENE-1984) DisjunctionMaxQuery - Type safety

2009-10-16 Thread Kay Kay (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Kay closed LUCENE-1984. --- Thanks Uwe. The revised patch looks good as well, with better code readability. > DisjunctionMaxQuery - Type s

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2009-10-16 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12766562#action_12766562 ] Mark Miller commented on LUCENE-1458: - just committed an initial stab at pulsing cache

[jira] Resolved: (LUCENE-1984) DisjunctionMaxQuery - Type safety

2009-10-16 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler resolved LUCENE-1984. --- Resolution: Fixed Committed revision: 825881 Thanks Kay Kay! > DisjunctionMaxQuery - Type

[jira] Updated: (LUCENE-1984) DisjunctionMaxQuery - Type safety

2009-10-16 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-1984: -- Attachment: LUCENE-1984.patch Small updates in Patch (also implemented Iterable). I also gener

Re: search trough single pdf document - return page number

2009-10-16 Thread IvanDrago
Yes, I thought of that too, but I didn't know if I could search the index for only documents that have a specific field name. After some researching I found a way to do that: String q = "title:ant"; Query query = parser.parse(q); title:ant -> matches documents that contain the term ant in the title field Regards, Ivan Er

Re: search trough single pdf document - return page number

2009-10-16 Thread Erick Erickson
Well, you have to add another field to each document identifying the PDF it came from. From there, restricting to that doc just becomes adding an AND clause. Of course how you specify these is "an exercise left to the reader". Erick On Fri, Oct 16, 2009 at 8:01 AM, IvanDrago wrote: > > Hey! I d

Re: search trough single pdf document - return page number

2009-10-16 Thread IvanDrago
Proximity queries that span pages are not a concern in my case. I asked another question at the bottom of my last post. Could you comment on that if you have some ideas? Erick Erickson wrote: > > Glad things are progressing. The only problem here will be > proximity queries > that span pages. S

Re: search trough single pdf document - return page number

2009-10-16 Thread Erick Erickson
Glad things are progressing. The only problem here will be proximity queries that span pages. Say, the last word on page 10 is "salmon" and the first word on page 11 is "fishing". Structuring your index this way won't match a proximity search for "salmon fishing". If that's not a concern, then t
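The page-boundary limitation described above can be sketched without Lucene at all. The plain-Java example below is only a stand-in for a per-page index: each page is treated as its own searchable unit, and the class and method names are hypothetical. It is not how Lucene evaluates phrase or proximity queries internally, only a model of why text split across two units cannot match.

```java
import java.util.ArrayList;
import java.util.List;

public class PagePhraseSketch {
    /**
     * Returns the 1-based numbers of pages whose own text contains
     * the word `first` immediately followed by the word `second`.
     * Each page is searched in isolation, like one document per page.
     */
    public static List<Integer> pagesWithPhrase(List<String> pages, String first, String second) {
        List<Integer> hits = new ArrayList<Integer>();
        for (int i = 0; i < pages.size(); i++) {
            String[] words = pages.get(i).toLowerCase().split("\\s+");
            for (int w = 0; w + 1 < words.length; w++) {
                if (words[w].equals(first) && words[w + 1].equals(second)) {
                    hits.add(i + 1); // report the page number, not the position
                    break;           // one hit per page is enough
                }
            }
        }
        return hits;
    }
}
```

If one page ends with "salmon" and the next begins with "fishing", neither page is reported, because the two words never sit inside the same unit. A page that contains both words side by side is reported by its page number.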

Re: search trough single pdf document - return page number

2009-10-16 Thread IvanDrago
Hey! I did it! Erick and Robert, you helped a lot. Thanks! I didn't use LucenePDFDocument. I created a new document for every page in a PDF document and added page number info for every page. PDDocument pddDocument=PDDocument.load(f); PDFTextStripper textStripper=new PDFTextStripp

Re: lucene 2.9 sorting algorithm

2009-10-16 Thread Michael McCandless
Thanks John; I'll have a look. Mike On Fri, Oct 16, 2009 at 12:57 AM, John Wang wrote: > Hi Michael: >     I added classes: ScoreDocComparatorQueue and OneSortNoScoreCollector as > a more general case. I think keeping the old api for ScoreDocComparator and > SortComparatorSource would work. >   

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2009-10-16 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12766482#action_12766482 ] Michael McCandless commented on LUCENE-1458: bq. you wanna remove them & commi

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2009-10-16 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12766481#action_12766481 ] Michael McCandless commented on LUCENE-1458: OK thanks for addressing the new n

JDBC access to a Lucene index

2009-10-16 Thread Jukka Zitting
Hi, Some while ago I implemented a simple JDBC to JCR bridge [1] that allows one to query a JCR repository from any JDBC client, most notably various reporting tools. Now I'm wondering if something similar already exists for a normal Lucene index. Something that would treat your entire index as o