[
https://issues.apache.org/jira/browse/LUCENE-1500?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12677639#action_12677639
]
Koji Sekiguchi commented on LUCENE-1500:
Peter, thank you.
bq. In the thread you
[
https://issues.apache.org/jira/browse/LUCENE-1550?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Thomas Morton updated LUCENE-1550:
--
Attachment: LUCENE-1550.patch
Patch includes implementation of n-gram string matching.
This i
Add N-Gram String Matching for Spell Checking
-
Key: LUCENE-1550
URL: https://issues.apache.org/jira/browse/LUCENE-1550
Project: Lucene - Java
Issue Type: New Feature
Components: contrib/
[
https://issues.apache.org/jira/browse/LUCENE-1500?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12677629#action_12677629
]
Peter Wolanin commented on LUCENE-1500:
---
Koji - thanks - I was aware that not all wo
[
https://issues.apache.org/jira/browse/LUCENE-1500?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12677626#action_12677626
]
Koji Sekiguchi commented on LUCENE-1500:
bq. for that field type. I will investiga
[
https://issues.apache.org/jira/browse/LUCENE-1500?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12677620#action_12677620
]
Peter Wolanin commented on LUCENE-1500:
---
Ah, it occurs to me that we first saw this
[
https://issues.apache.org/jira/browse/LUCENE-1500?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12677608#action_12677608
]
Hoss Man commented on LUCENE-1500:
--
Peter: i tried some experiments with teh analyzer spe
[
https://issues.apache.org/jira/browse/LUCENE-1186?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12677601#action_12677601
]
Robert Starzer commented on LUCENE-1186:
"EG are you picturing a single opaque cla
[
https://issues.apache.org/jira/browse/LUCENE-1186?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12677600#action_12677600
]
Michael McCandless commented on LUCENE-1186:
bq. -> is this true (only for Seg
[
https://issues.apache.org/jira/browse/LUCENE-1186?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12677584#action_12677584
]
Robert Starzer commented on LUCENE-1186:
ok thanks!
https://issues.apache.org/jir
> Maybe we can use the
> compression technology mentioned in this Wikipedia article to further
> optimize filters and their DocIdSetIterators.
We already use WAH-encoded bitmap filters over here for roughly a
year. And yes, they are nice.
--
Kirill Zakharenko/Кирилл Захаренко (ear...@gmail.com)
H
[
https://issues.apache.org/jira/browse/LUCENE-1500?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12677561#action_12677561
]
Peter Wolanin commented on LUCENE-1500:
---
I'm still trying to get a handle on how the
Right, I think Lucene could decide under-the-hood what's the best data
structure when writing the column-stride field. Sort of like how
BitVector has two ways (sparse vs unsparse) of storing itself on disk.
Mike
Otis Gospodnetic wrote:
So that would require Lucene to dynamically/periodically
[
https://issues.apache.org/jira/browse/LUCENE-1500?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12677551#action_12677551
]
Michael McCandless commented on LUCENE-1500:
But that class either uses TermVe
[
https://issues.apache.org/jira/browse/LUCENE-1500?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12677549#action_12677549
]
Hoss Man commented on LUCENE-1500:
--
bq. The extent of my tracing suggests it's coming whe
So that would require Lucene to dynamically/periodically check field values and
their frequencies and switch from a regular inverted index to a bitmap index or
just create an additional bitmap index for those fields and their values?
Otis
- Original Message
> From: Michael McCandles
OK, so that bit about filters, OpenBitSet and friends was my
feeling/understanding, too. That sort of matches what that Wikipedia page
describes as in-memory usage of bitmaps a la PostgreSQL. The reason I
mentioned Solr is because I was thinking of low-cardinality fields, perhaps the
same on
I think with column stride fields we should use Bitmap Index to
represent fields that have few values across many docs.
Mike
Uwe Schindler wrote:
In my opinion, we currently use some type of bitmap index with our
filters.
OpenBitSet and SortedVIntList used in filters can be seen as bitmap
[
https://issues.apache.org/jira/browse/LUCENE-1186?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12677543#action_12677543
]
Michael McCandless commented on LUCENE-1186:
bq. could you please explain to m
In my opinion, we currently use some type of bitmap index with our filters.
OpenBitSet and SortedVIntList used in filters can be seen as bitmap indexes
specifying if a document is a hit of the filter or not. Maybe we can use the
compression technology mentioned in this Wikipedia article to further
[
https://issues.apache.org/jira/browse/LUCENE-1500?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12677540#action_12677540
]
Peter Wolanin commented on LUCENE-1500:
---
I am using Solr, but with a single value fi
[
https://issues.apache.org/jira/browse/LUCENE-1500?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12677535#action_12677535
]
Michael McCandless commented on LUCENE-1500:
I thought the bug was in the anal
[
https://issues.apache.org/jira/browse/LUCENE-1500?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12677531#action_12677531
]
Peter Wolanin commented on LUCENE-1500:
---
The bug we are seeing now happens on pretty
[
https://issues.apache.org/jira/browse/LUCENE-1516?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12677533#action_12677533
]
Michael McCandless commented on LUCENE-1516:
bq. Does this mean that the point
[
https://issues.apache.org/jira/browse/LUCENE-1314?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Michael McCandless updated LUCENE-1314:
---
Attachment: LUCENE-1314.patch
Attached patch. I plan to commit in a day or two.
O
[
https://issues.apache.org/jira/browse/LUCENE-1500?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12677524#action_12677524
]
Michael McCandless commented on LUCENE-1500:
bq. This feels to me like one of
[
https://issues.apache.org/jira/browse/LUCENE-1516?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12677523#action_12677523
]
Jeremy Volkman commented on LUCENE-1516:
I noticed the comments about IW.getReader
[
https://issues.apache.org/jira/browse/LUCENE-1500?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12677517#action_12677517
]
Peter Wolanin commented on LUCENE-1500:
---
Well, this patch does not (obviously) solve
Hi,
I've had http://en.wikipedia.org/wiki/Bitmap_index open in my browser for
weeks, thinking I'd bring it up here -- would a bitmap index make sense
anywhere in Lucene (or perhaps Solr)?
Otis
-
To unsubscribe, e-mail: java-d
[
https://issues.apache.org/jira/browse/LUCENE-1500?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12677507#action_12677507
]
Mark Harwood commented on LUCENE-1500:
--
OK - choices are:
1) Throw a RuntimeExceptio
Michael Busch wrote:
On 2/26/09 1:50 PM, Michael McCandless wrote:
Michael Busch wrote:
On 2/24/09 4:05 AM, Michael McCandless wrote:
I believe we still need this, for remote filesystems (like NFS)
that have inconsistent client-side caching.
The fsync() ensures the local IO system has
On 2/26/09 1:50 PM, Michael McCandless wrote:
Michael Busch wrote:
On 2/24/09 4:05 AM, Michael McCandless wrote:
I believe we still need this, for remote filesystems (like NFS) that
have inconsistent client-side caching.
The fsync() ensures the local IO system has moved the bytes/file
me
[
https://issues.apache.org/jira/browse/LUCENE-1186?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12677423#action_12677423
]
Robert Starzer commented on LUCENE-1186:
you could use e.g. spring and specific sp
Yes, I have but it is too memory intensive. I used highlighter as my first
attempt but it was not a good solution because, I have to send the entire
text to highlighter.
What I did instead is similar to your suggestion.
1. use the analyzer to return me a token stream.
2. search the token stre
[
https://issues.apache.org/jira/browse/LUCENE-1186?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12677376#action_12677376
]
Michael McCandless commented on LUCENE-1186:
bq. IMHO, some kind of IOC contai
OK, we're now down to 3 2.4.1 issues:
https://issues.apache.org/jira/secure/IssueNavigator.jspa?reset=true&mode=hide&sorter/order=DESC&sorter/field=priority&resolution=-1&pid=12310110&fixfor=12313516
I've got 2 of them and I think Mark Harwood has the 3rd (LUCENE-1500).
Once we get this dow
[
https://issues.apache.org/jira/browse/LUCENE-1548?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Michael McCandless resolved LUCENE-1548.
Resolution: Fixed
Fix Version/s: 2.9
Thanks Thomas!
> LevenshteinDistance
[
https://issues.apache.org/jira/browse/LUCENE-1548?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Michael McCandless reassigned LUCENE-1548:
--
Assignee: Michael McCandless
> LevenshteinDistance code normalization is incor
[
https://issues.apache.org/jira/browse/LUCENE-1548?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12677373#action_12677373
]
Michael McCandless commented on LUCENE-1548:
Looks good, I'll commit. Thanks
[
https://issues.apache.org/jira/browse/LUCENE-1500?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12677372#action_12677372
]
Michael McCandless commented on LUCENE-1500:
Mark, do you want/have time to ta
[
https://issues.apache.org/jira/browse/LUCENE-1549?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Michael McCandless updated LUCENE-1549:
---
Attachment: LUCENE-1549.patch
Attached patch. I plan to commit in a day or so, to 2
Strengthen CheckIndex a bit
---
Key: LUCENE-1549
URL: https://issues.apache.org/jira/browse/LUCENE-1549
Project: Lucene - Java
Issue Type: Improvement
Components: Index
Affects Versions: 2.4
[
https://issues.apache.org/jira/browse/LUCENE-1546?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Michael McCandless resolved LUCENE-1546.
Resolution: Fixed
Committed revision 748493. Thanks Jason!
> Add IndexReader.flu
[
https://issues.apache.org/jira/browse/LUCENE-1186?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12677340#action_12677340
]
Robert Starzer commented on LUCENE-1186:
great! thanks!
IMHO, some kind of IOC con
[
https://issues.apache.org/jira/browse/LUCENE-1186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Michael McCandless updated LUCENE-1186:
---
Attachment: LUCENE-1186.patch
New patch, giving credit to Christian. Thanks Christi
[
https://issues.apache.org/jira/browse/LUCENE-1186?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12677281#action_12677281
]
rviper edited comment on LUCENE-1186 at 2/27/09 3:53 AM:
-
[
https://issues.apache.org/jira/browse/LUCENE-1186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Michael McCandless updated LUCENE-1186:
---
Attachment: LUCENE-1186.patch
Re-using a single analyzer should work around this...
[
https://issues.apache.org/jira/browse/LUCENE-1186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Michael McCandless updated LUCENE-1186:
---
Fix Version/s: 2.9
2.4.1
> [PATCH] Clear ThreadLocal instances in
[
https://issues.apache.org/jira/browse/LUCENE-1186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Michael McCandless reassigned LUCENE-1186:
--
Assignee: Michael McCandless
> [PATCH] Clear ThreadLocal instances in close()
Have you looked at the contrib Highlighter? Or using an Analyzer
directly to give you the offsets?
Erik
On Feb 26, 2009, at 9:32 AM, HPDrifter wrote:
When I get a search result based on my index, I need the exact
tokens which
were identified in the index as part of the result.
[
https://issues.apache.org/jira/browse/LUCENE-1186?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12677281#action_12677281
]
rviper edited comment on LUCENE-1186 at 2/27/09 12:38 AM:
--
you have a Lucene LM implementation only for research purposes in
http://ilps.science.uva.nl/resources/lm-lucene
is a very old implementation but maybe could be useful to you
jose
On Thu, Feb 26, 2009 at 9:25 AM, Paul Elschot wrote:
> On Thursday 26 February 2009 02:21:41 Koren Krupko wrote:
>
[
https://issues.apache.org/jira/browse/LUCENE-1186?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12677281#action_12677281
]
Robert Starzer commented on LUCENE-1186:
i'm using quartz schedules to trigger ind
53 matches
Mail list logo