[
https://issues.apache.org/jira/browse/SOLR-13752?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Andy Hind updated SOLR-13752:
-
Status: Patch Available (was: Open)
> MoreLikeThis MLT is biased for uncommon fields
>
[
https://issues.apache.org/jira/browse/SOLR-13752?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16926877#comment-16926877
]
Andy Hind commented on SOLR-13752:
--
https://github.com/apache/lucene-solr/pull/871
> MoreLikeThis MLT
Andy Hind created SOLR-13752:
Summary: MoreLikeThis MLT is biased for uncommon fields
Key: SOLR-13752
URL: https://issues.apache.org/jira/browse/SOLR-13752
Project: Solr
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/SOLR-12879?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16805000#comment-16805000
]
Andy Hind commented on SOLR-12879:
--
Yes, there are two parts to the doc update. One for minhash filter
[
https://issues.apache.org/jira/browse/LUCENE-6968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16804310#comment-16804310
]
Andy Hind commented on LUCENE-6968:
---
[~mayyas], in answer to your questions:
1) Depends on your view
[
https://issues.apache.org/jira/browse/SOLR-12879?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16804303#comment-16804303
]
Andy Hind commented on SOLR-12879:
--
I do not see the docs for this updated/added in 8.0 ...
> Query
[
https://issues.apache.org/jira/browse/LUCENE-6968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16737192#comment-16737192
]
Andy Hind commented on LUCENE-6968:
---
[~mayyas] Hi Mayya, there is a good review paper here
[
https://issues.apache.org/jira/browse/SOLR-11207?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16696201#comment-16696201
]
Andy Hind commented on SOLR-11207:
--
Is there still interest in adding this improvement? It was pretty
[
https://issues.apache.org/jira/browse/SOLR-12879?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Andy Hind updated SOLR-12879:
-
Attachment: minhash.qparser.adoc.fragment
> Query Parser for MinHash/LSH
>
[
https://issues.apache.org/jira/browse/SOLR-12879?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16668841#comment-16668841
]
Andy Hind commented on SOLR-12879:
--
Should I raise separate issues for the documentation?
> Query
[
https://issues.apache.org/jira/browse/SOLR-12879?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Andy Hind updated SOLR-12879:
-
Attachment: minhash.filter.adoc.fragment
> Query Parser for MinHash/LSH
>
>
[
https://issues.apache.org/jira/browse/SOLR-12879?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16660681#comment-16660681
]
Andy Hind commented on SOLR-12879:
--
MinHash Filter doc ...
{quote}
== MinHash Filter
Generates a
[
https://issues.apache.org/jira/browse/SOLR-12879?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16659793#comment-16659793
]
Andy Hind commented on SOLR-12879:
--
I don't think there is any reason the patch would not go back to
[
https://issues.apache.org/jira/browse/SOLR-12879?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Andy Hind updated SOLR-12879:
-
Attachment: minhash.patch
> Query Parser for MinHash/LSH
>
>
>
Andy Hind created SOLR-12879:
Summary: Query Parser for MinHash/LSH
Key: SOLR-12879
URL: https://issues.apache.org/jira/browse/SOLR-12879
Project: Solr
Issue Type: New Feature
Security
Andy Hind created SOLR-10025:
Summary: SOLR_SSL_OPTS are ignored in bin\solr.cmd
Key: SOLR-10025
URL: https://issues.apache.org/jira/browse/SOLR-10025
Project: Solr
Issue Type: Bug
[
https://issues.apache.org/jira/browse/LUCENE-7476?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15561858#comment-15561858
]
Andy Hind edited comment on LUCENE-7476 at 10/10/16 10:15 AM:
--
Running the
[
https://issues.apache.org/jira/browse/LUCENE-7476?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15561858#comment-15561858
]
Andy Hind commented on LUCENE-7476:
---
Running the tests 100 times via ant produces no issue. This seems
[
https://issues.apache.org/jira/browse/LUCENE-7476?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15561799#comment-15561799
]
Andy Hind commented on LUCENE-7476:
---
I spotted this running
[
https://issues.apache.org/jira/browse/LUCENE-7476?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15551409#comment-15551409
]
Andy Hind commented on LUCENE-7476:
---
Patch supplied - TestFactories ran with more than 700 seeds
[
https://issues.apache.org/jira/browse/LUCENE-7476?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Andy Hind updated LUCENE-7476:
--
Attachment: LUCENE-7476.patch
> Fix transient failure in JapaneseNumberFilter run from TestFactories
>
Andy Hind created LUCENE-7476:
-
Summary: Fix transient failure in JapaneseNumberFilter run from
TestFactories
Key: LUCENE-7476
URL: https://issues.apache.org/jira/browse/LUCENE-7476
Project: Lucene -
[
https://issues.apache.org/jira/browse/LUCENE-6968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15330032#comment-15330032
]
Andy Hind commented on LUCENE-6968:
---
Hi Tommaso - are you planning to merge this to 6.x?
> LSH Filter
[
https://issues.apache.org/jira/browse/LUCENE-6968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328054#comment-15328054
]
Andy Hind edited comment on LUCENE-6968 at 6/13/16 7:55 PM:
Hi Tommaso, the
[
https://issues.apache.org/jira/browse/LUCENE-6968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328054#comment-15328054
]
Andy Hind commented on LUCENE-6968:
---
Hi Tommaso, the MinHashFilterTest was running fine. It was
[
https://issues.apache.org/jira/browse/LUCENE-6968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15263867#comment-15263867
]
Andy Hind edited comment on LUCENE-6968 at 5/6/16 8:43 PM:
---
After a bit more
[
https://issues.apache.org/jira/browse/LUCENE-6968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15274697#comment-15274697
]
Andy Hind commented on LUCENE-6968:
---
I have attached an updated patch.
This addresses the following
[
https://issues.apache.org/jira/browse/LUCENE-6968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Andy Hind updated LUCENE-6968:
--
Attachment: LUCENE-6968.5.patch
> LSH Filter
> --
>
> Key: LUCENE-6968
>
[
https://issues.apache.org/jira/browse/LUCENE-6968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15263867#comment-15263867
]
Andy Hind commented on LUCENE-6968:
---
After a bit more digging, the single hash and keeping the minimum
[
https://issues.apache.org/jira/browse/LUCENE-6968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15262918#comment-15262918
]
Andy Hind commented on LUCENE-6968:
---
[~yo...@apache.org] has murmurhash3_x64_128 here
[
https://issues.apache.org/jira/browse/LUCENE-6968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15262169#comment-15262169
]
Andy Hind commented on LUCENE-6968:
---
I agree a pure token stream test makes sense. The only concern I
[
https://issues.apache.org/jira/browse/LUCENE-6968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15259924#comment-15259924
]
Andy Hind edited comment on LUCENE-6968 at 4/28/16 9:27 AM:
This comes down
[
https://issues.apache.org/jira/browse/LUCENE-6968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15259924#comment-15259924
]
Andy Hind commented on LUCENE-6968:
---
This comes down to "what is a good estimate of |A U B|" and do we
[
https://issues.apache.org/jira/browse/LUCENE-6968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15256114#comment-15256114
]
Andy Hind edited comment on LUCENE-6968 at 4/25/16 11:52 AM:
-
The argument
[
https://issues.apache.org/jira/browse/LUCENE-6968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15256114#comment-15256114
]
Andy Hind commented on LUCENE-6968:
---
The argument here says it is pretty much the same.
{code}
[
https://issues.apache.org/jira/browse/LUCENE-6968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15252743#comment-15252743
]
Andy Hind edited comment on LUCENE-6968 at 4/21/16 11:06 PM:
-
Hi
It would be
[
https://issues.apache.org/jira/browse/LUCENE-6968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Andy Hind updated LUCENE-6968:
--
Attachment: LUCENE-6968.patch
> LSH Filter
> --
>
> Key: LUCENE-6968
>
[
https://issues.apache.org/jira/browse/LUCENE-6968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15252743#comment-15252743
]
Andy Hind commented on LUCENE-6968:
---
Hi
It would be quite common to use min hashing after shingling.
[
http://issues.apache.org/jira/browse/LUCENE-307?page=comments#action_12417515 ]
Andy Hind commented on LUCENE-307:
--
I have seen something similar.
When the lock file is deleted the return value is not checked.
I have seen cases where the lock file is
[
http://issues.apache.org/jira/browse/LUCENE-415?page=comments#action_12412384 ]
Andy Hind commented on LUCENE-415:
--
file.getChannel() was added on windows.
It was *before* the truncating file issue was found and resolved.
It is possible the two are
[
http://issues.apache.org/jira/browse/LUCENE-436?page=comments#action_12377764 ]
Andy Hind commented on LUCENE-436:
--
I agree this is not strictly an issue with Lucene, but ...
Lucene has an unusual use pattern for thread locals (instance vs static
[ http://issues.apache.org/jira/browse/LUCENE-529?page=all ]
Andy Hind updated LUCENE-529:
-
Attachment: ThreadLocalTest.java
Attached is a test which you can use to see how ThreadLocals are left around.
Getting an out of memory exception depends on a number
TermInfosReader and other + instance ThreadLocal = transient/odd memory leaks = OutOfMemoryException
Key: LUCENE-529
URL: http://issues.apache.org/jira/browse/LUCENE-529
Extend NumberTools to support int/long/float/double to string
--
Key: LUCENE-530
URL: http://issues.apache.org/jira/browse/LUCENE-530
Project: Lucene - Java
Type: Improvement
Components: Analysis
[
http://issues.apache.org/jira/browse/LUCENE-415?page=comments#action_12371284 ]
Andy Hind commented on LUCENE-415:
--
We have tested the above solution pretty heavily since 18/11/2005 and would
regard it as stable in 1.4.3.
Looking at the 1.9 code stream
[
http://issues.apache.org/jira/browse/LUCENE-415?page=comments#action_12371385 ]
Andy Hind commented on LUCENE-415:
--
The problem is that the output is going into a file that already exists.
I assume it leaves and then finds old bits during random access
[
http://issues.apache.org/jira/browse/LUCENE-415?page=comments#action_12357882 ]
Andy Hind commented on LUCENE-415:
--
And I can reproduce it on 1.4.3.
When FSDirectory.createFile creates a FSOutputStream the random access file may
already exist and
47 matches