Jira issue: https://issues.apache.org/jira/browse/SOLR-6468
--
View this message in context:
http://lucene.472066.n3.nabble.com/Help-with-StopFilterFactory-tp4153839p4156373.html
Sent from the Solr - User mailing list archive at Nabble.com.
Hello,
Any thoughts on this? Should I open a Jira ticket? Or how can we get at
least one of the Solr devs to look at this issue?
Best,
Alex
--
. Perhaps those who made this decision
do not use Solr so they simply don't care, that's the only explanation I can
find.
--
for
many people who do know how to use it?
--
Sent: Tuesday, August 26, 2014 6:02 AM
To: solr-user@lucene.apache.org
Subject: Re: Help with StopFilterFactory
Hi, I just tried your suggestion but got this error:
And then I found this:
http://stackoverflow.com/questions/18668376/solr-4-4-stopfilterfactory-and-enablepositionincrements.
I
with Solr? Now we have to look for alternatives because of such
weird decisions.
--
To: solr-user@lucene.apache.org
Subject: Re: Help with StopFilterFactory
So it sounds like a bug to me, doesn't it? The Internet is full of complaints
about this issue, and why should we all suffer because of someone who didn't
know when and how to use this feature and as a result got wrong data indexed?
Who
has:
parsedquery_toString: +(url_words_ngram:"? twitter com zer0sleep")
So the https part was replaced with a ?.
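
To make the ? easier to see, here is a minimal Python sketch of what I believe happens to the positions. It is only a simplified model of the analyzer chain on this field, not Lucene's code. The stop word set is an assumption (url_stopwords.txt itself is not shown in this thread), and analyze, phrase_to_string and keep_gaps are hypothetical names made up for the illustration.

```python
import re

# Assumed stop words; the real url_stopwords.txt is not shown in the thread.
URL_STOPWORDS = {"http", "https", "www"}


def analyze(text, stopwords=URL_STOPWORDS, keep_gaps=True):
    """Return (term, position) pairs for a simplified tokenizer + stop filter.

    Splits on runs of non-word characters, drops stop words (ignoring case)
    and lowercases the rest. Positions are 1-based, as in the discussion.
    With keep_gaps=True a removed stop word still consumes a position.
    """
    tokens = [t for t in re.split(r"[^\w]+", text) if t]
    result = []
    position = 0
    for token in tokens:
        position += 1
        if token.lower() in stopwords:
            if not keep_gaps:
                position -= 1  # act as if the stop word had never been there
            continue
        result.append((token.lower(), position))
    return result


def phrase_to_string(pairs):
    """Render the terms in position order, printing '?' for every empty slot."""
    if not pairs:
        return ""
    slots = ["?"] * pairs[-1][1]
    for term, pos in pairs:
        slots[pos - 1] = term
    return " ".join(slots)


if __name__ == "__main__":
    print(analyze("twitter.com/zer0sleep"))
    print(phrase_to_string(analyze("https://twitter.com/zer0sleep")))
    print(phrase_to_string(analyze("https://twitter.com/zer0sleep", keep_gaps=False)))
```

With keep_gaps=True the removed https still occupies the first position, so the query terms start one position later than the indexed ones and the phrase prints with a leading ?. With keep_gaps=False the stop word vanishes without a trace, which is the behavior I was hoping for.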
--
, try adding enablePositionIncrements=false, reindex, and see
what happens.
-- Jack Krupansky
-Original Message-
From: heaven
Sent: Monday, August 25, 2014 3:37 AM
To: solr-user@lucene.apache.org
Subject: Re: Help with StopFilterFactory
A valid search:
http://pastie.org/pastes/9500661
.
It should generate phrases after solr.StopFilterFactory has run (if one is
defined for a field). Or there should be another mechanism that removes
blacklisted words as if they had never been there, so they simply disappear.
--
From: heaven
Sent: Sunday, August 24, 2014 11:02 AM
To: solr-user@lucene.apache.org
Subject: Re: Help with StopFilterFactory
Unfortunately I can't change the operator, and a phrase query for
"https://twitter.com/testuser" doesn't work either.
It does work for twitter.com/testuser but that makes
though https
should be filtered out by the StopFilterFactory.
--
=solr.PatternTokenizerFactory pattern=[^\w]+ /
<filter class="solr.StopFilterFactory" words="url_stopwords.txt"
ignoreCase="true" />
<filter class="solr.LowerCaseFilterFactory" />
</analyzer>
</fieldType>
{code}
--
so that we can see what was really generated for the
query.
-- Jack Krupansky
-Original Message-
From: heaven
Sent: Sunday, August 24, 2014 12:04 PM
To: solr-user@lucene.apache.org
Subject: Re: Help with StopFilterFactory
I don't see any confusion, the problem is clearly explained
is that schema version is 1.5 and autoGeneratePhraseQueries is set to
true.
I haven't forgotten about the debug output; I'll be able to add it tomorrow morning.
--
Sent: Sunday, August 24, 2014 12:47 PM
To: solr-user@lucene.apache.org
Subject: Re: Help with StopFilterFactory
The problem is in #4:
4. if I index twitter.com/testuser and search for
https://twitter.com/testuser I am getting 0 matches even though https
should be filtered out by the StopFilterFactory
Any ideas? Doesn't that seem like a bug?
--
On 8/21/2014 7:25 AM, heaven wrote:
Any ideas? Doesn't that seem like a bug?
I think it should have worked even with autoGeneratePhraseQueries
enabled by the older schema version. The relative positions are the
same -- it's 1,2,3 in the index and 2,3,4 in the query. Absolute
positions don't
@lucene.apache.org
Subject: Re: Help with StopFilterFactory
/testuser
* http://twitter.com/otheruser (match by twitter and com)
* http://twitter.com/anotheruser
* etc., including all sites that match twitter and/or com (and there are a
lot, all of them unrelated).
--
On 8/21/2014 8:40 AM, heaven wrote:
With the 1.5 schema it works, but not as expected. I am indexing
twitter.com/testuser and only need to get exact matches, not those that
match twitter or com, so my search results should contain just one
record:
* http://twitter.com/testuser
but what I
On 8/21/2014 9:52 AM, Shawn Heisey wrote:
On 8/21/2014 8:40 AM, heaven wrote:
With the 1.5 schema it works, but not as expected. I am indexing
twitter.com/testuser and only need to get exact matches, not those that
match twitter or com, so my search results should contain just one
record:
*
?
Best,
Alexander
--
? That doesn't
make much sense.
Regards,
Alexander
--
Subject: Help with StopFilterFactory
Hi, I have the following text field:
<fieldType name="words_ngram" class="solr.TextField" omitNorms="false">
<analyzer>
<tokenizer class="solr.PatternTokenizerFactory" pattern="[^\w]+" />
<filter class="solr.StopFilterFactory" words="url_stopwords.txt"
ignoreCase="true" />
filter