[
https://issues.apache.org/jira/browse/SOLR-14597?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17185226#comment-17185226
]
Alexandre Rafalovitch commented on SOLR-14597:
----------------------------------------------
Posted this on SIP, but it belongs here more:
PatternTypingFilterFactory and DropIfFlaggedFilterFactory seem to be quite
similar to KeywordMarkerFilterFactory and TypeTokenFilterFactory to the degree
that perhaps the existing classes should be enhanced instead to support
additional functionality. Especially since keyword marking is integrated into
other parts of Solr (e.g. not dropping it as stopword, I think). Also
TypeTokenFilter can work as both a blacklist and a whitelist. Both types of
filtering are useful. E.g. I used it in the book to allow to search for emails
only extracted from some text:
[https://github.com/arafalov/solr-indexing-book/blob/master/published/text2/conf/schema.xml]
> Advanced Query Parser
> ---------------------
>
> Key: SOLR-14597
> URL: https://issues.apache.org/jira/browse/SOLR-14597
> Project: Solr
> Issue Type: New Feature
> Components: query parsers
> Affects Versions: 8.6
> Reporter: Mike Nibeck
> Assignee: Gus Heck
> Priority: Major
>
> This JIRA ticket tracks the progress of SIP-9, the Advanced Query Parser that
> is being donated by the Library of Congress. Full description of the feature
> can be found on the SIP Page.
> [https://cwiki.apache.org/confluence/display/SOLR/SIP-9+Advanced+Query+Parser]
> Briefly, this parser provides a comprehensive syntax for users that use
> search on a daily basis. It also reserves a smaller set of punctuators than
> other parsers. This facilitates easier handling of acronyms and punctuated
> patterns with meaning ( such as C++ or 401(k) ). The new syntax opens up some
> advanced features while also preventing access to arbitrary features via
> local parameters. This parser will be safe for accepting user queries
> directly with minimal pre-parsing, but for use cases beyond it's established
> features alternate query paths (using other parsers) will need to be supplied.
> The code drop is being prepared and will be supplied as soon as we receive
> guidance from the PMC regarding the proper process. Given that the Library
> already has a signed CCLA we need to understand which of these (or other
> processes) apply:
> [http://incubator.apache.org/ip-clearance/ip-clearance-template.html]
> and
> [https://www.apache.org/licenses/contributor-agreements.html#grants]
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]