[ 
https://issues.apache.org/jira/browse/LUCENE-3271?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13058170#comment-13058170
 ] 

Robert Muir commented on LUCENE-3271:
-------------------------------------

Thanks for working to clean this up! a few of my comments:
{quote}
similar.* -> suggest module
{quote}

Seems a little funky? I guess if we had a query-expansion module, I would think 
it belonged there.

{quote}
FieldCacheRewriteMethod -> This doesn't belong in this contrib or the queries 
module. I think we should push it to contrib/misc for the time being. It seems 
to have quite a few constraints on when its useful. If indeed 
CONSTANT_SCORE_AUTO rewrite is better, then I dont see a purpose for it.
{quote}

My vote would actually be to move this to src/test! :) Yeah there are some 
scenarios where this thing could be faster, but really I thought it was just a 
good way to add seek to the doctermsindex termsenum. I do think it and its test 
would be a nice addition to src/test, if someone wants to use it the can always 
snag it from there... its that expert.

{quote}
FuzzyLikeThisQuery -> suggest module. This class seems a mess with its 
Similarity hardcoded. With all that said, it does seem to do what it claims and 
with some cleanup, it could be good.
{quote}

I actually made some comments about this query in the flexscoring branch: 
http://svn.apache.org/repos/asf/lucene/dev/branches/flexscoring/lucene/contrib/queries/src/java/org/apache/lucene/search/FuzzyLikeThisQuery.java

In my opinion, as a rewrite method (i think it would require 2, one for the 
variant that ignores TF), we could get better performance out of this with 
cleaner code... in other words you would just use ordinary FuzzyQuery and set 
this rewrite method for its scoring heuristic, or a BQ of FuzzyQueries if you 
are doing the expansion thing

{quote}
SlowCollated* -> They can stay in the module till we have a better place to 
nuke them.
{quote}

This is all totally deprecated code that seems nice in contrib to reflect the 
fact that its not scalable...  I don't think its a huge issue where it goes due 
to the fact its deprecate and the name :)

Finally, I wanted to say that its my opinion that we shouldn't put garbage in 
modules. Modules should be treated like core I think.... yet at the same time I 
totally support efforts to cleanup contrib, either removing sandy stuff or 
refactoring it where it belongs in a module.

One option could to create a sandbox directory either under lucene (it contains 
src/java and src/test but is totally an unorganized sandbox), or itself as a 
contrib temporarily (contrib/sandbox) and take a look at contrib and move stuff 
thats good into modules, but toss all the 'odd things' into this sandbox. 


> Move 'good' contrib/queries classes to Queries module
> -----------------------------------------------------
>
>                 Key: LUCENE-3271
>                 URL: https://issues.apache.org/jira/browse/LUCENE-3271
>             Project: Lucene - Java
>          Issue Type: Improvement
>            Reporter: Chris Male
>
> With the Queries module now filled with the FunctionQuery stuff, we should 
> look at closing down contrib/queries.  While not a huge contrib, it contains 
> a number of pretty useful classes and some that should go elsewhere.
> Heres my proposed plan:
> - similar.* -> suggest module
> - regex.* -> queries module
> - BooleanFilter -> queries module under .filters package
> - BoostingQuery -> queries module
> - ChainedFilter -> queries module under .filters package
> - DuplicateFilter -> queries module under .filters package
> - FieldCacheRewriteMethod -> This doesn't belong in this contrib or the 
> queries module.  I think we should push it to contrib/misc for the time 
> being.  It seems to have quite a few constraints on when its useful.  If 
> indeed CONSTANT_SCORE_AUTO rewrite is better, then I dont see a purpose for 
> it.
> - FilterClause -> class inside BooleanFilter
> - FuzzyLikeThisQuery -> suggest module. This class seems a mess with its 
> Similarity hardcoded.  With all that said, it does seem to do what it claims 
> and with some cleanup, it could be good.
> - TermsFilter -> queries module under .filters package
> - SlowCollated* -> They can stay in the module till we have a better place to 
> nuke them.
> One of the implications of the above moves, is that the xml-query-parser, 
> which supports many of the queries, will need to have a dependency on the 
> queries module.  But that seems unavoidable at this stage.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to