[ https://issues.apache.org/jira/browse/MAHOUT-963?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sean Owen updated MAHOUT-963:
-----------------------------

         Priority: Minor  (was: Major)
    Fix Version/s:     (was: 0.6)
       Issue Type: Improvement  (was: Bug)

My profiling shows that the sorting barely registers as part of the time taken 
to load a model -- do you have a case where it makes a measurable difference?
These arrays are small enough that, as far as I can see, the choice of sort 
isn't a problem.

Still, small wins are wins. I tried a little test, sorting a bunch of these 
arrays with exponentially distributed sizes. The comb sort is faster than the 
selection sort once the average size is about 15 or more, which is realistic 
for a lot of data sets. I can squeeze a little more out of the implementation 
to make it a little faster.
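
For anyone who wants to try this kind of comparison, here is a rough sketch of 
the two sorts and a test harness. It is not the code from MAHOUT-963.diff or 
from Mahout itself: the class SortSketch, its method names, and the 1.3 shrink 
factor are assumptions made for illustration, and plain long[] arrays stand in 
for the preference arrays.

```java
import java.util.Random;
import java.util.function.Consumer;

// Hypothetical sketch; none of these names come from Mahout.
final class SortSketch {

  // Selection sort: always does O(n^2) comparisons, whatever the input order.
  static void selectionSort(long[] a) {
    for (int i = 0; i < a.length - 1; i++) {
      int min = i;
      for (int j = i + 1; j < a.length; j++) {
        if (a[j] < a[min]) {
          min = j;
        }
      }
      long tmp = a[i];
      a[i] = a[min];
      a[min] = tmp;
    }
  }

  // Comb sort: bubble-sort passes over a gap that shrinks by ~1.3 each pass.
  // It stops once a full pass at gap 1 makes no swaps.
  static void combSort(long[] a) {
    int gap = a.length;
    boolean swapped = true;
    while (gap > 1 || swapped) {
      gap = Math.max(1, (int) (gap / 1.3));
      swapped = false;
      for (int i = 0; i + gap < a.length; i++) {
        if (a[i] > a[i + gap]) {
          long tmp = a[i];
          a[i] = a[i + gap];
          a[i + gap] = tmp;
          swapped = true;
        }
      }
    }
  }

  // Random array whose length is drawn from an exponential distribution
  // with roughly the given mean (rounded up, so the true mean is a bit higher).
  static long[] randomArray(Random random, double meanSize) {
    int size = (int) Math.ceil(-meanSize * Math.log(1.0 - random.nextDouble()));
    long[] a = new long[size];
    for (int i = 0; i < size; i++) {
      a[i] = random.nextLong();
    }
    return a;
  }

  // Wall-clock time to sort a copy of every array; includes the copy cost.
  static long timeNanos(Consumer<long[]> sorter, long[][] arrays) {
    long start = System.nanoTime();
    for (long[] array : arrays) {
      sorter.accept(array.clone());
    }
    return System.nanoTime() - start;
  }
}
```

To compare, fill a long[][] with randomArray(random, 15.0) and call 
SortSketch.timeNanos(SortSketch::selectionSort, arrays) and 
SortSketch.timeNanos(SortSketch::combSort, arrays) several times each. The 
first few rounds mostly measure JIT warm-up, so treat single runs with 
suspicion.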

I think it's a positive change and will put it in after the code freeze.
                
> GenericUserPreferenceArray and GenericItemPreferenceArray use selection sorts
> -----------------------------------------------------------------------------
>
>                 Key: MAHOUT-963
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-963
>             Project: Mahout
>          Issue Type: Improvement
>          Components: Collaborative Filtering
>    Affects Versions: 0.6
>            Reporter: Bryce Nyeggen
>            Assignee: Sean Owen
>            Priority: Minor
>         Attachments: MAHOUT-963.diff
>
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> Both PreferenceArray implementations use selection sorts with poor 
> performance.  These sorts are invoked during construction of 
> GenericDataModels, causing excessive construction time.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
