[ 
https://issues.apache.org/jira/browse/MAHOUT-633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13011908#comment-13011908
 ] 

Dmitriy Lyubimov commented on MAHOUT-633:
-----------------------------------------

P.S. my stochastic projection jobs with modified VectorWritable which doesn't 
form actual vector at all but rather passes elements on stack memory also seem 
to run significantly faster just because of that. which is basically equivalent 
to case A vs. case B. I don't have exact benchmark comparison on hand since I 
never really wanted to compared it but it's quite apparent even with -Xmx200M 
map tasks. So perhaps if we could have actual real-life simulation benchmarks, 
we could actually see how "bad" or "good" it is. But my evidence so far has 
been pretty "bad", although i did not care collecting it for some time now so 
it related to less modern jvms.

> Add SequenceFileIterable; put Iterable stuff in one place
> ---------------------------------------------------------
>
>                 Key: MAHOUT-633
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-633
>             Project: Mahout
>          Issue Type: Improvement
>          Components: Classification, Clustering, Collaborative Filtering
>    Affects Versions: 0.4
>            Reporter: Sean Owen
>            Assignee: Sean Owen
>            Priority: Minor
>              Labels: iterable, iterator, sequence-file
>             Fix For: 0.5
>
>         Attachments: MAHOUT-633.patch, MAHOUT-633.patch
>
>
> In another project I have a useful little class, SequenceFileIterable, which 
> simplifies iterating over a sequence file. It's like FileLineIterable. I'd 
> like to add it, then use it throughout the code. See patch, which for now 
> merely has the proposed new classes. 
> Well it also moves some other iterator-related classes that seemed to be 
> outside their rightful home in common.iterator.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to