[jira] [Commented] (MAHOUT-1456) The wikipediaXMLSplitter example fails with heap size error

2014-03-18 Thread mahmood (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13938928#comment-13938928 ] mahmood commented on MAHOUT-1456: - Please see more outputs of hadoop-2.1.0-beta. I will

[jira] [Updated] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-18 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Schelter updated MAHOUT-1464: --- Attachment: MAHOUT-1464.patch Updated patch to match the coding conventions and use

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-18 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13938958#comment-13938958 ] Sebastian Schelter commented on MAHOUT-1464: The physical operator for

[jira] [Commented] (MAHOUT-1456) The wikipediaXMLSplitter example fails with heap size error

2014-03-18 Thread mahmood (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13938995#comment-13938995 ] mahmood commented on MAHOUT-1456: - Here is the output of hadoop 1.2.1. Please note that

[jira] [Commented] (MAHOUT-1365) Weighted ALS-WR iterator for Spark

2014-03-18 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1365?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13939019#comment-13939019 ] Sebastian Schelter commented on MAHOUT-1365: would be awesome to have this

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-18 Thread Pat Ferrel (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13939420#comment-13939420 ] Pat Ferrel commented on MAHOUT-1464: PDF in the repo is fine by me. Can the patches

[jira] [Commented] (MAHOUT-1456) The wikipediaXMLSplitter example fails with heap size error

2014-03-18 Thread Andrew Musselman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13939435#comment-13939435 ] Andrew Musselman commented on MAHOUT-1456: -- Like Suneel said and like I think I

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-18 Thread Dmitriy Lyubimov (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13939451#comment-13939451 ] Dmitriy Lyubimov commented on MAHOUT-1464: -- That's what i normally do, yes. The

[jira] [Comment Edited] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-18 Thread Dmitriy Lyubimov (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13939468#comment-13939468 ] Dmitriy Lyubimov edited comment on MAHOUT-1464 at 3/18/14 4:56 PM:

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-18 Thread Dmitriy Lyubimov (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13939468#comment-13939468 ] Dmitriy Lyubimov commented on MAHOUT-1464: -- [~ssc] Looking nice. I guess we

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-18 Thread Pat Ferrel (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13939471#comment-13939471 ] Pat Ferrel commented on MAHOUT-1464: Since there are potentially commits by D and S

Re: [jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-18 Thread Dmitriy Lyubimov
On Tue, Mar 18, 2014 at 9:57 AM, Pat Ferrel (JIRA) j...@apache.org wrote: [ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13939471#comment-13939471] Pat Ferrel commented on MAHOUT-1464:

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-18 Thread Dmitriy Lyubimov (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13939513#comment-13939513 ] Dmitriy Lyubimov commented on MAHOUT-1464: --

[jira] [Commented] (MAHOUT-1456) The wikipediaXMLSplitter example fails with heap size error

2014-03-18 Thread mahmood (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13939518#comment-13939518 ] mahmood commented on MAHOUT-1456: - Generally, I agree that there might be some bug in the

[jira] [Issue Comment Deleted] (MAHOUT-1456) The wikipediaXMLSplitter example fails with heap size error

2014-03-18 Thread mahmood (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] mahmood updated MAHOUT-1456: Comment: was deleted (was: In that pastbin link, I see that only the last command produces the heap

Introducing PredictionIO: A developer-friendly Mahout stack for production

2014-03-18 Thread Simon Chan
Hi, After a year of work, I would like to present PredictionIO project ( https://github.com/PredictionIO) to this community. When a few of us were doing PhD study, Mahout was the de facto Java package that we used in many research work. This is a very powerful algorithm library, yet we see that

[jira] [Commented] (MAHOUT-1356) Ensure unit tests fail fast when writing outside mvn target directory

2014-03-18 Thread Frank Scholten (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13939768#comment-13939768 ] Frank Scholten commented on MAHOUT-1356: [~smarthi] What exactly do I have to add

[jira] [Commented] (MAHOUT-1356) Ensure unit tests fail fast when writing outside mvn target directory

2014-03-18 Thread Andrew Musselman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13939773#comment-13939773 ] Andrew Musselman commented on MAHOUT-1356: -- I don't know this stuff either but

Re: [jira] [Comment Edited] (MAHOUT-1426) GSOC 2013 Neural network algorithms

2014-03-18 Thread Maciej Mazur
I'll say what I think about it. I know that mahout is currently heading in different direction. You are working on refactoring, improving existing api and migrating to Spark. I know that there is a great deal of work to do there. I would also like to help with that. I am impressed by results

[jira] [Commented] (MAHOUT-1356) Ensure unit tests fail fast when writing outside mvn target directory

2014-03-18 Thread Isabel Drost-Fromm (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13939807#comment-13939807 ] Isabel Drost-Fromm commented on MAHOUT-1356: [~andrew.musselman] [~smarthi]

[jira] [Commented] (MAHOUT-1356) Ensure unit tests fail fast when writing outside mvn target directory

2014-03-18 Thread Andrew Musselman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13939937#comment-13939937 ] Andrew Musselman commented on MAHOUT-1356: -- I tried to follow along with the

Plan for 1.0

2014-03-18 Thread Saikat Kanjilal
Hi Guys, I read through the email threads with the weigh ins for the inclusion of H2O as well as spark and wanted to circle back on the plan for folks to meet around 1.0, so a few questions: 1) How does the inclusion of H2O and spark weigh in importance versus the current JIRA items that are

Re: [GSOC 2014] Uniform API for Mahout Clustering

2014-03-18 Thread chalitha udara Perera
Hi everyone, Greatly appreciate your interest on this issue. I have gone through the document ScalaSparkBindings [1] . In this project my initial idea was to provide high level API for end user programmers so that they have the flexibility of plugin in different types of algorithms without