Re: Streaming KMeans clustering

2013-12-25 Thread Suneel Marthi
garbage collections. --sebastian On 23.12.2013 22:14, Suneel Marthi wrote: Has anyone be successful running Streaming KMeans clustering on a large dataset ( 100,000 points)? It just seems to take a very long time ( 4hrs) for the mappers to finish on about 300K data points

Re: Streaming KMeans clustering

2013-12-25 Thread Suneel Marthi
Not sure how that would work in a corporate setting wherein there's a fixed systemwide setting that cannot be overridden. Sent from my iPhone On Dec 25, 2013, at 9:44 AM, Sebastian Schelter s...@apache.org wrote: On 25.12.2013 14:19, Suneel Marthi wrote: On Tuesday, December

Re: Streaming KMeans clustering

2013-12-26 Thread Suneel Marthi
On Wed, Dec 25, 2013 at 3:49 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Not sure how that would work in a corporate setting wherein there's a fixed systemwide setting that cannot be overridden. Sent from my iPhone On Dec 25, 2013, at 9:44 AM, Sebastian

Re: Deprecated or drafts only algorithms what is the reasoning?

2014-01-03 Thread Suneel Marthi
It could be because:- a) they have been replaced by better performant alternatives b) lack of usage c) lack of support Please delete from wiki all algorithms that have been marked deprecated (the code for most of them has already been removed from trunk). On Friday, January 3, 2014 3:31

Re: Deprecated or drafts only algorithms what is the reasoning?

2014-01-03 Thread Suneel Marthi
See inline. The code for the deprecated algos has already been purged from trunk, its only the Wiki that needs cleaning up. On Friday, January 3, 2014 4:03 PM, i...@eprice.gr i...@eprice.gr wrote: Please confirm if below list  is correct before removing them:   Classification Deprecated or

Mahout 0.9 code freeze

2014-01-09 Thread Suneel Marthi
All, Working on getting 0.9 release out of the door, please refrain from committing any new code unless its deemed = major.equalsIgnoreCase() . Before I go ahead is there anything else that needs to be committed for 0.9 that we are waiting on??

Mahout 0.9 Release Candidate - VOTE

2014-01-10 Thread Suneel Marthi
Pushed the Mahout 0.9 Release candidate. See https://repository.apache.org/content/repositories/orgapachemahout-1000/ This is a call for Vote.

Re: Mahout 0.9 Release Candidate - VOTE

2014-01-14 Thread Suneel Marthi
Calling for volunteers to test this Release. On Friday, January 10, 2014 7:39 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Pushed the Mahout 0.9 Release candidate. See https://repository.apache.org/content/repositories/orgapachemahout-1000/ This is a call for Vote.

Re: Mahout 0.9 Release Candidate - VOTE

2014-01-14 Thread Suneel Marthi
to volunteer to test this release. What is the procedure/steps to get started and what pre-reqs I need to have? Cheers .S On Tue, Jan 14, 2014 at 6:52 PM, Suneel Marthi suneel_mar...@yahoo.comwrote: Calling for volunteers to test this Release. On Friday, January 10, 2014 7:39 PM, Suneel Marthi

Re: Mahout 0.9 Release Candidate - VOTE

2014-01-14 Thread Suneel Marthi
before the installation so I assumed maven dependencies are all available . On Tue, Jan 14, 2014 at 7:03 PM, Suneel Marthi suneel_mar...@yahoo.comwrote: Here's the link to Release artifacts for Mahout 0.9: https://repository.apache.org/content/repositories/orgapachemahout-1000/ For those

Re: Edit CMS in anonymous mode

2014-01-15 Thread Suneel Marthi
How do I edit the new site. don't see login/edit links? On Wednesday, January 15, 2014 8:08 AM, Isabel Drost-Fromm isa...@apache.org wrote: On Fri, Jan 10, 2014 at 05:31:25PM +0530, Tharindu Rusira wrote: Yes Sotiris, Only commitors are allowed to push changes to staging or production

Re: [jira] [Commented] (MAHOUT-1396) Accidental use of commons-math won't work with next Hadoop 2 release

2014-01-15 Thread Suneel Marthi
, 2014 8:09 AM, Suneel Marthi (JIRA) j...@apache.org wrote:     [ https://issues.apache.org/jira/browse/MAHOUT-1396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13871782#comment-13871782] Suneel Marthi commented on MAHOUT-1396

Re: Mahout 0.9 Release Candidate - VOTE

2014-01-15 Thread Suneel Marthi
? Cheers, .S On Tue, Jan 14, 2014 at 7:03 PM, Suneel Marthi suneel_mar...@yahoo.comwrote: Here's the link to Release artifacts for Mahout 0.9: https://repository.apache.org/content/repositories/orgapachemahout-1000/ For those volunteering to test this, some of the stuff to look out

Re: Mahout 0.9 Release Candidate - VOTE

2014-01-16 Thread Suneel Marthi
download the source tar and check it as any other Mahout release. Regards, Thanks     Chameera On Wed, Jan 15, 2014 at 12:21 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Thanks Tharindu. On Tuesday, January 14

Mahout 0.9 Release - Call for Volunteers

2014-01-16 Thread Suneel Marthi
Here's the new URL for Mahout 0.9 Release: https://repository.apache.org/content/repositories/orgapachemahout-1001/org/apache/mahout/mahout-buildtools/0.9/ For those volunteering to test this, some of the things to be verified: a) Verify that u can unpack the release (tar or zip) b) Verify u r

Re: Mahout 0.9 Release Candidate - VOTE

2014-01-16 Thread Suneel Marthi
. On Thu, Jan 16, 2014 at 7:04 PM, Suneel Marthi suneel_mar...@yahoo.comwrote: It would be .tar.gz file and you would find it under mahout/distribution. On Wednesday, January 15, 2014 11:45 PM, Chameera Wijebandara chameerawijeband...@gmail.com wrote: Ok let's see after fixed the URL Thank

MAHOUT 0.9 Release - New URL

2014-01-16 Thread Suneel Marthi
Third time's a Charm!!! Here's the new URL for Mahout 0.9 Release: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ For those volunteering to test this, some of the things to be verified: a) Verify that u can unpack the release

Re: MAHOUT 0.9 Release - New URL

2014-01-19 Thread Suneel Marthi
, tried out a few of the examples. +1 (binding) On Jan 16, 2014, at 9:41 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Third time's a Charm!!! Here's the new URL for Mahout 0.9 Release: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout

Re: MAHOUT 0.9 Release - New URL

2014-01-19 Thread Suneel Marthi
I can. Frank On Sun, Jan 19, 2014 at 10:55 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Thanks Grant. Not sure if I can vote given my role as the BuildMeister/ReleaseMeister for 0.9. Here's my +1 FWIW. a) Attached is the draft of the Release notes for 0.9, would definitely

Re: MAHOUT 0.9 Release - New URL

2014-01-19 Thread Suneel Marthi
: Exported MAHOUT_LOCAL=true and still get the same results. On Sun, Jan 19, 2014 at 5:00 PM, Suneel Marthi suneel_mar...@yahoo.comwrote: Frank, Were u running this with MAHOUT_LOCAL=true? On Sunday, January 19, 2014 10:29 AM, Frank Scholten fr...@frankscholten.nl wrote: -1 The cluster

Re: MAHOUT 0.9 Release - New URL

2014-01-19 Thread Suneel Marthi
It works when both MAHOUT_LOCAL=true and '-xm sequential' option are set. Guess will have to cut a release again with '-xm sequential' option set. On Sunday, January 19, 2014 11:31 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Its presently setup to run in MR mode (the way its been

Re: MAHOUT 0.9 Release - New URL

2014-01-19 Thread Suneel Marthi
...@frankscholten.nl wrote: When I run in MR mode I get the same problem. See http://pastebin.com/TXJ5mQmt On Sun, Jan 19, 2014 at 5:31 PM, Frank Scholten fr...@frankscholten.nl wrote: OK, running in MR mode now. On Sun, Jan 19, 2014 at 5:30 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Its

Re: MAHOUT 0.9 Release - New URL

2014-01-19 Thread Suneel Marthi
Stevo, could u test streaming kmeans? Sent from my iPhone On Jan 19, 2014, at 8:10 PM, Stevo Slavić ssla...@gmail.com wrote: +1 (binding) On Sun, Jan 19, 2014 at 7:49 PM, Dmitriy Lyubimov dlie...@gmail.com wrote: I'll try to test out soon

Re: MAHOUT 0.9 Release - New URL

2014-01-20 Thread Suneel Marthi
Hmmm... that's an issue. Since both Dirichlet and Meanshift clustering have been removed from 0.9, cluster-syntheticcontrol.sh options 4,5 are not gonna work and should have been removed for 0.9. To PMC,  - rollback the release, fix this issue (and other patches that were submitted in the

Re: MAHOUT 0.9 Release - New URL

2014-01-20 Thread Suneel Marthi
This is an issue (trivial one though) that needs to be fixed for 0.9 Release, will be rerolling the release today (in the next few hrs) and putting out a new release candidate in staging. Thanks for reporting this Andrew P. On Monday, January 20, 2014 12:34 AM, Andrew Palumbo

Re: MAHOUT 0.9 Release - New URL

2014-01-21 Thread Suneel Marthi
out the build today On Mon, Jan 20, 2014 at 6:00 AM, Suneel Marthi suneel_mar...@yahoo.comwrote: This is an issue (trivial one though) that needs to be fixed for 0.9 Release, will be rerolling the release today (in the next few hrs) and putting out a new release candidate in staging. Thanks

Re: MAHOUT 0.9 Release - New URL

2014-01-21 Thread Suneel Marthi
at 9:23 AM, Suneel Marthi suneel_mar...@yahoo.comwrote: Thanks Andrew M., see that some of the example scripts need to be fixed as they still refer to the deprecated algorithms. See that the Streaming KMeans has failed for you as well. I'll be rolling back the release today to fix

Re: MAHOUT 0.9 Release - New URL

2014-01-22 Thread Suneel Marthi
Fixed the issues that were reported this week and restored FP mining into the codebase. Here's the URL for the final release in staging:- https://repository.apache.org/content/repositories/orgapachemahout-1003/org/apache/mahout/mahout-distribution/0.9/ The artifacts have been signed with the

Re: MAHOUT 0.9 Release - New URL

2014-01-22 Thread Suneel Marthi
Same here. I did a), b), c) and d) too and all tests pass. Here's my +1, if my vote counts. On Wednesday, January 22, 2014 7:11 PM, Sebastian Schelter s...@apache.org wrote: I did a) b) c) and d) without noting any problem so far. +1 from me. --sebastian On 01/22/2014 11:55 PM, Suneel

Re: cluster-reuters.sh broken in trunk

2014-01-24 Thread Suneel Marthi
'No Flags' ???  Could u post the command u were trying? On Friday, January 24, 2014 11:38 AM, Andrew Musselman andrew.mussel...@gmail.com wrote: Last night I had this issue when testing out cluster-reuters.sh with no flags; anyone seen this recently? 14/01/23 22:03:54 INFO

Re: cluster-reuters.sh broken in trunk

2014-01-24 Thread Suneel Marthi
I assume u r running this in MR mode??  Could u clear up your /tmp/mahout-work- folder and try again. On Friday, January 24, 2014 1:56 PM, Andrew Musselman andrew.mussel...@gmail.com wrote: Actually, getting the same error with a fresh svn checkout: 14/01/24 09:42:13 INFO

Re: MAHOUT 0.9 Release - New URL

2014-01-24 Thread Suneel Marthi
Ingersoll gsing...@apache.org wrote: +1 from me. On Jan 22, 2014, at 5:55 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Fixed the issues that were reported this week and restored FP mining into the codebase. Here's the URL for the final release in staging

Re: MAHOUT 0.9 Release - New URL

2014-01-24 Thread Suneel Marthi
at 2:22 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Thanks for all those that volunteered.  The voting for 0.9 Release closes tomorrow. On Friday, January 24, 2014 4:05 AM, Gokhan Capan gkhn...@gmail.com wrote: Using CentOS 6.5 and hadoop 1.2.1, all passed. +1 from me Gokhan

Re: MAHOUT 0.9 Release - New URL

2014-01-25 Thread Suneel Marthi
PM, Ted Dunning ted.dunn...@gmail.com wrote: My schedule has opened up a bit and I can review as well. On Fri, Jan 24, 2014 at 3:06 PM, Sebastian Schelter ssc.o...@googlemail.com wrote: I will try the next candidate agaim, so one vote is sure. Am 24.01.2014 23:54 schrieb Suneel Marthi

Re: [jira] [Updated] (MAHOUT-1410) clusteredPoints do not contain a vector id

2014-01-25 Thread Suneel Marthi
) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)   at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) On Sat, Jan 25, 2014 at 1:31 AM, Suneel Marthi (JIRA) j...@apache.orgwrote:       [ https://issues.apache.org/jira

Re: MAHOUT 0.9 Release - New URL

2014-01-25 Thread Suneel Marthi
Rolled back trunk to 0.9-SNAPSHOT, please go ahead and commit any changes. On Saturday, January 25, 2014 4:19 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: I'll be rolling back the 0.9 Release today that's presently in staging in light of the issues that have been reported in the last 2

Re: Build failed in Jenkins: Mahout-Examples-Classify-20News #414

2014-01-27 Thread Suneel Marthi
Its been failing on one of the nodes but succeeds on others.  This has been happening, have not looked at it deeply myself. On Monday, January 27, 2014 3:21 AM, Stevo Slavić ssla...@gmail.com wrote: Odd, apart from being run on different Jenkins nodes and few comments added, this failed

Mahout Math-scala version

2014-01-27 Thread Suneel Marthi
Do we need to upgrade the scala version in mahout? See these warnings while running a build: [WARNING]  Expected all dependencies to require Scala version: 2.9.3 [WARNING]  org.apache.mahout:mahout-math-scala:0.9-SNAPSHOT requires scala version: 2.9.3 [WARNING] 

Re: Mahout Math-scala version

2014-01-27 Thread Suneel Marthi
Sorry please ignore this, its an issue with my local setup. On Monday, January 27, 2014 3:27 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Do we need to upgrade the scala version in mahout? See these warnings while running a build: [WARNING]  Expected all dependencies to require

Re: Problem with CVB

2014-01-27 Thread Suneel Marthi
In Step #, u r generating tf vectors but r expecting tf-idf vectors in Step 4. Change the weight in Step 3 to tfidf (which is the default BTW if none specified). On Monday, January 27, 2014 1:44 PM, Ted Dunning ted.dunn...@gmail.com wrote: I am forwarding this to the list for Peyman.

Re: Problem with CVB

2014-01-27 Thread Suneel Marthi
In Step 3, u r generating tf vectors but r expecting tf-idf vectors in Step 4. Change the weight in Step 3 to tfidf (which is the default BTW if none specified). On , Suneel Marthi suneel_mar...@yahoo.com wrote: In Step #, u r generating tf vectors but r expecting tf-idf vectors in Step 4

Re: Problem with CVB

2014-01-27 Thread Suneel Marthi
, not tfidf).  it seems to be working.  thank you for your help Suneel Peyman   On Jan 27, 2014, at 1:51 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: In Step 3, u r generating tf vectors but r expecting tf-idf vectors in Step 4. Change the weight in Step 3 to tfidf (which is the default

Re: Test failure in TDigestTest

2014-01-28 Thread Suneel Marthi
These failures are not repeatable, and had seen this happen a few times. The tolerance margin for this statistical test is presently set at 0.005. I once had a test failure that read:  java.lang.AssertionError: expected:0.5 but was:0.50578 Maybe change the fuzzfactor for this test from present

Mahout 0.9 Release

2014-01-28 Thread Suneel Marthi
Fixed the issues that were reported with Clustering code this past week, upgraded codebase to Lucene 4.6.1 that was released today. Here's the URL for the 0.9 release in staging:- https://repository.apache.org/content/repositories/orgapachemahout-1004/org/apache/mahout/mahout-distribution/0.9/

Re: Mahout 0.9 Release

2014-01-29 Thread Suneel Marthi
+1 from me On Wednesday, January 29, 2014 8:58 AM, Sebastian Schelter s...@apache.org wrote: +1 On 01/29/2014 05:25 AM, Andrew Musselman wrote: Looks good. +1 On Tue, Jan 28, 2014 at 8:07 PM, Andrew Palumbo ap@outlook.com wrote: a), b), c), d) all passed here.

Re: Mahout 0.9 Release

2014-01-30 Thread Suneel Marthi
AM EST, Suneel Marthi wrote: +1 from me On Wednesday, January 29, 2014 8:58 AM, Sebastian Schelter s...@apache.org wrote: +1 On 01/29/2014 05:25 AM, Andrew Musselman wrote: Looks good. +1 On Tue, Jan 28, 2014 at 8:07 PM, Andrew Palumbo ap@outlook.com

Re: Mahout 0.9 Release

2014-01-31 Thread Suneel Marthi
...@gatech.edu wrote: LGTM On 1/29/14, 4:27 PM, peng wrote: +1, can't see a bad side. On Wed 29 Jan 2014 11:33:02 AM EST, Suneel Marthi wrote: +1 from me On Wednesday, January 29, 2014 8:58 AM, Sebastian Schelter s...@apache.org wrote: +1 On 01/29/2014 05:25 AM, Andrew Musselman

Re: Mahout 0.9 Release

2014-02-02 Thread Suneel Marthi
Mahout 0.9 has been pushed to the mirrors and is available for download at http://www.apache.org/dyn/closer.cgi/mahout/ On Friday, January 31, 2014 11:21 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: The release has passed with the required votes from PMC, will be pushing 0.9

Re: Optimization of AbstractLogisticRegression.regularize?

2014-02-03 Thread Suneel Marthi
On Monday, February 3, 2014 7:08 PM, Ted Dunning ted.dunn...@gmail.com wrote: Optimization of this kind is definitely worthwhile, but 10% improvements probably are a bit small.  Your difficulty in reproducing the results indicates how ephemeral small improvements can be.  If you find a 2x

Re: Mahout 0.9 Release Notes - First Draft

2014-02-11 Thread Suneel Marthi
Here's a draft of the Release Notes for Mahout 0.9, Please review the same. -- The Apache Mahout PMC is pleased to announce the release of Mahout 0.9. Mahout's goal is to build scalable machine learning libraries focused primarily in the areas of collaborative

Mahout unavailable from mirrors

2014-02-15 Thread Suneel Marthi
Apache Mahout (all releases) are presently unavailable for download as all the Mahout releases were accidentally blown out from all the mirrors during Infrastructure maintenance. Anyone looking to download Mahout latest or older releases can do so from the archives at

Mahout unavailable from mirrors

2014-02-15 Thread Suneel Marthi
Apache Mahout (all releases) are presently unavailable for download as all the Mahout releases were accidentally blown out from all the mirrors during Infrastructure maintenance. Anyone looking to download Mahout latest or older releases can do so from the archives at

Re: Mahout unavailable from mirrors

2014-02-15 Thread Suneel Marthi
. On Saturday, February 15, 2014 8:08 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Apache Mahout (all releases) are presently unavailable for download as all the Mahout releases were accidentally blown out from all the mirrors during Infrastructure maintenance. Anyone looking to download Mahout

Re: Mahout 0.9 Release Notes - First Draft

2014-02-18 Thread Suneel Marthi
, Suneel Marthi suneel_mar...@yahoo.com wrote: Here's a draft of the Release Notes for Mahout 0.9, Please review the same. -- The Apache Mahout PMC is pleased to announce the release of Mahout 0.9. Mahout's goal is to build scalable machine learning libraries

Apache Mahout 0.9 released

2014-02-18 Thread Suneel Marthi
The Apache Mahout PMC is pleased to announce the release of Mahout 0.9. Mahout's goal is to build scalable machine learning libraries focused primarily in the areas of collaborative filtering (recommenders), clustering and classification (known collectively as the 3Cs), as well as the necessary

Re: Mahout 0.9 Release Notes - First Draft

2014-02-18 Thread Suneel Marthi
? I've been asked to write a short blog on the release but wanted to wait until the site is updated. Thanks much Ellen On Tue, Feb 11, 2014 at 10:06 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Here's a draft of the Release Notes for Mahout 0.9, Please review the same

Re: Hadoop 2 support

2014-02-19 Thread Suneel Marthi
Thanks for the patch Sergey. I tested this with Hadoop 1 and 2 and can confirm that all unit tests pass and the examples work. On Wednesday, February 19, 2014 9:39 AM, Sean Owen sro...@gmail.com wrote: Hmm I thought there was already a profile for this, but on second look, I only see a

Re: Hadoop 2 support

2014-02-19 Thread Suneel Marthi
Yes On Wednesday, February 19, 2014 10:43 AM, Sergey Svinarchuk ssvinarc...@hortonworks.com wrote: Thanks! This patch will be added in mahout 1.0? On Wed, Feb 19, 2014 at 5:39 PM, Suneel Marthi suneel_mar...@yahoo.comwrote: Thanks for the patch Sergey. I tested this with Hadoop 1

Re: Mahout on Spark?

2014-02-19 Thread Suneel Marthi
On Wednesday, February 19, 2014 7:22 PM, Ted Dunning ted.dunn...@gmail.com wrote: On Wed, Feb 19, 2014 at 1:55 PM, peng pc...@uowmail.edu.au wrote: But maybe mahout can include contribs that M/R is not fit for, like downpour SGD or graph-based algorithms? Yes.  Absolutely. Downpour

Re: Mahout newbiw question

2011-07-26 Thread Suneel Marthi
To: dev@mahout.apache.org; Suneel Marthi suneel_mar...@yahoo.com Sent: Tuesday, July 26, 2011 4:38 AM Subject: Re: Mahout newbiw question Yes, 0.5 goes with 0.20.2. HEAD/0.6 goes with 0.20.203.0 On Tue, Jul 26, 2011 at 9:02 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Never mind, figured

Re: [jira] [Created] (MAHOUT-934) Deploy sgd classifier trained model in an application

2011-12-21 Thread Suneel Marthi
You would have to package the Model into your war (if Web application) or into your jar (if its a standalone app). You can read the Model from your application using ModelSerializer.readBinary() method call. OnlineLogisticRegression sgd = ModelSerializer.readBinary( new FileInputStream(model),

Re: Minhash review

2012-01-16 Thread Suneel Marthi
Lance, I don't think this problem is confined to DisplayMinHash alone, even the regular MinHash clustering doesn't seem right when run on the Reuter's dataset (using cluster-reuters.sh) and a few other data sets I had tried.  I am playing with the the keyGroups values to determine if that

Re: Minhash review

2012-03-08 Thread Suneel Marthi
on we can work on other approaches. So if I understand correctly the vectorization step can be skipped and we can run SequenceFilesFromDirectory - CollocDriver - MinHashDriver Correct? On Thu, Mar 8, 2012 at 8:22 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Frank, I modified the present

Re: Possible issue in MinHashMapper

2012-07-30 Thread Suneel Marthi
Sean, I'll take care of this, I added this sometime last year but was never convinced that it ever worked right. From: Sean Owen sro...@gmail.com To: dev@mahout.apache.org Sent: Monday, July 30, 2012 5:18 PM Subject: Re: Possible issue in MinHashMapper

Re: Possible issue in MinHashMapper

2012-07-30 Thread Suneel Marthi
Done. From: Suneel Marthi suneel_mar...@yahoo.com To: dev@mahout.apache.org dev@mahout.apache.org Sent: Monday, July 30, 2012 6:44 PM Subject: Re: Possible issue in MinHashMapper Sean, I'll take care of this, I added this sometime last year but was never

Re: increase in warnings

2013-01-31 Thread Suneel Marthi
If there is a JIRA for this, I can work on it.  IntelliJ does highlight most of these warnings that are being reported. From: Ted Dunning ted.dunn...@gmail.com To: Mahout Dev List dev@mahout.apache.org Sent: Thursday, January 31, 2013 4:38 PM Subject:

Re: Fwd: Neural Network and Restricted Boltzman Machine in Mahout

2013-03-14 Thread Suneel Marthi
Have a look at http://www.neurosolutions.com/ for a start. From: Ying Liao yliao...@gmail.com To: dev@mahout.apache.org; Danny Busch da...@kurbel.net Sent: Thursday, March 14, 2013 2:26 PM Subject: Re: Fwd: Neural Network and Restricted Boltzman Machine in

Re: lucene 4.2.0?

2013-03-19 Thread Suneel Marthi
I can confirm that it does not, tested with Lucene 4.2. From: Grant Ingersoll gsing...@apache.org To: dev@mahout.apache.org Sent: Tuesday, March 19, 2013 10:00 AM Subject: Re: lucene 4.2.0? I wouldn't think so.  Go for it. On Mar 19, 2013, at 6:35 AM, Ted

Re: lucene 4.2.0?

2013-03-20 Thread Suneel Marthi
, 2013 at 5:12 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: I can confirm that it does not, tested with Lucene 4.2.   From: Grant Ingersoll gsing...@apache.org To: dev@mahout.apache.org Sent: Tuesday, March 19, 2013 10:00 AM Subject: Re

Re: Call to action – Mahout needs your help

2013-03-25 Thread Suneel Marthi
I have been occasionally contributing patches for JIRA tickets when time permits, always wanted to make a major contribution to Mahout but was not sure as to what the vision of the project and what was expected by way of contributions. I would be more than willing to make a major contribution.

Re: Mahout Suggestions - Refactoring Effort

2013-03-26 Thread Suneel Marthi
Gokhan, Thinking loud here, I have not tried running LDA so I could be wrong. As a precursor to Step 2 below, did u try running the RowIdJob that should create IntWritable, VectorWritable pairs.  It also a creates a 'docIndex' which is IntWritable, Text  to map the Ints back to the original

Re: ModelSerializer.writeToJson()

2013-03-31 Thread Suneel Marthi
Why would you want to do that?  Isn't it easier to just package the model into your application war/jar and read the same a s ResourceStream as opposed to what you are proposing? From: Arun Avanathan arun.avanat...@gmail.com To: dev@mahout.apache.org Sent:

Re: ModelSerializer.writeToJson()

2013-03-31 Thread Suneel Marthi
arun.avanat...@gmail.com To: dev@mahout.apache.org; Suneel Marthi suneel_mar...@yahoo.com Sent: Sunday, March 31, 2013 9:47 PM Subject: Re: ModelSerializer.writeToJson() Idea is to ETL the serialized models on a schedule basis from one application to another. War/Jar approach is another way

Re: CosineDistanceMeasure for 2 zero vectors?

2013-04-04 Thread Suneel Marthi
Code from CosineDistanceMeasure     // correct for zero-vector corner case     if (denominator == 0 dotProduct == 0) {   return 1;     }     Seems like a bug to me, agree with Dan it should be 0 (and not 1). From: Dan Filimon dangeorge.fili...@gmail.com

Re: Welcome Suneel Marthi and Dan Filimon

2013-04-07 Thread Suneel Marthi
AM Subject: Welcome Suneel Marthi and Dan Filimon In recognition of the contributions of Suneel Marthi and Dan Filimon to the Mahout project, the PMC is pleased to announce both have accepted our invitations to join the Mahout project as committers. As is customary, I will leave it to Suneel

Re: Code reviews and reviewers

2013-04-09 Thread Suneel Marthi
+3 From: Andrew Musselman andrew.mussel...@gmail.com To: dev@mahout.apache.org Sent: Tuesday, April 9, 2013 9:16 AM Subject: Re: Code reviews and reviewers +1 for code reviews +1 for Review Board +1 for unit tests and integration tests On Tue, Apr 9, 2013

Re: Mahout 1.0 goals

2014-02-27 Thread Suneel Marthi
With the announcement of http://deeplearning4j.org yesterday which is various Neural Networks implementations on Hadoop 2/JBlas that had been talked about in one of the other discussion threads on this mailing list. Do we wanna duplicate a similar effort in Mahout? In addition to what

Re: Mahout 1.0 goals

2014-02-28 Thread Suneel Marthi
First steps towards the loving care (in my view) :- a) Address the issues that Sean's brought up. I wasn't aware of (i) in that list else I would have ensured that they were addressed in 0.9. b) Most of the backlog JIRAs (about 28 of them today) go all the way back to the initial stages of

Re: [jira] [Updated] (MAHOUT-1178) GSOC 2013: Improve Lucene support in Mahout

2014-03-02 Thread Suneel Marthi
. On Sun 02 Mar 2014 05:01:26 PM EST, Suneel Marthi (JIRA) wrote: [ https://issues.apache.org/jira/browse/MAHOUT-1178?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Suneel Marthi updated MAHOUT-1178: -- Fix Version/s

Re: Mahout 1.0 goals

2014-03-03 Thread Suneel Marthi
Grant had setup a Google Hangout for Mahout sometime last year before 0.8 release.  I had one setup too for 0.9 release. I definitely wouldn't want to have a hangout on Saturday or weekend. On Monday, March 3, 2014 12:52 PM, Ted Dunning ted.dunn...@gmail.com wrote: Happy to organize a

Re: Mahout 1.0 goals

2014-03-03 Thread Suneel Marthi
PM, Sebastian Schelter s...@apache.org wrote: I would like to discuss whether we should start to have some Spark-related code in Mahout. --sebastian On 03/03/2014 06:56 PM, Suneel Marthi wrote: Grant had setup a Google Hangout for Mahout sometime last year before 0.8 release.  I had one

Re: Mahout 1.0 goals

2014-03-03 Thread Suneel Marthi
wrote: So Friday afterwork_ 2014-03-03 18:56 GMT+01:00 Suneel Marthi suneel_mar...@yahoo.com: Grant had setup a Google Hangout for Mahout sometime last year before 0.8 release. I had one setup too for 0.9 release. I definitely wouldn't want to have a hangout on Saturday or weekend

Re: Mahout 1.0 goals

2014-03-03 Thread Suneel Marthi
IRC channel Sent from my iPhone On Mar 3, 2014, at 1:58 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: There is an RISC channel for mahout on freenode that's still active . Sent from my iPhone On Mar 3, 2014, at 1:46 PM, Andrew Musselman andrew.mussel...@gmail.com wrote: How

Re: kMeans Implementation

2014-03-04 Thread Suneel Marthi
He's talking about simple kmeans which is a mapper only job. Sean's already addressed his question Sent from my iPhone On Mar 4, 2014, at 5:49 AM, Sebastian Schelter s...@apache.org wrote: We have several implementations of k-Means, which one do you refer to? --sebastian On 03/04/2014

Re: heads up for MAHOUT-1346

2014-03-06 Thread Suneel Marthi
Have not seen this happen, this is the Frequent PAttern code that was resurrected at the last minute for .9 release; but haven't seen this failure. Does it fail consistently? On Thursday, March 6, 2014 11:41 PM, Dmitriy Lyubimov dlie...@gmail.com wrote: I have this test failure during due

Re: Welcome Andrew Musselman as new comitter

2014-03-07 Thread Suneel Marthi
Congrats Andrew. On Friday, March 7, 2014 12:13 PM, Sebastian Schelter s...@apache.org wrote: Hi, this is to announce that the Project Management Committee (PMC) for Apache Mahout has asked Andrew Musselman to become committer and we are pleased to announce that he has accepted. Being a

Fwd: Light weight process for Examples contributions

2014-03-08 Thread Suneel Marthi
Forwarding on pat's behalf Sent from my iPhone Begin forwarded message: From: Pat Ferrel p...@occamsmachete.com Date: March 8, 2014 at 11:48:57 AM EST To: Suneel Marthi suneel_mar...@yahoo.com, Andrew Musselman andrew.mussel...@gmail.com Subject: Fwd: Light weight process for Examples

Re: Mahout 1.0 goals

2014-03-08 Thread Suneel Marthi
On Saturday, March 8, 2014 5:41 PM, Pat Ferrel p...@occamsmachete.com wrote: Ah, now back to freely babbling on the dev list. Mahout wishlist: 1) scaling:  I don’t get the need for R integration or running without hadoop or spark. You can run hadoop in local mode on your native file

Re: How To Release page

2014-03-09 Thread Suneel Marthi
Sure, will do it later tonight. In light of Mahout's recent migration to svnpubsub, release notes need to be updated to reflect that. On Sunday, March 9, 2014 9:12 AM, Sebastian Schelter s...@apache.org wrote: Hi Suneel, I have a favor to ask. Could you have a look at the How To Release

Re: How To Release page

2014-03-09 Thread Suneel Marthi
ssc, could u create a jira for this? thanks On Sunday, March 9, 2014 11:19 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Sure, will do it later tonight. In light of Mahout's recent migration to svnpubsub, release notes need to be updated to reflect that. On Sunday, March 9

Re: [jira] [Created] (MAHOUT-1450) Cleaning up k-means documentation on mahout website

2014-03-12 Thread Suneel Marthi
This is just not right, look at the example scripts first and update the documentation accordingly. Sent from my iPhone On Mar 12, 2014, at 6:29 AM, Pavan Kumar N (JIRA) j...@apache.org wrote: Pavan Kumar N created MAHOUT-1450: - Summary:

Re: Lucene issue in recommenditembased example

2014-03-13 Thread Suneel Marthi
Could u print the complete stacktrace? On Thursday, March 13, 2014 7:31 PM, Andrew Musselman andrew.mussel...@gmail.com wrote: I'm getting this error repeated for several attempts in the last phase of the recommenditembased example on EMR with the default AMI and Hadoop version and a fresh

Re: Lucene issue in recommenditembased example

2014-03-13 Thread Suneel Marthi
(Minutes: 18.7459) On Thu, Mar 13, 2014 at 5:22 PM, Suneel Marthi suneel_mar...@yahoo.comwrote: Could u print the complete stacktrace? On Thursday, March 13, 2014 7:31 PM, Andrew Musselman andrew.mussel...@gmail.com wrote: I'm getting this error repeated for several attempts in the last phase

Re: [jira] [Comment Edited] (MAHOUT-1426) GSOC 2013 Neural network algorithms

2014-03-16 Thread Suneel Marthi
I would suggest looking at deeplearning4j.org (they went public very recently) and see how they had utilized Iterative Reduce for implementing Neural Nets. Not sure given the present state of flux on the project if we should even be considering adding any new algorithms. The existing ones can

Re: [jira] [Created] (MAHOUT-1467) ClusterClassifier read/writePolicy leak file handles

2014-03-17 Thread Suneel Marthi
Could u submit a patch? Please work off of trunk as some if the clustering code was moved around . Sent from my iPhone On Mar 17, 2014, at 3:13 PM, Avi Shinnar (JIRA) j...@apache.org wrote: Avi Shinnar created MAHOUT-1467: --- Summary:

Re: Plan for 1.0

2014-03-19 Thread Suneel Marthi
I had a hangout setup for 0.9, not sure if its still valid;  I can check on that or can set one up now. When would people wanna have it? Mondays and Wednesdays don't work for me.  Would Tuesdays 6pm Eastern Time work ? On Wednesday, March 19, 2014 2:45 AM, Sebastian Schelter

Re: Plan for 1.0

2014-03-19 Thread Suneel Marthi
two weeks starting from Tuesday. Best, Sebastian On 03/19/2014 07:55 AM, Suneel Marthi wrote: I had a hangout setup for 0.9, not sure if its still valid; I can check on that or can set one up now. When would people wanna have it? Mondays and Wednesdays don't work for me. Would

Re: [GSOC 2014] Uniform API for Mahout Clustering

2014-03-19 Thread Suneel Marthi
On Wednesday, March 19, 2014 3:09 AM, Dmitriy Lyubimov dlie...@gmail.com wrote: On Tue, Mar 18, 2014 at 11:56 PM, chalitha udara Perera chalithaud...@gmail.com wrote: Hi Dmitriy, I agree with you that i need to be more specific on this matter. Here I was referring to some suggestion

Re: Plan for 1.0

2014-03-19 Thread Suneel Marthi
08:05 AM, Suneel Marthi wrote: Same here, travel next week and in Amsterdam the first week of April.   I avoid Sundays or weekends for obvious reasons. How bout this Friday? Sent from my iPhone   On Mar 19, 2014, at 3:02 AM, Sebastian Schelter s...@apache.org wrote: Would some time

Re: Plan for 1.0

2014-03-20 Thread Suneel Marthi
wrote: Friday would also work for me.   On 03/19/2014 08:05 AM, Suneel Marthi wrote: Same here, travel next week and in Amsterdam the first week of April.   I avoid Sundays or weekends for obvious reasons. How bout this Friday? Sent from my iPhone   On Mar 19, 2014, at 3:02 AM

<    1   2   3   4   5   6   7   8   9   10   >