Re: SVM implementaion

2013-11-19 Thread unmesha sreeveni
http://www.distilnetworks.com/ They are using svm right? On Mon, Nov 11, 2013 at 11:52 AM, unmesha sreeveni unmeshab...@gmail.comwrote: can any one share a file with 1/-1 classification dataset..?? On Mon, Nov 11, 2013 at 11:26 AM, unmesha sreeveni unmeshab...@gmail.comwrote: I am

Limit on total jobs running using fair scheduler

2013-11-19 Thread Ivan Tretyakov
Hello! We are using CDH 4.1.1 (Version: 2.0.0-mr1-cdh4.1.1) and fair-scheduler. We need to limit total number of jobs which can run at the same time on cluster. I can see maxRunningJobs options but it sets limit for pool or user. We wouldn't like to limit each pool or user we just need to set

Re: Time taken for starting AMRMClientAsync

2013-11-19 Thread Krishna Kishore Bonagiri
Hi Alejandro, I don't know what are managed and unmanaged AMs, can you please explain me what are the difference and how are each of them launched? I tried to google for these terms and came across hadoop-yarn-applications-unmanaged-am-launcher-2.2.0.jar, is it related to that? Thanks,

Fwd: class not found on namenode/datanode startup

2013-11-19 Thread Kevin D'Elia
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Hello, I have configured hadoop install according to instructions I found on the internet; when I start hadoop namenode/datanode, I get: java.lang.NoClassDefFoundError: org/apache/hadoop/hdfs/server/namenode/NameNode Caused by:

Re: Limit on total jobs running using fair scheduler

2013-11-19 Thread Omkar Joshi
Not sure about the fair scheduler but in capacity scheduler you can achieve this by controlling number of jobs/applications per queue. Thanks, Omkar Joshi *Hortonworks Inc.* http://www.hortonworks.com On Tue, Nov 19, 2013 at 3:26 AM, Ivan Tretyakov itretya...@griddynamics.com wrote: Hello!

Re: Limit on total jobs running using fair scheduler

2013-11-19 Thread Sandy Ryza
Unfortunately, this is not possible in the MR1 fair scheduler without setting the jobs for individual pools. In MR2, fair scheduler hierarchical queues will allow setting maxRunningApps at the top of the hierarchy, which would have the effect you're looking for. -Sandy On Tue, Nov 19, 2013 at

How do organisations use Hadoop and associated technologies ?

2013-11-19 Thread Percy Mistry
Hi, I am new to Hadoop and would like to understand how exactly are Hadoop and its associated technologies (like Cassandra, Hive, Pig,etc..) used to achieve a business goal ? So lets say an organization has Hadoop and Cassandra or MongoDB and everything up and running. Then what ? What do you

Re: How do organisations use Hadoop and associated technologies ?

2013-11-19 Thread Martin, Nick
Hi Percy, Just my $.02 but there are quite a lot of case studies out there from the major distributions and tech journalists. Maybe start there. I think a search for Hadoop use cases or (sorry) Big Data use cases would turn up some useful examples about how organizations are using some of the

Re: How do organisations use Hadoop and associated technologies ?

2013-11-19 Thread Percy Mistry
I found use cases but I am looking for info at a lower level - implementation of retrieval and presentation. For example in these cases - http://www.slideshare.net/McK_CMSOForum/big-data-and-advanced-analytics-16-use-cases I want to know what technology stack would be used to retrieve,

missing command dfsadmin -upgradeProgress in hadoop 2.2.0 stable

2013-11-19 Thread Henry Hung
Hi All, I already upgrade hadoop 1.0.4 to hadoop 2.2.0, but when I want to check the upgrade progress by executing bin/hdfs dfsadmin -upgradeProgress, the command is unknown for hadoop 2.2.0. Could someone tell me how to get a upgradeProgress status in the new hadoop 2.2.0 stable? Best

答复: missing command dfsadmin -upgradeProgress in hadoop 2.2.0 stable

2013-11-19 Thread 麦树荣
Hi YES. I had same experience. When I upgraded hadoop1.x to hadoop2.2.0, I couldn’t execute command “bin/hdfs dfsadmin �CupgradeProgress” in hadoop 2.2.0 which could be executed in hadoop1.x(CDH3,hadoop0.20.X). 发件人: Henry Hung [mailto:ythu...@winbond.com] 发送时间: 2013年11月20日 11:26 收件人:

Large datasets for Hadoop

2013-11-19 Thread unmesha sreeveni
Where can i find a large dataset nearly 1GB for examining SVM training phase I have gone through 1. https://www.kaggle.com/ 2. http://archive.ics.uci.edu/ml/datasets/ But this did nt solved my problem they are all KB files I am in search of a large dataset with numeric data 1/-1 class or 1/0