Re: How to set mapreduce.input.fileinputformat.split.maxsize for a specific job

2015-05-17 Thread Answer Agrawal
); Docs for Configuration class: https://hadoop.apache.org/docs/current/api/org/apache/hadoop/conf/Configuration.html This will work as long as the property is not marked final. Regards, Shahab On Sat, May 16, 2015 at 10:49 AM, Answer Agrawal yrsna.tse...@gmail.com wrote: Hi

How to set mapreduce.input.fileinputformat.split.maxsize for a specific job

2015-05-16 Thread Answer Agrawal
Hi, In the XML configuration files of Hadoop 2.x, mapreduce.input.fileinputformat.split.minsize is listed and can be set, but how do I set mapreduce.input.fileinputformat.split.maxsize in the XML file? I also need to set it in my MapReduce code. Thanks,
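As a rough illustration of what this property controls: as far as I know, Hadoop 2.x FileInputFormat picks the split size as max(minSize, min(maxSize, blockSize)), so lowering split.maxsize below the block size yields more, smaller splits. The sketch below reproduces only that arithmetic with the standard library; the class name SplitSizeSketch and the sample sizes are made up for illustration. In a real driver the value would normally be set per job before submission, for example with Configuration.setLong on the property name or with FileInputFormat.setMaxInputSplitSize(job, bytes); that part is not shown because it needs the Hadoop jars on the classpath.

```java
// Stdlib-only sketch (hypothetical class name) of the split-size arithmetic.
final class SplitSizeSketch {
    // Property name taken from the question above.
    static final String SPLIT_MAXSIZE =
            "mapreduce.input.fileinputformat.split.maxsize";

    // Assumed to mirror FileInputFormat's rule: clamp the block size
    // between the configured minimum and maximum split sizes.
    static long computeSplitSize(long blockSize, long minSize, long maxSize) {
        return Math.max(minSize, Math.min(maxSize, blockSize));
    }

    public static void main(String[] args) {
        long blockSize = 128L * 1024 * 1024; // illustrative HDFS block size
        long minSize = 1L;                   // illustrative minimum split size
        long maxSize = 64L * 1024 * 1024;    // illustrative per-job maximum

        long splitSize = computeSplitSize(blockSize, minSize, maxSize);
        System.out.println(SPLIT_MAXSIZE + "=" + maxSize
                + " gives split size " + splitSize);
    }
}
```

With these sample numbers the maximum wins over the block size, so each 128 MB block would be read as two 64 MB splits.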

How to find execution time of a MapReduce job?

2015-05-15 Thread Answer Agrawal
Hi,
    Date date; long start, end; // for recording start and end time of job
    date = new Date(); start = date.getTime(); // starting timer
    job.waitForCompletion(true);
    date = new Date(); end = date.getTime(); // end timer
    log.info("Total Time (in milliseconds) = " + (end-start));
    log.info("Total Time
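The same start/stop pattern can be sketched without Hadoop on the classpath. This version swaps Date.getTime() for System.nanoTime(), which is monotonic and so is not affected by wall-clock adjustments during a long job. The class JobTimerSketch, the method timeMillis, and the sleeping lambda are hypothetical stand-ins; in a driver the timed call would be job.waitForCompletion(true).

```java
import java.util.concurrent.Callable;

// Stdlib-only sketch (hypothetical names) of timing one blocking call.
final class JobTimerSketch {
    // Runs the blocking call and returns the elapsed time in milliseconds.
    static long timeMillis(Callable<Boolean> blockingCall) throws Exception {
        long start = System.nanoTime();   // start timer
        blockingCall.call();              // stand-in for job.waitForCompletion(true)
        long end = System.nanoTime();     // end timer
        return (end - start) / 1_000_000L;
    }

    public static void main(String[] args) throws Exception {
        // The sleep simulates a job that blocks until it completes.
        long elapsed = timeMillis(() -> {
            Thread.sleep(50);
            return true;
        });
        System.out.println("Total Time (in milliseconds) = " + elapsed);
    }
}
```

This measures the time seen by the client, including submission and scheduling delays, not just the time the map and reduce tasks spend running.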

How to access value of variable in Driver class which has been declared and modified inside Mapper class?

2015-05-12 Thread Answer Agrawal
Hi, I declared a variable and incremented/modified it inside the Mapper class. Now I need to use the modified value of that variable in the Driver class. I declared a static variable inside the Mapper class, and its modified value is visible in the Driver class when I run the code in the Eclipse IDE. But after creating

Re: Can we control data distribution and load balancing in Hadoop Cluster?

2015-05-04 Thread Answer Agrawal
at 11:59 PM, Answer Agrawal yrsna.tse...@gmail.com wrote: Hi, As I understand it, data distribution, load balancing, and fault tolerance are implicit in Hadoop. But I need to customize them; can we do that? Thanks

Can we control data distribution and load balancing in Hadoop Cluster?

2015-05-03 Thread Answer Agrawal
Hi, As I understand it, data distribution, load balancing, and fault tolerance are implicit in Hadoop. But I need to customize them; can we do that? Thanks

How to stop a mapreduce job from terminal running on Hadoop Cluster?

2015-04-12 Thread Answer Agrawal
To run a job we use the command: $ hadoop jar example.jar inputpath outputpath. If the job is taking too long and we want to stop it midway, which command is used? Or is there any other way to do that? Thanks,

Can we run mapreduce job from eclipse IDE on fully distributed mode hadoop cluster?

2015-04-11 Thread Answer Agrawal
A MapReduce job can be run as a jar file from the terminal or directly from the Eclipse IDE. When a job is run as a jar file from the terminal, it uses multiple JVMs and all the resources of the cluster. Does the same thing happen when we run it from the IDE? I have run a job both ways, and it takes less time in the IDE than as a jar file on

Re: Can we run mapreduce job from eclipse IDE on fully distributed mode hadoop cluster?

2015-04-11 Thread Answer Agrawal
. --- Regards, Jonathan Aquilina Founder Eagle Eye T On 2015-04-11 08:23, Answer Agrawal wrote: A MapReduce job can be run as a jar file from the terminal or directly from the Eclipse IDE. When a job is run as a jar file from the terminal, it uses multiple JVMs and all the resources of the cluster. Does the same