Re: How can I get one plugin's root dir

Doug Cutting Tue, 16 Jan 2007 12:17:00 -0800

Andrzej Bialecki wrote:

The reason is that if you pack this file into your job JAR, the job jar would become very large (presumably this 40MB is already compressed?). Job jar needs to be copied to each tasktracker for each task, so you will experience performance hit just because of the size of the job jar ... whereas if this file sits on DFS and is highly replicated, its content will always be available locally.

Note that the job jar is copied into HDFS with a highish replication (10?), and that it is only copied to each tasktracker node once per *job*, not per task. So it's only faster to manage this yourself if you have a sequence of jobs that share this data, and if the time to re-replicate it per job is significant.


Doug
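
To put rough numbers on that tradeoff, here is a back-of-the-envelope sketch. It is not taken from Hadoop or Nutch code: the function name, the cost model, and the cluster size in the usage note are all assumptions for illustration. It counts only megabytes moved, assumes a bundled file is re-replicated into HDFS and fetched once per node for every job, and assumes a file staged on DFS is replicated once to every node.

```python
def total_copy_mb(file_mb, nodes, jobs, bundled_in_jar, jar_replication=10):
    """Rough MB moved for a shared data file (hypothetical cost model)."""
    if bundled_in_jar:
        # Per job: the jar is written to HDFS at jar_replication,
        # then fetched once by each tasktracker node (once per job, not per task).
        per_job = file_mb * jar_replication + file_mb * nodes
        return per_job * jobs
    # Assumption: staged once on DFS and replicated to every node,
    # so later jobs read it locally at no further copy cost.
    return file_mb * nodes


if __name__ == "__main__":
    # Hypothetical cluster: a 40 MB file on 20 nodes.
    for jobs in (1, 5):
        bundled = total_copy_mb(40, 20, jobs, bundled_in_jar=True)
        staged = total_copy_mb(40, 20, jobs, bundled_in_jar=False)
        print(jobs, "job(s): bundled", bundled, "MB, staged", staged, "MB")
```

Under these assumptions the bundled cost grows linearly with the number of jobs while the staged cost is paid once, so the gap only becomes significant when several jobs share the same data.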
