No luck. I get the same error even when using a single reducer. I'm
attaching the job configuration as shown in the web UI.

When I look at the job tracker for the job, it has no map tasks. Is that
expected? I've never heard of a reduce-only job.

J


On Fri, Mar 28, 2014 at 6:45 AM, Jeremy Lewi <[email protected]> wrote:

> This is my first time on a cluster. I'll try what Josh suggests now.
>
> J
>
>
> On Fri, Mar 28, 2014 at 3:41 AM, Josh Wills <[email protected]> wrote:
>
>>
>> On Fri, Mar 28, 2014 at 1:22 AM, Gabriel Reid <[email protected]> wrote:
>>
>>> Hi Jeremy,
>>>
>>> On Thu, Mar 27, 2014 at 3:26 PM, Jeremy Lewi <[email protected]> wrote:
>>> > Hi
>>> >
>>> > I'm hitting the exception pasted below when using AvroPathPerKeyTarget.
>>> > Interestingly, my code works just fine when I run on a small dataset
>>> using
>>> > the LocalJobTracker. However, when I run on a large dataset using a
>>> hadoop
>>> > cluster I hit the exception.
>>> >
>>>
>>> Have you ever been able to successfully use the AvroPathPerKeyTarget
>>> on a real cluster, or is this the first try with it?
>>>
>>> I'm wondering if this could be a problem that's always been around (as
>>> the integration test for AvroPathPerKeyTarget also runs in the local
>>> jobtracker), or if this could be something new.
>>>
>>
>> +1-- Jeremy, if you force the job to run w/a single reducer on the
>> cluster (i.e., via groupByKey(1)), does it work?
>>
>>
>>>
>>> - Gabriel
>>>
>>
>>
>
Job Configuration: JobId - job_201312300132_0420


name = value
job.end.retry.interval = 30000
avro.map.output.schema = {"type":"record","name":"Pair","namespace":"org.apache.avro.mapred","fields":[{"name":"key","type":"string","doc":""},{"name":"value","type":{"type":"record","name":"MatePair","namespace":"contrail.sequences","doc":"Mate File Record Structure","fields":[{"name":"left","type":{"type":"record","name":"FastQRecord","doc":"A FastQ Read","fields":[{"name":"id","type":"string"},{"name":"read","type":"string"},{"name":"qvalue","type":"string"}]}},{"name":"right","type":"FastQRecord"}]},"doc":"","order":"ignore"}]}
io.bytes.per.checksum = 512
mapred.job.tracker.retiredjobs.cache.size = 1000
mapreduce.jobhistory.cleaner.interval-ms = 86400000
mapred.queue.default.acl-administer-jobs = *
dfs.image.transfer.bandwidthPerSec = 0
mapred.task.profile.reduces = 0-2
mapreduce.jobtracker.staging.root.dir = ${hadoop.tmp.dir}/mapred/staging
mapreduce.job.cache.files.visibilities = true,true
mapred.job.reuse.jvm.num.tasks = 1
dfs.block.access.token.lifetime = 600
mapred.reduce.tasks.speculative.execution = true
mapred.job.name = contrail.scaffolding.FilterReads: Avro(/tmp/crunch-797729802/p6)+GBK+ungroup+AvroFilePerKey... (4/4)
hadoop.http.authentication.kerberos.keytab = ${user.home}/hadoop.keytab
dfs.permissions.supergroup = supergroup
io.seqfile.sorter.recordlimit = 1000000
hadoop.relaxed.worker.version.check = false
mapred.task.tracker.http.address = 0.0.0.0:50060
dfs.namenode.delegation.token.renew-interval = 86400000
fs.ramfs.impl = org.apache.hadoop.fs.InMemoryFileSystem
mapred.system.dir = ${hadoop.tmp.dir}/mapred/system
dfs.namenode.edits.toleration.length = 0
mapred.task.tracker.report.address = 127.0.0.1:0
mapreduce.reduce.shuffle.connect.timeout = 180000
mapreduce.job.counters.max = 120
dfs.datanode.readahead.bytes = 4193404
mapred.healthChecker.interval = 60000
mapreduce.job.complete.cancel.delegation.tokens = true
dfs.namenode.replication.work.multiplier.per.iteration = 2
fs.trash.interval = 0
hadoop.jetty.logs.serve.aliases = true
mapred.skip.map.auto.incr.proc.count = true
hadoop.http.authentication.kerberos.principal = HTTP/localhost@LOCALHOST
mapred.child.tmp = ./tmp
fs.gsb.impl = com.google.cloud.hadoop.fs.gcs.GoogleHadoopBucketRootedFileSystem
fs.gs.enable.service.account.auth = true
mapred.tasktracker.taskmemorymanager.monitoring-interval = 5000
crunch.work.dir = /tmp/crunch-797729802/p9
dfs.datanode.http.address = 0.0.0.0:50075
mapred.output.key.comparator.class = org.apache.avro.mapred.AvroKeyComparator
io.sort.spill.percent = 0.80
dfs.namenode.write.stale.datanode.ratio = 0.5f
dfs.client.use.datanode.hostname = false
mapred.job.shuffle.input.buffer.percent = 0.70
dfs.max.objects = 0
hadoop.skip.worker.version.check = false
hadoop.security.instrumentation.requires.admin = false
mapred.skip.map.max.skip.records = 0
mapreduce.reduce.shuffle.maxfetchfailures = 10
hadoop.security.authorization = false
user.name = hadoop
mapred.task.profile.maps = 0-2
dfs.datanode.sync.behind.writes = false
dfs.https.server.keystore.resource = ssl-server.xml
dfs.replication.interval = 3
mapred.local.dir = ${hadoop.tmp.dir}/mapred/local
mapred.merge.recordsBeforeProgress = 10000
mapred.job.tracker.http.address = 0.0.0.0:50030
mapred.compress.map.output = false
mapred.userlog.retain.hours = 24
mapred.used.genericoptionsparser = true
mapred.tasktracker.reduce.tasks.maximum = 8
dfs.namenode.safemode.min.datanodes = 0
hadoop.security.uid.cache.secs = 14400
mapred.disk.healthChecker.interval = 60000
fs.har.impl.disable.cache = true
mapred.cluster.map.memory.mb = -1
crunch.avro.mode = SPECIFIC
dfs.data.dir = /mnt/ed0/hadoop/dfs/data,/mnt/ed1/hadoop/dfs/data,/mnt/pd0/hadoop/dfs/data
dfs.access.time.precision = 3600000
dfs.replication.min = 1
mapreduce.job.submithost = desktop-0.c.biocloudops.internal
fs.checkpoint.dir = ${hadoop.tmp.dir}/dfs/namesecondary
fs.s3n.impl = org.apache.hadoop.fs.s3native.NativeS3FileSystem
mapreduce.tasktracker.outofband.heartbeat = false
mapreduce.tasktracker.outofband.heartbeat.damper = 1000000
mapred.jobtracker.restart.recover = false
hadoop.logfile.size = 10000000
hadoop.security.token.service.use_ip = true
mapred.inmem.merge.threshold = 1000
ipc.client.connection.maxidletime = 10000
fs.checkpoint.size = 67108864
dfs.namenode.invalidate.work.pct.per.iteration = 0.32f
read_id_parser = contrail.sequences.ReadIdUtil$ReadParserUsingUnderscore
dfs.blockreport.intervalMsec = 3600000
fs.s3.sleepTimeSeconds = 10
mapreduce.job.counters.counter.name.max = 64
dfs.client.block.write.retries = 3
mapred.reduce.tasks = 1
mapred.queue.names = default
crunch.planner.dotfile = digraph G { "Avro(/speciesA/scaffolding_2014_0218/FilteredBowtieAlignments/*avro)" [label="Avro(/speciesA/scaffolding_2014_0218/FilteredBowtieAlignments/*avro)" shape=folder]; "Avro(/tmp/crunch-797729802/p1)" [label="Avro(/tmp/crunch-797729802/p1)" shape=folder]; subgraph "cluster-job1" { label="Crunch Job 1"; subgraph "cluster-job1-map" { label = Map; color = blue; "S5@899134710@1644373942" [label="S5" shape=box]; "Aggregate.count@1348188201@1644373942" [label="Aggregate.count" shape=box]; } subgraph "cluster-job1-reduce" { label = Reduce; color = red; "PTables.values@1661020955@1644373942" [label="PTables.values" shape=box]; "GBK@729117068@1644373942" [label="GBK" shape=box]; "combine@2070479808@1644373942" [label="combine" shape=box]; } } "PTables.values@1661020955@1644373942" -> "Avro(/tmp/crunch-797729802/p1)"; "Avro(/speciesA/scaffolding_2014_0218/FilteredBowtieAlignments/*avro)" -> "S5@899134710@1644373942"; "combine@2070479808@1644373942" -> "PTables.values@1661020955@1644373942"; "GBK@729117068@1644373942" -> "combine@2070479808@1644373942"; "Aggregate.count@1348188201@1644373942" -> "GBK@729117068@1644373942"; "S5@899134710@1644373942" -> "Aggregate.count@1348188201@1644373942"; }
io.seqfile.lazydecompress = true
dfs.https.enable = false
dfs.replication = 3
mapred.jobtracker.blacklist.fault-timeout-window = 180
ipc.client.tcpnodelay = false
crunch.outputs.dir = out0;N29yZy5hcGFjaGUuY3J1bmNoLnR5cGVzLmF2cm8uQXZyb1BhdGhQZXJLZXlPdXRwdXRGb3JtYXQA AAACF2F2cm8ub3V0cHV0LnNjaGVtYS5vdXQwjgFWeyJ0eXBlIjoicmVjb3JkIiwibmFtZSI6Ik1h dGVQYWlyIiwibmFtZXNwYWNlIjoiY29udHJhaWwuc2VxdWVuY2VzIiwiZG9jIjoiTWF0ZSBGaWxl IFJlY29yZCBTdHJ1Y3R1cmUiLCJmaWVsZHMiOlt7Im5hbWUiOiJsZWZ0IiwidHlwZSI6eyJ0eXBl IjoicmVjb3JkIiwibmFtZSI6IkZhc3RRUmVjb3JkIiwiZG9jIjoiQSBGYXN0USBSZWFkIiwiZmll bGRzIjpbeyJuYW1lIjoiaWQiLCJ0eXBlIjoic3RyaW5nIn0seyJuYW1lIjoicmVhZCIsInR5cGUi OiJzdHJpbmcifSx7Im5hbWUiOiJxdmFsdWUiLCJ0eXBlIjoic3RyaW5nIn1dfX0seyJuYW1lIjoi cmlnaHQiLCJ0eXBlIjoiRmFzdFFSZWNvcmQifV19EGNydW5jaC5hdnJvLm1vZGUIU1BFQ0lGSUM= ;org.apache.avro.mapred.AvroWrapper;org.apache.hadoop.io.NullWritable
mapred.acls.enabled = false
mapred.tasktracker.dns.nameserver = default
mapred.submit.replication = 10
io.compression.codecs = org.apache.hadoop.io.compress.DefaultCodec,org.apache.hadoop.io.compress.GzipCodec,org.apache.hadoop.io.compress.BZip2Codec,org.apache.hadoop.io.compress.SnappyCodec
io.file.buffer.size = 4096
mapred.map.tasks.speculative.execution = true
bowtie_alignments = /speciesA/scaffolding_2014_0218/FilteredBowtieAlignments/*avro
mapreduce.job.split.metainfo.maxsize = 10000000
mapred.map.max.attempts = 4
mapred.job.shuffle.merge.percent = 0.66
fs.har.impl = org.apache.hadoop.fs.HarFileSystem
hadoop.security.authentication = simple
fs.s3.buffer.dir = ${hadoop.tmp.dir}/s3
mapred.skip.reduce.auto.incr.proc.count = true
dfs.http.address = 0.0.0.0:50070
mapred.job.tracker.jobhistory.lru.cache.size = 5
dfs.namenode.avoid.read.stale.datanode = false
dfs.datanode.drop.cache.behind.writes = false
dfs.replication.considerLoad = true
mapred.jobtracker.blacklist.fault-bucket-width = 15
dfs.block.access.token.enable = false
mapreduce.job.acl-view-job =
mapred.job.queue.name = default
dfs.permissions = true
mapred.job.tracker.persist.jobstatus.hours = 0
fs.gs.impl = com.google.cloud.hadoop.fs.gcs.GoogleHadoopFileSystem
fs.file.impl = org.apache.hadoop.fs.LocalFileSystem
dfs.block.size = 67108864
dfs.https.address = 0.0.0.0:50470
ipc.client.kill.max = 10
fs.gs.system.bucket = contrail-ghfs
mapred.healthChecker.script.timeout = 600000
mapred.tasktracker.map.tasks.maximum = 8
mapred.job.tracker.persist.jobstatus.dir = /jobtracker/jobsInfo
mapreduce.jobhistory.max-age-ms = 2592000000
dfs.default.chunk.view.size = 32768
mapred.reduce.slowstart.completed.maps = 0.05
mapreduce.reduce.class = org.apache.crunch.impl.mr.run.CrunchReducer
help = false
io.sort.mb = 100
dfs.datanode.failed.volumes.tolerated = 0
dfs.https.need.client.auth = false
hadoop.http.authentication.type = simple
mapreduce.inputformat.class = org.apache.crunch.impl.mr.run.CrunchInputFormat
dfs.datanode.data.dir.perm = 755
ipc.server.listen.queue.size = 128
io.mapfile.bloom.size = 1048576
fs.hsftp.impl = org.apache.hadoop.hdfs.HsftpFileSystem
mapred.cache.files.timestamps = 1396015124294,1396015124313
mapred.combine.recordsBeforeProgress = 10000
dfs.datanode.dns.nameserver = default
mapred.child.java.opts = -Xms512m -Xmx1024m
dfs.replication.max = 512
mapred.queue.default.state = RUNNING
map.sort.class = org.apache.hadoop.util.QuickSort
hadoop.util.hash.type = murmur
topology.node.switch.mapping.impl = org.apache.hadoop.net.ScriptBasedMapping
dfs.block.access.key.update.interval = 600
dfs.datanode.dns.interface = default
dfs.datanode.use.datanode.hostname = false
mapred.output.compression.type = RECORD
hadoop.security.use-weak-http-crypto = false
mapred.reducer.new-api = true
mapred.skip.attempts.to.start.skipping = 2
mapreduce.job.dir = hdfs://biocloud-nn-0:8020/mnt/pd0/hadoop/tmp/mapred/staging/hadoop/.staging/job_201312300132_0420
io.map.index.skip = 0
crunch.inputs.dir = LG9yZy5hcGFjaGUuY3J1bmNoLnR5cGVzLmF2cm8uQXZyb0lucHV0Rm9ybWF0AAAABBFhdnJvLmlu cHV0LnNjaGVtYY4CAHsidHlwZSI6InJlY29yZCIsIm5hbWUiOiJQYWlyIiwibmFtZXNwYWNlIjoi b3JnLmFwYWNoZS5hdnJvLm1hcHJlZCIsImZpZWxkcyI6W3sibmFtZSI6ImtleSIsInR5cGUiOiJz dHJpbmciLCJkb2MiOiIifSx7Im5hbWUiOiJ2YWx1ZSIsInR5cGUiOnsidHlwZSI6InJlY29yZCIs Im5hbWUiOiJNYXRlUGFpciIsIm5hbWVzcGFjZSI6ImNvbnRyYWlsLnNlcXVlbmNlcyIsImRvYyI6 Ik1hdGUgRmlsZSBSZWNvcmQgU3RydWN0dXJlIiwiZmllbGRzIjpbeyJuYW1lIjoibGVmdCIsInR5 cGUiOnsidHlwZSI6InJlY29yZCIsIm5hbWUiOiJGYXN0UVJlY29yZCIsImRvYyI6IkEgRmFzdFEg UmVhZCIsImZpZWxkcyI6W3sibmFtZSI6ImlkIiwidHlwZSI6InN0cmluZyJ9LHsibmFtZSI6InJl YWQiLCJ0eXBlIjoic3RyaW5nIn0seyJuYW1lIjoicXZhbHVlIiwidHlwZSI6InN0cmluZyJ9XX19 LHsibmFtZSI6InJpZ2h0IiwidHlwZSI6IkZhc3RRUmVjb3JkIn1dfSwiZG9jIjoiIiwib3JkZXIi OiJpZ25vcmUifV19GWNydW5jaC5yZWZsZWN0ZGF0YWZhY3Rvcnkvb3JnLmFwYWNoZS5jcnVuY2gu dHlwZXMuYXZyby5SZWZsZWN0RGF0YUZhY3RvcnkVYXZyby5pbnB1dC5pcy5yZWZsZWN0BWZhbHNl EGNydW5jaC5hdnJvLm1vZGUIU1BFQ0lGSUM= ;-1;/tmp/crunch-797729802/p6
mapred.cluster.max.map.memory.mb = -1
fs.s3.maxRetries = 4
dfs.namenode.logging.level = info
mapred.task.tracker.task-controller = org.apache.hadoop.mapred.DefaultTaskController
mapred.userlog.limit.kb = 0
mapreduce.ifile.readahead.bytes = 4194304
hadoop.http.authentication.simple.anonymous.allowed = true
mapred.jobtracker.nodegroup.aware = false
hadoop.rpc.socket.factory.class.default = org.apache.hadoop.net.StandardSocketFactory
fs.hftp.impl = org.apache.hadoop.hdfs.HftpFileSystem
dfs.namenode.handler.count = 10
fs.kfs.impl = org.apache.hadoop.fs.kfs.KosmosFileSystem
mapreduce.job.submithostaddress = 10.240.0.142
mapred.map.tasks = 0
mapred.local.dir.minspacekill = 0
fs.hdfs.impl = org.apache.hadoop.hdfs.DistributedFileSystem
mapred.job.map.memory.mb = -1
mapred.jobtracker.completeuserjobs.maximum = 100
dfs.namenode.avoid.write.stale.datanode = false
dfs.blockreport.initialDelay = 0
mapred.min.split.size = 0
hadoop.http.authentication.token.validity = 36000
dfs.namenode.delegation.token.max-lifetime = 604800000
fs.ftp.impl = org.apache.hadoop.fs.ftp.FTPFileSystem
dfs.secondary.http.address = 0.0.0.0:50090
mapred.output.compression.codec = org.apache.hadoop.io.compress.DefaultCodec
mapred.cache.files = /tmp/crunch-797729802/p9/REDUCE,/tmp/crunch-797729802/p9/MAP
mapred.cluster.max.reduce.memory.mb = -1
mapred.cluster.reduce.memory.mb = -1
dfs.web.ugi = webuser,webgroup
mapred.task.profile = false
mapred.reduce.parallel.copies = 5
dfs.heartbeat.interval = 3
net.topology.impl = org.apache.hadoop.net.NetworkTopology
local.cache.size = 10737418240
io.sort.factor = 10
mapreduce.map.class = org.apache.crunch.impl.mr.run.CrunchMapper
mapreduce.job.counters.groups.max = 50
mapred.task.timeout = 600000
dfs.safemode.extension = 30000
ipc.client.idlethreshold = 4000
ipc.server.tcpnodelay = false
hadoop.logfile.count = 10
dfs.namenode.stale.datanode.interval = 30000
mapreduce.job.restart.recover = true
mapred.output.dir = /tmp/crunch-797729802/p9/output
mapred.heartbeats.in.second = 100
fs.s3.block.size = 67108864
mapred.jobtracker.jobSchedulable = org.apache.hadoop.mapred.JobSchedulable
mapred.map.output.compression.codec = org.apache.hadoop.io.compress.DefaultCodec
mapred.task.cache.levels = 2
mapred.tasktracker.dns.interface = default
dfs.secondary.namenode.kerberos.internal.spnego.principal = ${dfs.web.authentication.kerberos.principal}
mapred.job.reduce.memory.mb = -1
mapred.mapoutput.value.class = org.apache.avro.mapred.AvroValue
mapred.max.tracker.failures = 4
hadoop.http.authentication.signature.secret.file = ${user.home}/hadoop-http-auth-signature-secret
dfs.df.interval = 60000
mapreduce.reduce.shuffle.read.timeout = 180000
mapred.tasktracker.tasks.sleeptime-before-sigkill = 5000
mapred.max.tracker.blacklists = 4
fs.gs.project.id = biocloudops
jobclient.output.filter = FAILED
mapreduce.ifile.readahead = true
io.serializations = org.apache.hadoop.io.serializer.WritableSerialization,org.apache.crunch.types.avro.SafeAvroSerialization
io.seqfile.compress.blocksize = 1000000
mapred.jobtracker.taskScheduler = org.apache.hadoop.mapred.JobQueueTaskScheduler
job.end.retry.attempts = 0
ipc.client.connect.max.retries = 10
dfs.namenode.delegation.key.update-interval = 86400000
webinterface.private.actions = false
mapred.tasktracker.indexcache.mb = 10
fs.checkpoint.edits.dir = ${fs.checkpoint.dir}
mapreduce.reduce.input.limit = -1
mapred.mapper.new-api = true
tasktracker.http.threads = 40
dfs.namenode.kerberos.internal.spnego.principal = ${dfs.web.authentication.kerberos.principal}
mapreduce.job.counters.group.name.max = 128
mapred.job.tracker.handler.count = 10
keep.failed.task.files = false
mapred.output.compress = false
hadoop.security.group.mapping = org.apache.hadoop.security.ShellBasedUnixGroupsMapping
dfs.https.client.keystore.resource = ssl-client.xml
mapred.cache.files.filesizes = 4614,4452
mapred.jobtracker.job.history.block.size = 3145728
mapred.skip.reduce.max.skip.groups = 0
dfs.datanode.address = 0.0.0.0:50010
dfs.datanode.max.xcievers = 4096
dfs.datanode.https.address = 0.0.0.0:50475
fs.s3.impl = org.apache.hadoop.fs.s3.S3FileSystem
reads_fastq = /speciesA/scaffolding_2014_0218/reversed_reads/*fastq,/speciesA/original_data_2013_1215/speciesA_200i_40x*fastq,/speciesA/original_data_2013_1215/speciesA_300i_40x*fastq
dfs.datanode.drop.cache.behind.reads = false
mapred.jar = hdfs://biocloud-nn-0:8020/mnt/pd0/hadoop/tmp/mapred/staging/hadoop/.staging/job_201312300132_0420/job.jar
hadoop.tmp.dir = /mnt/ed0/hadoop/tmp
mapred.line.input.format.linespermap = 1
dfs.webhdfs.enabled = false
dfs.datanode.du.reserved = 0
topology.script.number.args = 100
fs.default.name = hdfs://biocloud-nn-0:8020/
dfs.balance.bandwidthPerSec = 1048576
mapred.local.dir.minspacestart = 0
mapred.jobtracker.maxtasks.per.job = -1
mapred.user.jobconf.limit = 5242880
mapred.reduce.max.attempts = 4
mapred.job.tracker = biocloud-nn-0:9101
dfs.namenode.decommission.interval = 30
dfs.name.edits.dir = ${dfs.name.dir}
io.mapfile.bloom.error.rate = 0.005
mapred.tasktracker.expiry.interval = 600000
io.sort.record.percent = 0.05
dfs.safemode.threshold.pct = 0.999f
mapred.job.tracker.persist.jobstatus.active = false
outputpath = /speciesA/scaffolding_2014_0218/FilteredReads
dfs.name.dir = ${hadoop.tmp.dir}/dfs/name
mapreduce.job.acl-modify-job =
fs.checkpoint.period = 3600
io.skip.checksum.errors = false
log_file = /users/jlewi/speciesA/FilterReads.0328_0658.log.txt
dfs.datanode.handler.count = 3
dfs.namenode.decommission.nodes.per.interval = 5
mapred.temp.dir = ${hadoop.tmp.dir}/mapred/temp
mapred.mapoutput.key.class = org.apache.avro.mapred.AvroKey
ipc.client.fallback-to-simple-auth-allowed = false
hadoop.native.lib = true
fs.webhdfs.impl = org.apache.hadoop.hdfs.web.WebHdfsFileSystem
dfs.datanode.ipc.address = 0.0.0.0:50020
mapred.working.dir = hdfs://biocloud-nn-0:8020/user/hadoop
mapred.job.reduce.input.buffer.percent = 0.0


This is Apache Hadoop release 1.2.1
