[ https://issues.apache.org/jira/browse/GOBBLIN-1611?focusedWorklogId=845454&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-845454 ]
ASF GitHub Bot logged work on GOBBLIN-1611:
-------------------------------------------
Author: ASF GitHub Bot
Created on: 14/Feb/23 18:16
Start Date: 14/Feb/23 18:16
Worklog Time Spent: 10m
Work Description: Will-Lo merged PR #3470:
URL: https://github.com/apache/gobblin/pull/3470
Issue Time Tracking
-------------------
Worklog Id: (was: 845454)
Time Spent: 0.5h (was: 20m)
> Fix a wrong value for writer.codec.type in the document
> -------------------------------------------------------
>
> Key: GOBBLIN-1611
> URL: https://issues.apache.org/jira/browse/GOBBLIN-1611
> Project: Apache Gobblin
> Issue Type: Bug
> Reporter: Kengo Seki
> Priority: Minor
> Time Spent: 0.5h
> Remaining Estimate: 0h
>
> According to
> https://gobblin.readthedocs.io/en/latest/user-guide/Configuration-Properties-Glossary/#Writer-Properties,
> the possible values for the writer.codec.type property are NOCOMPRESSION,
> DEFLATE, and SNAPPY.
> However, when NOCOMPRESSION is actually specified, the job fails with the
> following error.
> {code}
> 2022-02-16 14:15:33 JST ERROR [ForkExecutor-0] org.apache.gobblin.runtime.fork.Fork [d_t_1644988532694_0] - Fork 0 of task task_EmbeddedGobblin_1644988532694_0 failed.
> org.apache.avro.AvroRuntimeException: Unrecognized codec: nocompression
> at org.apache.avro.file.CodecFactory.fromString(CodecFactory.java:102)
> at org.apache.gobblin.util.WriterUtils.getCodecFactory(WriterUtils.java:263)
> at org.apache.gobblin.writer.AvroHdfsDataWriter.<init>(AvroHdfsDataWriter.java:65)
> at org.apache.gobblin.writer.AvroDataWriterBuilder.build(AvroDataWriterBuilder.java:45)
> at org.apache.gobblin.writer.PartitionedDataWriter$4.get(PartitionedDataWriter.java:220)
> at org.apache.gobblin.writer.PartitionedDataWriter$4.get(PartitionedDataWriter.java:215)
> at org.apache.gobblin.writer.CloseOnFlushWriterWrapper.<init>(CloseOnFlushWriterWrapper.java:73)
> at org.apache.gobblin.writer.PartitionedDataWriter.<init>(PartitionedDataWriter.java:214)
> at org.apache.gobblin.runtime.fork.Fork.buildWriter(Fork.java:571)
> at org.apache.gobblin.runtime.fork.Fork.buildWriterIfNotPresent(Fork.java:579)
> at org.apache.gobblin.runtime.fork.Fork.processRecord(Fork.java:525)
> at org.apache.gobblin.runtime.fork.AsynchronousFork.processRecord(AsynchronousFork.java:103)
> at org.apache.gobblin.runtime.fork.AsynchronousFork.processRecords(AsynchronousFork.java:86)
> at org.apache.gobblin.runtime.fork.Fork.run(Fork.java:257)
> at org.apache.gobblin.util.executors.MDCPropagatingRunnable.run(MDCPropagatingRunnable.java:39)
> at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
> at java.util.concurrent.FutureTask.run(FutureTask.java:266)
> at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
> at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
> at java.lang.Thread.run(Thread.java:748)
> {code}
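
The failure above can be illustrated with a rough, self-contained sketch. It is not Gobblin or Avro code: the class CodecNameCheck, the method resolve, and the KNOWN set are hypothetical names introduced here. The sketch assumes that the configured value is lowercased before the lookup (inferred from the lowercase "nocompression" in the error message) and that the lookup recognizes names such as "null", "deflate", and "snappy"; the real set depends on the Avro version in use.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.Locale;
import java.util.Set;

public class CodecNameCheck {
    // Assumption: a stand-in for the codec names the real lookup recognizes.
    static final Set<String> KNOWN =
        new HashSet<>(Arrays.asList("null", "deflate", "snappy", "bzip2", "xz"));

    // Lowercase the configured value and look it up by name, mimicking the
    // behavior suggested by the stack trace (hypothetical helper).
    static String resolve(String configured) {
        String name = configured.toLowerCase(Locale.ROOT);
        if (!KNOWN.contains(name)) {
            throw new IllegalArgumentException("Unrecognized codec: " + name);
        }
        return name;
    }

    public static void main(String[] args) {
        // The three values listed in the glossary at the time of the report.
        for (String value : new String[] {"DEFLATE", "SNAPPY", "NOCOMPRESSION"}) {
            try {
                System.out.println(value + " -> " + resolve(value));
            } catch (IllegalArgumentException e) {
                System.out.println(value + " -> " + e.getMessage());
            }
        }
    }
}
```

Under these assumptions, DEFLATE and SNAPPY resolve to a known name, while NOCOMPRESSION is rejected because no codec is registered under "nocompression".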