parisni opened a new issue, #5482:
URL: https://github.com/apache/hudi/issues/5482
I am testing hudi 0.11.0, I get this error after 5 commits
(=hoodie.keep.max.commits).
When I turn either `hoodie.metadata.index.column.stats.enable=false` or
`hoodie.metadata.index.bloom.filter.enable=false`, or
`hoodie.metadata.index.async=true` then this works.
Also my COW table does not get that error
Hudi options:
```
{hoodie.datasource.hive_sync.table=archiver_table,
hoodie.datasource.hive_sync.partition_fields=version,event_date,event_hour,
hoodie.index.type=BLOOM,
hoodie.table.precombine.field=kafka_datetime,
hoodie.metadata.index.column.stats.enable=true,
hoodie.clean.automatic=true,
hoodie.write.markers.type=timeline_server_based,
hoodie.datasource.hive_sync.partition_extractor_class=org.apache.hudi.hive.MultiPartKeysValueExtractor,
hoodie.enable.data.skipping=true,
hoodie.clean.async=true,
hoodie.compact.inline=true,
hoodie.datasource.hive_sync.mode=hms,
hoodie.archive.merge.enable=true,
hoodie.datasource.write.recordkey.field=store_id,
hoodie.keep.max.commits=5,
hoodie.avro.schema.validate=true,
hoodie.metadata.enable=true,
hoodie.datasource.write.table.type=MERGE_ON_READ,
hoodie.datasource.write.hive_style_partitioning=true,
hoodie.embed.timeline.server.async=true,
hoodie.bulkinsert.shuffle.parallelism=1,
hoodie.datasource.hive_sync.enable=true,
hoodie.keep.min.commits=4,
hoodie.cleaner.commits.retained=3,
index.global.enabled=true,
hoodie.datasource.write.table.name=archiver_table,
hoodie.metadata.index.bloom.filter.enable=true,
hoodie.write.commit.callback.class=foo.bar.CommitS3Callback,
hoodie.datasource.write.partitionpath.field=version,event_date,event_hour,
hoodie.delete.shuffle.parallelism=1,
hoodie.upsert.shuffle.parallelism=1,
hoodie.table.name=archiver_table,
hoodie.write.commit.callback.s3.region=eu-west-1,
clean.retain_commits=3,
hoodie.insert.shuffle.parallelism=1,
hoodie.datasource.write.precombine.field=kafka_datetime,
hoodie.write.commit.callback.on=true,
hoodie.datasource.write.operation=upsert,
hoodie.datasource.hive_sync.database=default,
hoodie.compact.inline.max.delta.commits=2}
```
The error
```
2022-05-02 16:22:50 INFO
org.apache.hudi.common.table.log.AbstractHoodieLogRecordReader: Scanning log
file
HoodieLogFile{pathStr='s3a://test-bucket/s3_path/table/.hoodie/metadata/column_stats/.col-stats-0000_00000000000000.log.4_1-134-204',
fileLen=-1}
2022-05-02 16:22:50 INFO
org.apache.hudi.common.table.log.AbstractHoodieLogRecordReader: Reading a data
block from file
s3a://test-bucket/s3_path/table/.hoodie/metadata/bloom_filters/.bloom-filters-0000_00000000000000.log.2_0-34-64
at instant 20220502142217107
2022-05-02 16:22:50 INFO
org.apache.hudi.common.table.log.AbstractHoodieLogRecordReader: Scanning log
file
HoodieLogFile{pathStr='s3a://test-bucket/s3_path/table/.hoodie/metadata/bloom_filters/.bloom-filters-0003_00000000000000.log.2_1-34-65',
fileLen=-1}
2022-05-02 16:22:50 INFO
org.apache.hudi.common.table.log.AbstractHoodieLogRecordReader: Reading a data
block from file
s3a://test-bucket/s3_path/table/.hoodie/metadata/files/.files-0000_00000000000000.log.4_2-134-205
at instant 20220502142229068
2022-05-02 16:22:50 INFO
org.apache.hudi.common.table.log.HoodieMergedLogRecordScanner: Number of log
files scanned => 3
2022-05-02 16:22:50 INFO
org.apache.hudi.common.table.log.HoodieMergedLogRecordScanner: MaxMemoryInBytes
allowed for compaction => 113246208
2022-05-02 16:22:50 INFO
org.apache.hudi.common.table.log.HoodieMergedLogRecordScanner: Number of
entries in MemoryBasedMap in ExternalSpillableMap => 2
2022-05-02 16:22:50 INFO
org.apache.hudi.common.table.log.HoodieMergedLogRecordScanner: Total size in
bytes of MemoryBasedMap in ExternalSpillableMap => 864032
2022-05-02 16:22:50 INFO
org.apache.hudi.common.table.log.HoodieMergedLogRecordScanner: Number of
entries in BitCaskDiskMap in ExternalSpillableMap => 0
2022-05-02 16:22:50 INFO
org.apache.hudi.common.table.log.HoodieMergedLogRecordScanner: Size of file
spilled to disk => 0
2022-05-02 16:22:50 INFO
org.apache.hudi.common.table.log.AbstractHoodieLogRecordReader: Reading a data
block from file
s3a://test-bucket/s3_path/table/.hoodie/metadata/column_stats/.col-stats-0000_00000000000000.log.4_1-134-204
at instant 20220502142229068
2022-05-02 16:22:50 INFO
org.apache.hudi.common.table.log.AbstractHoodieLogRecordReader: Reading a data
block from file
s3a://test-bucket/s3_path/table/.hoodie/metadata/column_stats/.col-stats-0001_00000000000000.log.2_2-34-66
at instant 20220502142217107
2022-05-02 16:22:50 INFO
org.apache.hudi.common.table.log.HoodieLogFormatReader: Moving to the next
reader for logfile
HoodieLogFile{pathStr='s3a://test-bucket/s3_path/table/.hoodie/metadata/bloom_filters/.bloom-filters-0000_00000000000000.log.3_0-334-564',
fileLen=-1}
2022-05-02 16:22:50 INFO
org.apache.hudi.common.table.log.AbstractHoodieLogRecordReader: Reading a data
block from file
s3a://test-bucket/s3_path/table/.hoodie/metadata/bloom_filters/.bloom-filters-0003_00000000000000.log.2_1-34-65
at instant 20220502142217107
2022-05-02 16:23:17 DEBUG org.apache.hudi.org.apache.jetty.io.IdleTimeout:
SocketChannelEndPoint@27844d51{/192.168.1.61:34542<->/192.168.1.61:33361,OPEN,fill=FI,flush=-,to=28000/30000}{io=1/1,kio=1,kro=1}->HttpConnection@5ad70cae[p=HttpParser{s=START,0
of
-1},g=HttpGenerator@4bda6d57{s=START}]=>HttpChannelOverHttp@7d1b5411{r=8,c=false,c=false/false,a=IDLE,uri=null,age=0}
idle timeout check, elapsed: 28000 ms, remaining: 2000 ms
2022-05-02 16:23:17 DEBUG org.apache.hudi.org.apache.jetty.io.IdleTimeout:
SocketChannelEndPoint@2f31590a{/192.168.1.61:34544<->/192.168.1.61:33361,OPEN,fill=FI,flush=-,to=28434/30000}{io=1/1,kio=1,kro=1}->HttpConnection@635b5ed3[p=HttpParser{s=START,0
of
-1},g=HttpGenerator@4687b2bb{s=START}]=>HttpChannelOverHttp@67f9634c{r=15,c=false,c=false/false,a=IDLE,uri=null,age=0}
idle timeout check, elapsed: 28434 ms, remaining: 1566 ms
2022-05-02 16:23:19 DEBUG org.apache.hudi.org.apache.jetty.io.IdleTimeout:
SocketChannelEndPoint@2f31590a{/192.168.1.61:34544<->/192.168.1.61:33361,OPEN,fill=FI,flush=-,to=30000/30000}{io=1/1,kio=1,kro=1}->HttpConnection@635b5ed3[p=HttpParser{s=START,0
of
-1},g=HttpGenerator@4687b2bb{s=START}]=>HttpChannelOverHttp@67f9634c{r=15,c=false,c=false/false,a=IDLE,uri=null,age=0}
idle timeout check, elapsed: 30000 ms, remaining: 0 ms
2022-05-02 16:23:19 DEBUG org.apache.hudi.org.apache.jetty.io.IdleTimeout:
SocketChannelEndPoint@2f31590a{/192.168.1.61:34544<->/192.168.1.61:33361,OPEN,fill=FI,flush=-,to=30000/30000}{io=1/1,kio=1,kro=1}->HttpConnection@635b5ed3[p=HttpParser{s=START,0
of
-1},g=HttpGenerator@4687b2bb{s=START}]=>HttpChannelOverHttp@67f9634c{r=15,c=false,c=false/false,a=IDLE,uri=null,age=0}
idle timeout expired
2022-05-02 16:23:19 DEBUG org.apache.hudi.org.apache.jetty.io.FillInterest:
onFail
FillInterest@6a484a24{AC.ReadCB@635b5ed3{HttpConnection@635b5ed3::SocketChannelEndPoint@2f31590a{/192.168.1.61:34544<->/192.168.1.61:33361,OPEN,fill=FI,flush=-,to=30000/30000}{io=1/1,kio=1,kro=1}->HttpConnection@635b5ed3[p=HttpParser{s=START,0
of
-1},g=HttpGenerator@4687b2bb{s=START}]=>HttpChannelOverHttp@67f9634c{r=15,c=false,c=false/false,a=IDLE,uri=null,age=0}}}
java.util.concurrent.TimeoutException: Idle timeout expired: 30000/30000 ms
at
org.apache.hudi.org.apache.jetty.io.IdleTimeout.checkIdleTimeout(IdleTimeout.java:166)
at
org.apache.hudi.org.apache.jetty.io.IdleTimeout$1.run(IdleTimeout.java:50)
at
java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
at java.util.concurrent.FutureTask.run(FutureTask.java:266)
at
java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$201(ScheduledThreadPoolExecutor.java:180)
at
java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:293)
at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:748)
2022-05-02 16:23:19 DEBUG org.apache.hudi.org.apache.jetty.http.HttpParser:
close HttpParser{s=START,0 of -1}
2022-05-02 16:23:19 DEBUG org.apache.hudi.org.apache.jetty.http.HttpParser:
START --> CLOSE
2022-05-02 16:23:19 DEBUG
org.apache.hudi.org.apache.jetty.io.AbstractConnection:
HttpConnection@635b5ed3::SocketChannelEndPoint@2f31590a{/192.168.1.61:34544<->/192.168.1.61:33361,OPEN,fill=-,flush=-,to=30000/30000}{io=1/1,kio=1,kro=1}->HttpConnection@635b5ed3[p=HttpParser{s=CLOSE,0
of
-1},g=HttpGenerator@4687b2bb{s=START}]=>HttpChannelOverHttp@67f9634c{r=15,c=false,c=false/false,a=IDLE,uri=null,age=0}
onFillInterestedFailed {}
2022-05-02 16:23:19 DEBUG
org.apache.hudi.org.apache.jetty.io.AbstractEndPoint: shutdownOutput
SocketChannelEndPoint@2f31590a{/192.168.1.61:34544<->/192.168.1.61:33361,OPEN,fill=-,flush=-,to=30000/30000}{io=1/1,kio=1,kro=1}->HttpConnection@635b5ed3[p=HttpParser{s=CLOSE,0
of
-1},g=HttpGenerator@4687b2bb{s=START}]=>HttpChannelOverHttp@67f9634c{r=15,c=false,c=false/false,a=IDLE,uri=null,age=0}
2022-05-02 16:23:19 DEBUG
org.apache.hudi.org.apache.jetty.io.AbstractConnection: fillInterested
HttpConnection@635b5ed3::SocketChannelEndPoint@2f31590a{/192.168.1.61:34544<->/192.168.1.61:33361,OSHUT,fill=-,flush=-,to=30001/30000}{io=1/1,kio=1,kro=1}->HttpConnection@635b5ed3[p=HttpParser{s=CLOSE,0
of
-1},g=HttpGenerator@4687b2bb{s=START}]=>HttpChannelOverHttp@67f9634c{r=15,c=false,c=false/false,a=IDLE,uri=null,age=0}
2022-05-02 16:23:19 DEBUG org.apache.hudi.org.apache.jetty.io.FillInterest:
interested
FillInterest@6a484a24{AC.ReadCB@635b5ed3{HttpConnection@635b5ed3::SocketChannelEndPoint@2f31590a{/192.168.1.61:34544<->/192.168.1.61:33361,OSHUT,fill=FI,flush=-,to=0/30000}{io=1/1,kio=1,kro=1}->HttpConnection@635b5ed3[p=HttpParser{s=CLOSE,0
of
-1},g=HttpGenerator@4687b2bb{s=START}]=>HttpChannelOverHttp@67f9634c{r=15,c=false,c=false/false,a=IDLE,uri=null,age=0}}}
2022-05-02 16:23:19 DEBUG
org.apache.hudi.org.apache.jetty.io.ChannelEndPoint: changeInterests p=false
1->1 for
SocketChannelEndPoint@2f31590a{/192.168.1.61:34544<->/192.168.1.61:33361,OSHUT,fill=FI,flush=-,to=0/30000}{io=1/1,kio=1,kro=1}->HttpConnection@635b5ed3[p=HttpParser{s=CLOSE,0
of
-1},g=HttpGenerator@4687b2bb{s=START}]=>HttpChannelOverHttp@67f9634c{r=15,c=false,c=false/false,a=IDLE,uri=null,age=0}
2022-05-02 16:23:19 DEBUG org.apache.hudi.org.apache.jetty.io.WriteFlusher:
ignored: WriteFlusher@1465a0a2{IDLE}->null
```
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]