parisni opened a new issue, #5482:
URL: https://github.com/apache/hudi/issues/5482

   I am testing hudi 0.11.0, I get this error after 5 commits 
(=hoodie.keep.max.commits).
   When I turn either `hoodie.metadata.index.column.stats.enable=false` or 
`hoodie.metadata.index.bloom.filter.enable=false`, or 
`hoodie.metadata.index.async=true` then this works.
   Also my COW table does not get that error
   
   Hudi options:
   ```
   {hoodie.datasource.hive_sync.table=archiver_table,
   hoodie.datasource.hive_sync.partition_fields=version,event_date,event_hour,
   hoodie.index.type=BLOOM,
   hoodie.table.precombine.field=kafka_datetime,
   hoodie.metadata.index.column.stats.enable=true,
   hoodie.clean.automatic=true,
   hoodie.write.markers.type=timeline_server_based,
   
hoodie.datasource.hive_sync.partition_extractor_class=org.apache.hudi.hive.MultiPartKeysValueExtractor,
   hoodie.enable.data.skipping=true,
   hoodie.clean.async=true,
   hoodie.compact.inline=true,
   hoodie.datasource.hive_sync.mode=hms,
   hoodie.archive.merge.enable=true,
   hoodie.datasource.write.recordkey.field=store_id,
   hoodie.keep.max.commits=5,
   hoodie.avro.schema.validate=true,
   hoodie.metadata.enable=true,
   hoodie.datasource.write.table.type=MERGE_ON_READ,
   hoodie.datasource.write.hive_style_partitioning=true,
   hoodie.embed.timeline.server.async=true,
   hoodie.bulkinsert.shuffle.parallelism=1,
   hoodie.datasource.hive_sync.enable=true,
   hoodie.keep.min.commits=4,
   hoodie.cleaner.commits.retained=3,
   index.global.enabled=true,
   hoodie.datasource.write.table.name=archiver_table,
   hoodie.metadata.index.bloom.filter.enable=true,
   hoodie.write.commit.callback.class=foo.bar.CommitS3Callback,
   hoodie.datasource.write.partitionpath.field=version,event_date,event_hour,
   hoodie.delete.shuffle.parallelism=1,
   hoodie.upsert.shuffle.parallelism=1,
   hoodie.table.name=archiver_table,
   hoodie.write.commit.callback.s3.region=eu-west-1,
   clean.retain_commits=3,
   hoodie.insert.shuffle.parallelism=1,
   hoodie.datasource.write.precombine.field=kafka_datetime,
   hoodie.write.commit.callback.on=true,
   hoodie.datasource.write.operation=upsert,
   hoodie.datasource.hive_sync.database=default,
   hoodie.compact.inline.max.delta.commits=2}
   ```
   
   The error
   ```
   2022-05-02 16:22:50 INFO 
org.apache.hudi.common.table.log.AbstractHoodieLogRecordReader: Scanning log 
file 
HoodieLogFile{pathStr='s3a://test-bucket/s3_path/table/.hoodie/metadata/column_stats/.col-stats-0000_00000000000000.log.4_1-134-204',
 fileLen=-1}
   2022-05-02 16:22:50 INFO 
org.apache.hudi.common.table.log.AbstractHoodieLogRecordReader: Reading a data 
block from file 
s3a://test-bucket/s3_path/table/.hoodie/metadata/bloom_filters/.bloom-filters-0000_00000000000000.log.2_0-34-64
 at instant 20220502142217107
   2022-05-02 16:22:50 INFO 
org.apache.hudi.common.table.log.AbstractHoodieLogRecordReader: Scanning log 
file 
HoodieLogFile{pathStr='s3a://test-bucket/s3_path/table/.hoodie/metadata/bloom_filters/.bloom-filters-0003_00000000000000.log.2_1-34-65',
 fileLen=-1}
   2022-05-02 16:22:50 INFO 
org.apache.hudi.common.table.log.AbstractHoodieLogRecordReader: Reading a data 
block from file 
s3a://test-bucket/s3_path/table/.hoodie/metadata/files/.files-0000_00000000000000.log.4_2-134-205
 at instant 20220502142229068
   2022-05-02 16:22:50 INFO 
org.apache.hudi.common.table.log.HoodieMergedLogRecordScanner: Number of log 
files scanned => 3
   2022-05-02 16:22:50 INFO 
org.apache.hudi.common.table.log.HoodieMergedLogRecordScanner: MaxMemoryInBytes 
allowed for compaction => 113246208
   2022-05-02 16:22:50 INFO 
org.apache.hudi.common.table.log.HoodieMergedLogRecordScanner: Number of 
entries in MemoryBasedMap in ExternalSpillableMap => 2
   2022-05-02 16:22:50 INFO 
org.apache.hudi.common.table.log.HoodieMergedLogRecordScanner: Total size in 
bytes of MemoryBasedMap in ExternalSpillableMap => 864032
   2022-05-02 16:22:50 INFO 
org.apache.hudi.common.table.log.HoodieMergedLogRecordScanner: Number of 
entries in BitCaskDiskMap in ExternalSpillableMap => 0
   2022-05-02 16:22:50 INFO 
org.apache.hudi.common.table.log.HoodieMergedLogRecordScanner: Size of file 
spilled to disk => 0
   2022-05-02 16:22:50 INFO 
org.apache.hudi.common.table.log.AbstractHoodieLogRecordReader: Reading a data 
block from file 
s3a://test-bucket/s3_path/table/.hoodie/metadata/column_stats/.col-stats-0000_00000000000000.log.4_1-134-204
 at instant 20220502142229068
   2022-05-02 16:22:50 INFO 
org.apache.hudi.common.table.log.AbstractHoodieLogRecordReader: Reading a data 
block from file 
s3a://test-bucket/s3_path/table/.hoodie/metadata/column_stats/.col-stats-0001_00000000000000.log.2_2-34-66
 at instant 20220502142217107
   2022-05-02 16:22:50 INFO 
org.apache.hudi.common.table.log.HoodieLogFormatReader: Moving to the next 
reader for logfile 
HoodieLogFile{pathStr='s3a://test-bucket/s3_path/table/.hoodie/metadata/bloom_filters/.bloom-filters-0000_00000000000000.log.3_0-334-564',
 fileLen=-1}
   2022-05-02 16:22:50 INFO 
org.apache.hudi.common.table.log.AbstractHoodieLogRecordReader: Reading a data 
block from file 
s3a://test-bucket/s3_path/table/.hoodie/metadata/bloom_filters/.bloom-filters-0003_00000000000000.log.2_1-34-65
 at instant 20220502142217107
   2022-05-02 16:23:17 DEBUG org.apache.hudi.org.apache.jetty.io.IdleTimeout: 
SocketChannelEndPoint@27844d51{/192.168.1.61:34542<->/192.168.1.61:33361,OPEN,fill=FI,flush=-,to=28000/30000}{io=1/1,kio=1,kro=1}->HttpConnection@5ad70cae[p=HttpParser{s=START,0
 of 
-1},g=HttpGenerator@4bda6d57{s=START}]=>HttpChannelOverHttp@7d1b5411{r=8,c=false,c=false/false,a=IDLE,uri=null,age=0}
 idle timeout check, elapsed: 28000 ms, remaining: 2000 ms
   2022-05-02 16:23:17 DEBUG org.apache.hudi.org.apache.jetty.io.IdleTimeout: 
SocketChannelEndPoint@2f31590a{/192.168.1.61:34544<->/192.168.1.61:33361,OPEN,fill=FI,flush=-,to=28434/30000}{io=1/1,kio=1,kro=1}->HttpConnection@635b5ed3[p=HttpParser{s=START,0
 of 
-1},g=HttpGenerator@4687b2bb{s=START}]=>HttpChannelOverHttp@67f9634c{r=15,c=false,c=false/false,a=IDLE,uri=null,age=0}
 idle timeout check, elapsed: 28434 ms, remaining: 1566 ms
   2022-05-02 16:23:19 DEBUG org.apache.hudi.org.apache.jetty.io.IdleTimeout: 
SocketChannelEndPoint@2f31590a{/192.168.1.61:34544<->/192.168.1.61:33361,OPEN,fill=FI,flush=-,to=30000/30000}{io=1/1,kio=1,kro=1}->HttpConnection@635b5ed3[p=HttpParser{s=START,0
 of 
-1},g=HttpGenerator@4687b2bb{s=START}]=>HttpChannelOverHttp@67f9634c{r=15,c=false,c=false/false,a=IDLE,uri=null,age=0}
 idle timeout check, elapsed: 30000 ms, remaining: 0 ms
   2022-05-02 16:23:19 DEBUG org.apache.hudi.org.apache.jetty.io.IdleTimeout: 
SocketChannelEndPoint@2f31590a{/192.168.1.61:34544<->/192.168.1.61:33361,OPEN,fill=FI,flush=-,to=30000/30000}{io=1/1,kio=1,kro=1}->HttpConnection@635b5ed3[p=HttpParser{s=START,0
 of 
-1},g=HttpGenerator@4687b2bb{s=START}]=>HttpChannelOverHttp@67f9634c{r=15,c=false,c=false/false,a=IDLE,uri=null,age=0}
 idle timeout expired
   2022-05-02 16:23:19 DEBUG org.apache.hudi.org.apache.jetty.io.FillInterest: 
onFail 
FillInterest@6a484a24{AC.ReadCB@635b5ed3{HttpConnection@635b5ed3::SocketChannelEndPoint@2f31590a{/192.168.1.61:34544<->/192.168.1.61:33361,OPEN,fill=FI,flush=-,to=30000/30000}{io=1/1,kio=1,kro=1}->HttpConnection@635b5ed3[p=HttpParser{s=START,0
 of 
-1},g=HttpGenerator@4687b2bb{s=START}]=>HttpChannelOverHttp@67f9634c{r=15,c=false,c=false/false,a=IDLE,uri=null,age=0}}}
   java.util.concurrent.TimeoutException: Idle timeout expired: 30000/30000 ms
        at 
org.apache.hudi.org.apache.jetty.io.IdleTimeout.checkIdleTimeout(IdleTimeout.java:166)
        at 
org.apache.hudi.org.apache.jetty.io.IdleTimeout$1.run(IdleTimeout.java:50)
        at 
java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
        at java.util.concurrent.FutureTask.run(FutureTask.java:266)
        at 
java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$201(ScheduledThreadPoolExecutor.java:180)
        at 
java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:293)
        at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
        at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
        at java.lang.Thread.run(Thread.java:748)
   2022-05-02 16:23:19 DEBUG org.apache.hudi.org.apache.jetty.http.HttpParser: 
close HttpParser{s=START,0 of -1}
   2022-05-02 16:23:19 DEBUG org.apache.hudi.org.apache.jetty.http.HttpParser: 
START --> CLOSE
   2022-05-02 16:23:19 DEBUG 
org.apache.hudi.org.apache.jetty.io.AbstractConnection: 
HttpConnection@635b5ed3::SocketChannelEndPoint@2f31590a{/192.168.1.61:34544<->/192.168.1.61:33361,OPEN,fill=-,flush=-,to=30000/30000}{io=1/1,kio=1,kro=1}->HttpConnection@635b5ed3[p=HttpParser{s=CLOSE,0
 of 
-1},g=HttpGenerator@4687b2bb{s=START}]=>HttpChannelOverHttp@67f9634c{r=15,c=false,c=false/false,a=IDLE,uri=null,age=0}
 onFillInterestedFailed {}
   2022-05-02 16:23:19 DEBUG 
org.apache.hudi.org.apache.jetty.io.AbstractEndPoint: shutdownOutput 
SocketChannelEndPoint@2f31590a{/192.168.1.61:34544<->/192.168.1.61:33361,OPEN,fill=-,flush=-,to=30000/30000}{io=1/1,kio=1,kro=1}->HttpConnection@635b5ed3[p=HttpParser{s=CLOSE,0
 of 
-1},g=HttpGenerator@4687b2bb{s=START}]=>HttpChannelOverHttp@67f9634c{r=15,c=false,c=false/false,a=IDLE,uri=null,age=0}
   2022-05-02 16:23:19 DEBUG 
org.apache.hudi.org.apache.jetty.io.AbstractConnection: fillInterested 
HttpConnection@635b5ed3::SocketChannelEndPoint@2f31590a{/192.168.1.61:34544<->/192.168.1.61:33361,OSHUT,fill=-,flush=-,to=30001/30000}{io=1/1,kio=1,kro=1}->HttpConnection@635b5ed3[p=HttpParser{s=CLOSE,0
 of 
-1},g=HttpGenerator@4687b2bb{s=START}]=>HttpChannelOverHttp@67f9634c{r=15,c=false,c=false/false,a=IDLE,uri=null,age=0}
   2022-05-02 16:23:19 DEBUG org.apache.hudi.org.apache.jetty.io.FillInterest: 
interested 
FillInterest@6a484a24{AC.ReadCB@635b5ed3{HttpConnection@635b5ed3::SocketChannelEndPoint@2f31590a{/192.168.1.61:34544<->/192.168.1.61:33361,OSHUT,fill=FI,flush=-,to=0/30000}{io=1/1,kio=1,kro=1}->HttpConnection@635b5ed3[p=HttpParser{s=CLOSE,0
 of 
-1},g=HttpGenerator@4687b2bb{s=START}]=>HttpChannelOverHttp@67f9634c{r=15,c=false,c=false/false,a=IDLE,uri=null,age=0}}}
   2022-05-02 16:23:19 DEBUG 
org.apache.hudi.org.apache.jetty.io.ChannelEndPoint: changeInterests p=false 
1->1 for 
SocketChannelEndPoint@2f31590a{/192.168.1.61:34544<->/192.168.1.61:33361,OSHUT,fill=FI,flush=-,to=0/30000}{io=1/1,kio=1,kro=1}->HttpConnection@635b5ed3[p=HttpParser{s=CLOSE,0
 of 
-1},g=HttpGenerator@4687b2bb{s=START}]=>HttpChannelOverHttp@67f9634c{r=15,c=false,c=false/false,a=IDLE,uri=null,age=0}
   2022-05-02 16:23:19 DEBUG org.apache.hudi.org.apache.jetty.io.WriteFlusher: 
ignored: WriteFlusher@1465a0a2{IDLE}->null
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to