satishkotha edited a comment on issue #2346: URL: https://github.com/apache/hudi/issues/2346#issuecomment-755791465
@sumihehe I'm not able to reproduce this issue using dataset in [docker demo](https://hudi.apache.org/docs/docker_demo.html) 0: jdbc:hive2://hiveserver:10000> select `_hoodie_commit_time`, symbol, ts, volume, open, close from stock_ticks_mor_rt where `_hoodie_commit_time`='20210106235525'; +----------------------+---------+----------------------+---------+------------+------------+--+ | _hoodie_commit_time | symbol | ts | volume | open | close | +----------------------+---------+----------------------+---------+------------+------------+--+ | 20210106235525 | NIHD | 2018-08-31 10:29:00 | 2125 | 5.67 | 5.67 | | 20210106235525 | CDMO | 2018-08-31 09:58:00 | 8577 | 7.02 | 7.07 | | 20210106235525 | MRTX | 2018-08-31 10:28:00 | 1844 | 56.15 | 56.15 | | 20210106235525 | NTRA | 2018-08-31 09:59:00 | 1326 | 27.45 | 27.45 | | 20210106235525 | DRNA | 2018-08-31 09:51:00 | 334 | 16.0 | 16.0 | | 20210106235525 | STAA | 2018-08-31 09:59:00 | 1709 | 47.95 | 47.95 | ... 197 rows selected (0.423 seconds) Looked at code closely, I dont see anything obviously wrong. Could you help explain little more about your dataset so i can try to reproduce it? - How many rows does it have? - How many commits and how many deltacommits? - Can you try the same query on **mor_ro table? - Could you try other input format such as HoodieParquetRealtimeInputFormat. (as @bvaradar suggested) It'd be awesome if you have a reproducible test using above docker demo or on simple spark shell ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected]
