RajasekarSribalan commented on issue #1766: URL: https://github.com/apache/hudi/issues/1766#issuecomment-650712662
Thanks for your reply. I tried this setting as well but I get duplicate records when querying hudi table... ideally it has to pick up only latest commit but it fetches all parquet and returns duplicate records. On Sun, 28 Jun 2020, 12:42 pm Bhavani Sudha Saktheeswaran, < [email protected]> wrote: > It is strange you are seeing this for Hudi and non Hudi tables. Could you > try setting this config when querying Hive > > set hive.input.format=org.apache.hadoop.hive.ql.io.HiveInputFormat and > check once? > > — > You are receiving this because you were mentioned. > Reply to this email directly, view it on GitHub > <https://github.com/apache/hudi/issues/1766#issuecomment-650708997>, or > unsubscribe > <https://github.com/notifications/unsubscribe-auth/AFMO6I42ZJHBA4T2WLQF6NTRY3UNPANCNFSM4OIRVWCA> > . > ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected]
