Re: Alternatives to Streaming Mutation API in Hive 3.x

2020-01-16 Thread Nandakishore Mm
Here is some more context of what we are trying out. We have streaming mutable data in kafka and we want to stream these upserts into Hive ( HDFS backed) . I gather than hive has a streaming interface through which upserts can be streamed into Hive. Have u guys tested this and or done some POC

?????? Alternatives to Streaming Mutation API in Hive 3.x

2020-01-14 Thread ????????
You sent it to the wrong person ---- ??:"Nandakishore Mm"

Re: Alternatives to Streaming Mutation API in Hive 3.x

2020-01-14 Thread Nandakishore Mm
Hi David, Thanks for the response. I'm actually trying to do streaming upserts into hive. Since we already use Hive to perform our analytics we are looking for solutions based around Hive itself. Also as you mentioned Hive 3.x for upserts, could you point me to something specific in Hive that

Re: Alternatives to Streaming Mutation API in Hive 3.x

2020-01-13 Thread David Mollitor
Hello, Streaming? NiFi Upserts? HBase, Kudu, Hive 3.x Doing upserts on Hive can be cumbersome, depending on the use case. If Upserts are being submitted continuously and quickly, it can overwhelm the system because it will require a scan across the data set (for all intents and purposes) for

Fwd: Alternatives to Streaming Mutation API in Hive 3.x

2020-01-13 Thread Nandakishore Mm
Hi Hive Team, I have a use case where data has to be continuously streamed into a hive table as well as old records with the same primary key have to be updated/deleted (i.e. upserts). This document for 'HCatalog