Re: DB2 DDL to Hive DDL conversion **Need Help**

2016-02-19 Thread Mohit Durgapal
Oracle, SQL > Server and some DB2 DDL. If there are issues I can extend the tool, contact > me. > > Dmitry > > > On Fri, Feb 19, 2016 at 11:55 AM, Mohit Durgapal <durgapalmo...@gmail.com > <javascript:_e(%7B%7D,'cvml','durgapalmo...@gmail.com');>> wrote: >

Re: DB2 DDL to Hive DDL conversion **Need Help**

2016-02-19 Thread Mohit Durgapal
Have you considered using Sqoop? If not, then please have a look at the following links: https://sqoop.apache.org/docs/1.4.3/SqoopUserGuide.html#_importing_data_into_hive http://stackoverflow.com/questions/17064144/how-do-i-use-sqoop-for-importing-data-from-a-relational-db-to-sandbox-hive On

Re: PLEASE HELP :: HOW TO DO INNER JOIN IN HIVE

2014-10-15 Thread Mohit Durgapal
Hi Yogesh, Hive DOES support inner joins. For example for two tables a b, the inner join query could be SELECT a.id,a.item,b.color FROM a JOIN b ON (a.id = b.id) You also need to understand how hive converts the joins into mapreduce jobs for good performance. Please read about it here

How to use joins and averages both in the same hive query

2014-09-03 Thread Mohit Durgapal
I have two tables in hive: Table1: uid,txid,amt,vendor Table2: uid,txid Now I need to join the tables on txid which basically confirms a transaction is finally recorded. There will be some transactions which will be present only in Table1 and not in Table2. I need to find out number of avg of

Re: In need of desperate help and willing to pay for consulting work

2014-04-15 Thread Mohit Durgapal
Hive isn't really a replacement for mysql. Please read about hive and big data. The use cases for hive are quite different as compared to the use cass for mysql. Mysql is for real time ad-hoc queries, like showing some information on a webpage or search results. MySQL is a relational database,

hive query to select top 10 product of each subcategory and select most recent product info

2014-04-11 Thread Mohit Durgapal
I have a hive table partitioned by dates. It contains ecomm data in the format siteid,sitecatid,catid,subcatgid,pid,pname,pprice,pmrp,pdesc What I need to do is to run a query on table above in hive for top 10 products(count wise) in each sub category. What adds a bit more complexity is

Re: hive query to select top 10 product of each subcategory and select most recent product info

2014-04-11 Thread Mohit Durgapal
nitinpawar...@gmail.comwrote: may be you can share your table ddl, your query and what output r u looking for On Fri, Apr 11, 2014 at 12:26 PM, Mohit Durgapal durgapalmo...@gmail.comwrote: I have a hive table partitioned by dates. It contains ecomm data in the format siteid,sitecatid