Re: how to invoke hive command line client?

2011-05-26 Thread 김영우
AFAIK, the Hive standalone server allows multiple clients to connect. See https://issues.apache.org/jira/browse/HIVE-73
- Youngwoo
2011/5/26 jinhang du dujinh...@gmail.com:
> So what's the difference between an embedded server and a standalone server? Can you help me understand it?

Re: HiveQL for 'rank() over (partition by ... order by ...)'?

2011-05-26 Thread thinker0
Hi.
--
/**
 * @author thinker0
 *
 * TiaraUDFRank.
 */
@Description(name = t_row_rank, value = _FUNC_() - Returns a

questions about statistics in 0.7

2011-05-26 Thread Guy Bayes
Hello all, I'm new to this list and was wondering if anyone could answer a couple of questions about the implementation of statistics in 0.7. I've reviewed http://wiki.apache.org/hadoop/Hive/StatsDev and have the following q

Re: questions about statistics in 0.7

2011-05-26 Thread Guy Bayes
Crap, sorry, hit send too early. Questions:
1: Job overhead of generating statistics on the fly with set hive.stats.autogather=true;?
2: Are stat descriptions in describe table extended implemented? I've gathered stats on a table but do not see the expected entries (rowNum = , etc) in the describe

Hive assert()?

2011-05-26 Thread Igor Tatarinov
I would like to implement some kind of assert functionality in Hive QL. Here is how I do it in MySQL. I can assert that a given query returns no (bad) rows by creating a table with one row containing '1' and a unique index. Then, I try to insert into that table select 1 from (query). If the query
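The unique-index trick described above can be sketched in a self-contained way. This is only an illustration, not Hive or MySQL code: it uses Python's built-in sqlite3 module as a stand-in database, and the names assert_guard, prices, and assert_no_rows are hypothetical. It also assumes the insert is meant to fail whenever the checked query returns any row.

```python
import sqlite3


def assert_no_rows(conn, query):
    """Return True if `query` yields no rows, False otherwise.

    The guard table already holds the value 1 under a unique constraint,
    so inserting another 1 for any row of `query` violates that constraint.
    """
    try:
        with conn:  # commits on success, rolls back if the insert fails
            conn.execute(f"INSERT INTO assert_guard SELECT 1 FROM ({query})")
    except sqlite3.IntegrityError:
        return False
    return True


conn = sqlite3.connect(":memory:")
# Guard table: one row containing 1, plus a unique constraint (hypothetical names).
conn.execute("CREATE TABLE assert_guard (flag INTEGER UNIQUE)")
conn.execute("INSERT INTO assert_guard VALUES (1)")
conn.execute("CREATE TABLE prices (item TEXT, price REAL)")
conn.executemany("INSERT INTO prices VALUES (?, ?)", [("a", 1.5), ("b", 2.0)])
conn.commit()

bad_rows_query = "SELECT * FROM prices WHERE price < 0"
ok_before = assert_no_rows(conn, bad_rows_query)  # no bad rows yet

conn.execute("INSERT INTO prices VALUES ('c', -3.0)")
conn.commit()
ok_after = assert_no_rows(conn, bad_rows_query)  # one bad row now
```

When the checked query is empty, the insert adds nothing and succeeds; a single bad row is enough to trigger the constraint violation, which is what makes the insert behave like an assertion.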

Re: Hive assert()?

2011-05-26 Thread Alex Kozlov
1) Would `select count(1) from (query)` do the same thing? I am a bit confused about the semantics of assert: is it just a check for no rows, or some kind of syntax error check?
2) Hive is not an OLTP system and is not optimized for single-row inserts (or updates, for that matter). In a trivial implementation one
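The count-based check from point 1 can be sketched as follows. This is a minimal illustration that assumes the caller inspects the count and decides what to do. It runs against Python's built-in sqlite3 module rather than Hive, and the prices table and its contents are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE prices (item TEXT, price REAL)")  # hypothetical table
conn.executemany("INSERT INTO prices VALUES (?, ?)", [("a", 1.5), ("b", -2.0)])

# Count the rows returned by the query under test; zero means the assertion holds.
bad_rows = conn.execute(
    "SELECT count(1) FROM (SELECT * FROM prices WHERE price < 0)"
).fetchone()[0]
assertion_holds = bad_rows == 0
```

Unlike the insert-based trick, this reports the result without failing the job; stopping the pipeline is left to whatever reads the count.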

Re: Hive assert()?

2011-05-26 Thread Edward Capriolo
You can write a UDF. If the UDF throws an exception, that will end your Hive job.
On Thu, May 26, 2011 at 5:46 PM, Igor Tatarinov i...@decide.com wrote:
> Here is one example. I want to make sure I don't have negative prices in my data. I would like to write something like:
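The suggestion of a UDF that throws can be sketched by analogy. The example below is not a Hive UDF (which would be written in Java against Hive's own classes); it registers a user-defined function with Python's built-in sqlite3 module to show the same idea of an exception inside the function aborting the query. The function name assert_non_negative and the prices table are hypothetical.

```python
import sqlite3


def assert_non_negative(value):
    """Pass the value through, or raise to abort the running query."""
    if value is not None and value < 0:
        raise ValueError(f"assertion failed: negative value {value}")
    return value


conn = sqlite3.connect(":memory:")
# Register the Python function as a one-argument SQL function.
conn.create_function("assert_non_negative", 1, assert_non_negative)
conn.execute("CREATE TABLE prices (item TEXT, price REAL)")
conn.executemany("INSERT INTO prices VALUES (?, ?)", [("a", 1.5), ("b", -2.0)])

try:
    conn.execute("SELECT assert_non_negative(price) FROM prices").fetchall()
    query_failed = False
except sqlite3.OperationalError:
    # sqlite3 reports an exception raised inside a user function this way.
    query_failed = True
```

The check runs row by row as part of the query itself, so no guard table or separate counting step is needed.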

Re: questions about statistics in 0.7

2011-05-26 Thread Ning Zhang
On May 26, 2011, at 1:28 PM, Guy Bayes wrote:
> Crap, sorry, hit send too early. Questions:
> 1: Job overhead of generating statistics on the fly with set hive.stats.autogather=true;?
Overhead is minimal. The only accountable overhead is to insert a row into an RDBMS/HBase at the end of a task. At