hi Martin,

Mnay thanks for your kindly help.

I could find little performance case of greenplum/hawq on google,especialy on 10 billion row data.Your replying inspire confidence in me.:-)

our real-time query only returns hundreds of row from a huge table. I'll test and tuning HAWQ after our machines are avaliable to approve the performance.

Thank you again for your promptly repling.


Best regards!

Tony.


在 2016/1/29 17:29, Martin Visser 写道:
Hi,

for queries like that there are a couple of functionalities of HAWQ that will help you. One is columnar storage like Parquet. This will help you when you are only selecting columns a,b,c and the table has columns a,b,...z The other functionality that will help you is partitioning to reduce the initial set without having to read the data. How to choose the partitioning will depend on your query patterns and the selectivity of the column values. For example in your query you could partition on column a. But as mentioned if a only had values 1 and 2 that would only half the number of rows being scanned etc.

Another observation is that you are selecting individual rows in your example rather than grouped results. Potentially this could result in a lot of data having to be returned by the query. Is that the case? How many rows would you expect queries to return?

The answer for your 10 seconds is it is certainly possible due to HAWQs linear scalability but it depends on a number of factors.

hth
Martin

On Fri, Jan 29, 2016 at 5:34 AM, 陶进 <[email protected] <mailto:[email protected]>> wrote:

    hi guys,

    We have several huge tables,and some of the table would more than
    10 billion rows.each table had the same  columns,each row is about
    100 Byte.

    Our query run on each  singal table to filter and sort some
    records,such as select a,b,c from t where a=1 b='hello' order by 1,2.

    Now we use mongodb,and the bigest table had 4 billion rows.it
    <http://rows.it> could returned in 10 seconds.Now we want to use
    hawq as our query engine.Could they run the above query in 10
    seconds?  what the hardware of the server?how many node would need?


    Thanks.

    ---
    Avast 防毒软件已对此电子邮件执行病毒检查。
    https://www.avast.com/antivirus





---
Avast 防毒软件已对此电子邮件执行病毒检查。
https://www.avast.com/antivirus

Reply via email to