Hi John, I removed the directory and repeated the process, and it seems
fine now.

Regarding TPCH, thanks for making me aware of FastBit's limitations.

Here is what I intend to do; let me know what you think:
1. Dates can be saved as numbers (epoch) and translated back and forth.
2. I've seen that FastBit can handle two-table joins, so I guess I'll be able
to perform multi-table joins in several phases?

3. I have some questions to better understand how the code works. I didn't
find documentation (maybe I missed it):
    a. How are the group-by operations done?
    b. How does the join algorithm work?
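
The epoch idea in item 1 could be sketched roughly as follows. This is only
an illustration in Python, not FastBit code: the helper names date_to_epoch
and epoch_to_date are hypothetical, and it assumes the TPCH dates are plain
YYYY-MM-DD strings interpreted as midnight UTC.

```python
from datetime import datetime, timezone

def date_to_epoch(text):
    """Convert a 'YYYY-MM-DD' string to seconds since 1970-01-01 UTC."""
    parsed = datetime.strptime(text, "%Y-%m-%d").replace(tzinfo=timezone.utc)
    return int(parsed.timestamp())

def epoch_to_date(seconds):
    """Convert seconds since 1970-01-01 UTC back to a 'YYYY-MM-DD' string."""
    return datetime.fromtimestamp(seconds, tz=timezone.utc).strftime("%Y-%m-%d")

# Round trip on a shipdate value taken from the sample rows in this thread.
ship = date_to_epoch("1996-03-13")
assert epoch_to_date(ship) == "1996-03-13"
```

With the dates stored this way, a date range condition becomes an ordinary
integer range condition on the stored column, and the values from 1996 still
fit in a 32-bit int.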

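The phased join in item 2 might look something like the sketch below. It is
plain Python and says nothing about how FastBit implements its two-table
join: hash_join is a hypothetical helper, the customer and orders rows are
made-up sample data with simplified column names, and each phase is assumed
to materialize an intermediate table that feeds the next pairwise join.

```python
def hash_join(left, right, left_key, right_key):
    """Join two lists of dict rows where left[left_key] == right[right_key]."""
    # Build a lookup from join key to the matching rows on the right side.
    index = {}
    for row in right:
        index.setdefault(row[right_key], []).append(row)
    joined = []
    for lrow in left:
        for rrow in index.get(lrow[left_key], []):
            merged = dict(rrow)   # copy the right row
            merged.update(lrow)   # add (or overwrite with) the left row's columns
            joined.append(merged)
    return joined

# Made-up sample tables (hypothetical values, simplified column names).
customer = [{"custkey": 1, "name": "alice"}, {"custkey": 2, "name": "bob"}]
orders = [{"orderkey": 10, "custkey": 1}, {"orderkey": 11, "custkey": 2}]
lineitem = [
    {"orderkey": 10, "quantity": 17},
    {"orderkey": 10, "quantity": 36},
    {"orderkey": 11, "quantity": 8},
]

# Phase 1: join two tables and keep the result as an intermediate table.
orders_customer = hash_join(orders, customer, "custkey", "custkey")
# Phase 2: join the third table against that intermediate result.
result = hash_join(lineitem, orders_customer, "orderkey", "orderkey")
```

Each phase only ever joins two tables, so the intermediate result would have
to be written out as a new data partition before the next phase can use it.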

thanks.


On Wed, Mar 6, 2013 at 8:30 AM, K. John Wu <[email protected]> wrote:

> Hi, Amihay,
>
> I just took the three rows you included in the message and tried them
> with your ardea command line.  Things seem to have completed
> successfully on my MacBook.  In these three rows, the values of
> returnflag are all 'N'.  Not sure what platform you are using.
>
> Maybe your run of ardea encountered some errors.  If you have
> captured the printout from this command line, would you mind sharing it
> with me?
>
> Not sure what you plan to do with the data files, but one thing you
> should know is that there are many queries from TPCH that cannot be
> handled by FastBit.  For example, FastBit does not deal with dates as
> nicely as a DBMS.  FastBit does not do multi-table joins either.
>
> John
>
>
> On 3/5/13 2:51 PM, amihay gonen wrote:
> > Hi ,
> > I'm trying to set up an environment for testing TPCH queries on FastBit
> > (if anyone has those queries "translated" to ibis query format, that
> > would be great).
> >
> > I've started by trying to convert row data (1G rows) to col data using
> > the ardea tool.
> >
> > For the big table, lineitem, the row data is about 780M and the col
> > data came to 1.8G. This was strange. Looking into the directory where
> > the col files are, I see the following:
> >
> > -rw-rw-r--. 1 agonen agonen 211M Mar  6 00:07 comment
> > -rw-rw-r--. 1 agonen agonen  69M Mar  6 00:07 commitdate
> > -rw-rw-r--. 1 agonen agonen  69M Mar  6 00:07 discount
> > -rw-rw-r--. 1 agonen agonen  69M Mar  6 00:07 linenumber
> > -rw-rw-r--. 1 agonen agonen 104M Mar  6 00:07 linestatus
> > -rw-rw-r--. 1 agonen agonen  69M Mar  6 00:07 orderkey
> > -rw-rw-r--. 1 agonen agonen  69M Mar  6 00:07 partkey
> > -rw-rw-r--. 1 agonen agonen  69M Mar  6 00:07 price
> > -rw-rw-r--. 1 agonen agonen  69M Mar  6 00:07 quantity
> > -rw-rw-r--. 1 agonen agonen  69M Mar  6 00:07 receipdate
> > -rw-rw-r--. 1 agonen agonen 958M Mar  6 00:07 returnflag  <--- too big for varchar(1)
> > -rw-rw-r--. 1 agonen agonen  69M Mar  6 00:07 shipdate
> > -rw-rw-r--. 1 agonen agonen 256M Mar  6 00:07 shipinstuct
> > -rw-rw-r--. 1 agonen agonen 202M Mar  6 00:07 shipmode
> > -rw-rw-r--. 1 agonen agonen  69M Mar  6 00:07 suppkey
> > -rw-rw-r--. 1 agonen agonen  69M Mar  6 00:07 tax
> > -rw-rw-r--. 1 agonen agonen 1.6K Mar  6 00:07 -part.txt
> >
> > The returnflag file, for some reason, contains the whole record.
> >
> > I'm using fastbit-ibis1.3.5, and the command to convert the CSV to
> > col is:
> >
> > ~/Code/fastbit-ibis1.3.5/examples/ardea -d
> > ~/Code/fastbit_col_data/lineitem.tbl -b \| -m
> > "orderkey:int,partkey:int,suppkey:int,linenumber:int,quantity:float,price:float,discount:float,tax:float,returnflag:text,linestatus:key,shipdate:int,commitdate:int,receipdate:int,shipinstuct:text,shipmode:key,comment:text"
> > -t ~/Code/rowdata/lineitem.tbl
> >
> >
> > The base file looks like this:
> >
> >
> > 1|155190|7706|1|17|21168.23|0.04|0.02|N|O|1996-03-13|1996-02-12|1996-03-22|DELIVER IN PERSON|TRUCK|egular courts above the|
> > 1|67310|7311|2|36|45983.16|0.09|0.06|N|O|1996-04-12|1996-02-28|1996-04-20|TAKE BACK RETURN|MAIL|ly final dependencies: slyly bold |
> > 1|63700|3701|3|8|13309.60|0.10|0.02|N|O|1996-01-29|1996-03-05|1996-01-31|TAKE BACK RETURN|REG AIR|riously. regular, express dep|
> >
> >
> > Any idea?
> >
> > Thanks, Amihay
> >
> >
> > _______________________________________________
> > FastBit-users mailing list
> > [email protected]
> > https://hpcrdm.lbl.gov/cgi-bin/mailman/listinfo/fastbit-users
> >
>
