Tajo C++ worker

Yu Dongmin Wed, 22 Jan 2014 03:20:09 -0800

Hello,

I'm very glad to hear that Tajo is getting stable and production ready.


Here's another news. Some of us might have have seen some Hyunsik's 
presentations commenting about an experimental project, c++ tajo worker.


Recently, I've been working on the c++ worker, even though it is a little bit 
far from behind schedule than what I expected, 
it is able to communicate with Tajo Master and Query Master successfully. 

This is never a replacement of Java worker, but an exchangeable worker. We 
could use Java workers as is, C++ workers only, or mixed workers. 

It is designed as a vectorized execution engine hoping to process certain types 
of data structures very efficiently. 

These are the supported features right now.

- Reading and parsing csv files in the hadoop data node.
- Filtering rows within LLVM code generated evaluation 
- Simple scalar functions
- Simple group by aggregation functions

Now I'm working on 'order by' clause and doing profiling to get expected 
performance.

Working on these stuffs, I hope community to allow creating a new git branch, 
say native_worker, cplus_worker or a nicer name, to make this project together. 
 


There's still a long way to go on this project, but it could be improved with 
help of the Tajo community.


Thanks
Min

Tajo C++ worker

Reply via email to