Re: [ANNOUNCEMENT] A query system for BSP processing

Leonidas Fegaras Thu, 30 Aug 2012 07:42:21 -0700

Yes sure. I have fixed the bug with the repeat stopping condition but I have only tested pagerank on my small cluster. I still need to fix the k-means clustering (it's a special case because you improve a fixed number of points).

Leonidas

On Aug 30, 2012, at 9:02 AM, Edward J. Yoon wrote:

Shall we work together?
On Fri, Aug 24, 2012 at 9:01 PM, Leonidas Fegaras <[email protected]> wrote:
Thank you very much for your interest and for testing my system.
It seems that my release was premature: it worked for some random data but didn't for some others. It's a minor logical error that I will try to fix in the next few days. The problem is with the stopping condition of the repeat expression that calculates the new pagerank from the old. It must stop if ALL peers reach the specified precision. This is done by having those peers that need to continue send a message to others to continue. It seems that now when all peers agree at the same time, the program works fine. But if one finishes sooner, instead of continuing the repeat loop, it runs away to the next BSP step that follows the repeat, then exits prematurely and the system hangs. The casting errors are due to the run-away peers executing the wrong BSP steps reading wrong messages. Queries without repeat though are OK.
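The stopping rule described above can be sketched as a small single-process simulation. This is only an illustration, not MRQL or Hama code: the class `RepeatVoteSketch`, the `Peer` type, and the "halve the error each round" model are all hypothetical. The point it shows is that every peer takes the same decision after the barrier, so a peer that converges early stays in the loop instead of running ahead.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical simulation of the "continue vote"; it does not use the Hama API.
final class RepeatVoteSketch {

    /** One simulated peer. Assumption: its local error halves each round. */
    static final class Peer {
        double error;

        Peer(double initialError) {
            this.error = initialError;
        }

        void improve() {
            error /= 2.0;
        }

        boolean needsAnotherRound(double precision) {
            return error > precision;
        }
    }

    /**
     * Runs the repeat loop until no peer asks to continue.
     * Returns the number of rounds that every peer executed.
     */
    static int runRepeat(List<Peer> peers, double precision) {
        int rounds = 0;
        boolean anyContinue = true;
        while (anyContinue) {
            // Compute phase: all peers take part, including already converged ones.
            for (Peer p : peers) {
                p.improve();
            }
            rounds++;
            // Communication phase: each unconverged peer "sends" a continue message.
            int continueMessages = 0;
            for (Peer p : peers) {
                if (p.needsAnotherRound(precision)) {
                    continueMessages++;
                }
            }
            // After the barrier all peers see the same count, so they all
            // either repeat or leave the loop together.
            anyContinue = continueMessages > 0;
        }
        return rounds;
    }

    public static void main(String[] args) {
        List<Peer> peers = new ArrayList<>();
        peers.add(new Peer(1.0)); // converges after 1 round
        peers.add(new Peer(8.0)); // converges after 4 rounds
        System.out.println("rounds = " + runRepeat(peers, 0.5)); // prints "rounds = 4"
    }
}
```

In this sketch the first peer reaches the precision after one round but still runs four, because the second peer keeps voting to continue.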
By the way, I had a problem exchanging large amounts of data during sync (I discussed this with Thomas). My solution was to break a BSP superstep into multiple substeps so that each substep can handle a max number of messages. Of course my program has to collect all messages in a vector in memory. When the vector is too big, it is spilled to a local file. This moved the problem from the Hama side to my side and allowed me to handle larger data, especially in joins. I think this problem of exchanging large amounts of data during a superstep is currently a weakness of Hama.
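A rough sketch of the substep idea, under stated assumptions: messages are plain strings without newlines, and the names `SubstepSketch`, `splitIntoSubsteps`, and `SpillingBuffer` are hypothetical, not taken from MRQL or Hama. The sender side cuts its outgoing messages into batches of bounded size (one batch per substep), and the receiver side collects them in memory and appends to a local temporary file once the in-memory vector grows past a limit.

```java
import java.io.BufferedWriter;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardOpenOption;
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only; it does not use the Hama API.
final class SubstepSketch {

    /** Splits outgoing messages into batches of at most maxPerSubstep, one per substep. */
    static List<List<String>> splitIntoSubsteps(List<String> outgoing, int maxPerSubstep) {
        if (maxPerSubstep <= 0) {
            throw new IllegalArgumentException("maxPerSubstep must be positive");
        }
        List<List<String>> batches = new ArrayList<>();
        for (int i = 0; i < outgoing.size(); i += maxPerSubstep) {
            int end = Math.min(i + maxPerSubstep, outgoing.size());
            batches.add(new ArrayList<>(outgoing.subList(i, end)));
        }
        return batches;
    }

    /** Receiver-side buffer that spills to a local temp file when memory holds too many messages. */
    static final class SpillingBuffer {
        private final int maxInMemory;
        private final List<String> memory = new ArrayList<>();
        private Path spillFile; // created lazily on the first spill
        private int spilled = 0;

        SpillingBuffer(int maxInMemory) {
            this.maxInMemory = maxInMemory;
        }

        /** Adds one message (assumed to contain no newline) and spills if over the limit. */
        void add(String message) {
            memory.add(message);
            if (memory.size() > maxInMemory) {
                spill();
            }
        }

        private void spill() {
            try {
                if (spillFile == null) {
                    spillFile = Files.createTempFile("substep-spill", ".txt");
                    spillFile.toFile().deleteOnExit();
                }
                try (BufferedWriter w =
                         Files.newBufferedWriter(spillFile, StandardOpenOption.APPEND)) {
                    for (String m : memory) {
                        w.write(m);
                        w.newLine();
                    }
                }
            } catch (IOException e) {
                throw new UncheckedIOException(e);
            }
            spilled += memory.size();
            memory.clear();
        }

        /** Returns all messages in arrival order: spilled ones first, then those in memory. */
        List<String> readAll() {
            List<String> all = new ArrayList<>();
            try {
                if (spillFile != null) {
                    all.addAll(Files.readAllLines(spillFile));
                }
            } catch (IOException e) {
                throw new UncheckedIOException(e);
            }
            all.addAll(memory);
            return all;
        }

        int spilledCount() {
            return spilled;
        }
    }

    public static void main(String[] args) {
        List<String> outgoing = new ArrayList<>();
        for (int i = 0; i < 10; i++) {
            outgoing.add("m" + i);
        }
        SpillingBuffer buffer = new SpillingBuffer(3);
        for (List<String> batch : splitIntoSubsteps(outgoing, 4)) {
            // In a real BSP job a barrier sync would separate the substeps here.
            for (String m : batch) {
                buffer.add(m);
            }
        }
        System.out.println(buffer.readAll().size() + " messages, "
            + buffer.spilledCount() + " spilled");
    }
}
```

The trade-off is the one mentioned above: more barrier syncs per logical superstep in exchange for bounded message volume per sync, with the memory pressure handled by the application rather than by the framework.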
Leonidas



On 08/24/2012 04:15 AM, Thomas Jungblut wrote:
BTW, should we feature this on our website?

2012/8/24 Thomas Jungblut <[email protected]>
Hi Leonidas!

I have to admit that I have known what is going on (and had to keep
silent), but I have to say: Thank you very much!
This will help many people write BSPs in an easier way.
Of course this is not as fast as native BSP code; Hive and Pig suffer
from the same problem in MR.
But it gives people the opportunity to develop faster and get their code
into production with just a minor time expense.
And I think that we will gladly help you improve the BSP part of
your framework. At least I would ;)

Thanks!

2012/8/24 Edward J. Yoon <[email protected]>
Here are a few test results on Oracle BDA (40G/s infiniband network).
It seems slower than our PageRank example.

P.S., There are some errors so I couldn't test large-scale.
(java.lang.ClassCastException: hadoop.mrql.MR_int cannot be cast to hadoop.mrql.Inv and java.lang.Error: Cannot clear a non-materialized
sequence ..., etc.)



== 100K nodes and 1M edges ==
*** Using 10 BSP tasks (out of a max 10). Each task will handle about
2383611 bytes of input data.

Run time: 30.384 secs
*** Using 20 BSP tasks (out of a max 20). Each task will handle about
1191805 bytes of input data.

Run time: 24.412 secs
On Fri, Aug 24, 2012 at 9:36 AM, Edward J. Yoon <[email protected]>
wrote:
Wow, very interesting. I'm going to install and test on my large
cluster.
On Fri, Aug 24, 2012 at 4:41 AM, Leonidas Fegaras <[email protected]>
wrote:
Dear Hama users,
I am pleased to announce that the MRQL query processing system can now evaluate SQL-like queries on a Hama cluster. MRQL is available at:
http://lambda.uta.edu/mrql/
MRQL (the Map-Reduce Query Language) is an SQL-like query language for large-scale, distributed data analysis. MRQL is powerful enough to express most common data analysis tasks over many different kinds of raw data, including hierarchical data and nested collections, such as XML data. MRQL can run in two modes: in MR (Map-Reduce) mode using Apache Hadoop and in BSP (Bulk Synchronous Parallel) mode using Apache Hama. Both modes use Apache's HDFS to read and write their data.
Note that the BSP mode is currently experimental (not fine-tuned yet) and lacks any fault-tolerance (if an error occurs, the entire job must be restarted). Due to our limited resources, MRQL has only been tested on a small cluster (7-nodes/28-cores). We compared the BSP mode with the MR mode by evaluating a pagerank query over a small graph (100K nodes, 1M edges) and found that BSP mode is about 4.5 times faster than the MR mode. Please let me know if you'd like to contribute to this project by testing MRQL on a larger cluster.
Best regards,
Leonidas Fegaras
University of Texas at Arlington
--
Best Regards, Edward J. Yoon
@eddieyoon
