Re: Giraph as Whirr service, see WHIRR-530

2012-04-05 Thread Brian Femiano
I've used it on clusters I started on EC2 launched by Whirr. Simply copy the fat jar to your client machine and it will distribute normally as a M/R dependency. It works very well. The only limitation I could potentially find (without much proof) was on VMs with limited IO the RPC message

Re: On helping new contributors pitch in quickly...

2012-04-05 Thread Dan Brickley
On 5 April 2012 05:49, Jakob Homan jgho...@gmail.com wrote: Ack!, I suck.  Sorry.  I hadn't realized we'd gone through most of them, which itself is a good thing.  I'll get some new ones added first thing in the morning.  Sorry. Do we have something around document a workflow to get RDF graph

Re: On helping new contributors pitch in quickly...

2012-04-05 Thread Avery Ching
Dan, you're definitely right that this has been mentioned a few times. The multigraph issue is one part of it, but a helper VertexInputFormat (and maybe VertexOutputFormat) would certainly still help as you mention. Can you please open a JIRA (and help if you have time)? Avery On 4/5/12

Re: On helping new contributors pitch in quickly...

2012-04-05 Thread Paolo Castagna
Hi Dan, I have not an answer to your questions/observations yet. However, I suspect N-Triples | N-Quads might not be the best option for something like Giraph. Something more like an adjacency list might be better. So, my intuition, is that if you start with RDF in N-Triples format, the first

Re: On helping new contributors pitch in quickly...

2012-04-05 Thread Avery Ching
Here is a related JIRA https://issues.apache.org/jira/browse/GIRAPH-155 Avery On 4/5/12 9:45 AM, Paolo Castagna wrote: Hi Dan, I have not an answer to your questions/observations yet. However, I suspect N-Triples | N-Quads might not be the best option for something like Giraph. Something more

[jira] [Commented] (GIRAPH-168) Simplify munge directive usage with new munge flag HADOOP_SECURE (rather than HADOOP_FACEBOOK) and remove usage of HADOOP

2012-04-05 Thread Eugene Koontz (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/GIRAPH-168?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13247396#comment-13247396 ] Eugene Koontz commented on GIRAPH-168: -- Hi Jakob, I wonder if HADOOP_NO_SASL might be

[jira] [Commented] (GIRAPH-170) Workflow for loading RDF graph data into Giraph

2012-04-05 Thread Dan Brickley (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/GIRAPH-170?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13247490#comment-13247490 ] Dan Brickley commented on GIRAPH-170: - From Paulo in email: I suspect N-Triples |

[jira] [Created] (GIRAPH-170) Workflow for loading RDF graph data into Giraph

2012-04-05 Thread Dan Brickley (Created) (JIRA)
Workflow for loading RDF graph data into Giraph --- Key: GIRAPH-170 URL: https://issues.apache.org/jira/browse/GIRAPH-170 Project: Giraph Issue Type: New Feature Reporter: Dan Brickley

Re: On helping new contributors pitch in quickly...

2012-04-05 Thread Dan Brickley
On 5 April 2012 17:05, Avery Ching ach...@apache.org wrote: Dan, you're definitely right that this has been mentioned a few times.  The multigraph issue is one part of it, but a helper VertexInputFormat (and maybe VertexOutputFormat) would certainly still help as you mention.  Can you please

[jira] [Commented] (GIRAPH-170) Workflow for loading RDF graph data into Giraph

2012-04-05 Thread Dan Brickley (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/GIRAPH-170?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13247506#comment-13247506 ] Dan Brickley commented on GIRAPH-170: - Another architectural note around RDF: RDF is

[jira] [Commented] (GIRAPH-170) Workflow for loading RDF graph data into Giraph

2012-04-05 Thread Paolo Castagna (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/GIRAPH-170?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13247542#comment-13247542 ] Paolo Castagna commented on GIRAPH-170: --- bq. we may want to consider therefore

[jira] [Created] (GIRAPH-171) total time in MasterThread.run() is calculated incorrectly

2012-04-05 Thread Eugene Koontz (Created) (JIRA)
total time in MasterThread.run() is calculated incorrectly -- Key: GIRAPH-171 URL: https://issues.apache.org/jira/browse/GIRAPH-171 Project: Giraph Issue Type: Bug

[jira] [Updated] (GIRAPH-171) total time in MasterThread.run() is calculated incorrectly

2012-04-05 Thread Eugene Koontz (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/GIRAPH-171?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Koontz updated GIRAPH-171: - Attachment: GIRAPH-171.patch total time in MasterThread.run() is calculated incorrectly