Hello, I'm watching this thread with interest, since that's the exact example I'll be trying to run, and Silvio appears to be asking about running Giraph 1.0 against Hadoop 2.0.0.

I'm curious: is Giraph 1.0 with Hadoop 2.0.x a known working combination? If so, are there patches to apply first, and what Maven command line should I build with? (i.e., will -Dhadoop.version=2.0.5 give the needed MR-based API, or will it use YARN?) I figure it's better to ask than to aimlessly try all the combinations. :)

Thanks!

On 13-10-07 11:01 AM, Claudio Martella wrote:
Try increasing the mappers' heap space through mapred.child.java.opts.
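
A minimal sketch of how that setting might be passed on the command line, assuming GiraphRunner accepts Hadoop's generic -D options ahead of the vertex class (it is launched through ToolRunner, per the stack trace quoted further down). The -Xmx2g heap size is an illustrative placeholder, not a value from this thread, and the script only prints the command instead of running it.

```shell
#!/bin/sh
# Dry run: print the hadoop command instead of executing it.
# HEAP is an illustrative placeholder (assumption); tune it to the cluster.
HEAP="-Xmx2g"
JAR="giraph-examples-1.0.0-for-hadoop-2.0.0-cdh4.1.2-jar-with-dependencies.jar"

build_cmd() {
  # Assumption: the -D generic option goes right after the runner class,
  # before Giraph's own arguments (taken from the command quoted in this thread).
  printf '%s ' hadoop jar "$JAR" org.apache.giraph.GiraphRunner \
    "-Dmapred.child.java.opts=$HEAP" \
    org.apache.giraph.examples.ConnectedComponentsVertex \
    -vif org.apache.giraph.io.formats.IntIntNullTextInputFormat \
    -vip /user/hdfs/lista_adj_txt \
    -of org.apache.giraph.io.formats.IdWithValueTextOutputFormat \
    -op connectedgiraph \
    --workers 4
  printf '\n'
}

build_cmd
```

If the printed command looks right, it can be pasted into a shell on a node where the hadoop client is installed.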


On Mon, Oct 7, 2013 at 1:18 PM, Silvio Di gregorio <[email protected]> wrote:


    I think my issue is now a different one. Thanks to all.

    java.lang.IllegalStateException: run: Caught an unrecoverable exception waitFor: ExecutionException occurred while waiting for org.apache.giraph.utils.ProgressableUtils$FutureWaitable@73a01e8a
        at org.apache.giraph.graph.GraphMapper.run(GraphMapper.java:102)
        ......
    Caused by: java.lang.IllegalStateException: waitFor: ExecutionException occurred while waiting for org.apache.giraph.utils.ProgressableUtils$FutureWaitable@73a01e8a
        at org.apache.giraph.utils.ProgressableUtils.waitFor(ProgressableUtils.java:151)
        ... 7 more
    Caused by: java.util.concurrent.ExecutionException: java.lang.OutOfMemoryError: Java heap space
        at org.apache.giraph.utils.ProgressableUtils.waitFor(ProgressableUtils.java:143)
        ... 13 more



    2013/10/7 Claudio Martella <[email protected]>

        OK, thanks.

        I really have to push that patch in.


        On Mon, Oct 7, 2013 at 12:17 PM, Silvio Di gregorio <[email protected]> wrote:

            Yes, I do.
            I saw this in your post at:
            http://www.mail-archive.com/[email protected]/msg00957.html

            Sorry; had I checked the mail archive first, I would have
            avoided the last post.

            The ZK issues are now resolved.


            2013/10/7 Claudio Martella <[email protected]>

                Can you try applying this one first?

                http://www.mail-archive.com/[email protected]/msg00945/check.diff


                On Mon, Oct 7, 2013 at 8:40 AM, Silvio Di gregorio <[email protected]> wrote:

                    As I said, I have built

                    giraph-examples-1.0.0-for-hadoop-2.0.0-cdh4.1.2-jar-with-dependencies.jar

                    for cdh4 successfully. The job starts and begins reporting progress:

                    13/10/07 08:28:45 INFO mapred.JobClient:  map 0% reduce 0%

                    but then

                    Error running child
                    java.lang.IllegalStateException: run: Caught an unrecoverable exception java.io.FileNotFoundException: File _bsp/_defaultZkManagerDir/job_201309181636_0678/_zkServer does not exist.

                    ...

                    Caused by: java.lang.RuntimeException: java.io.FileNotFoundException: File _bsp/_defaultZkManagerDir/job_201309181636_0678/_zkServer does not exist.


                    2013/10/5 Silvio Di gregorio <[email protected]>

                        I have built with the hadoop_cdh4.1.2 parameter.
                        Something has changed; on Monday I will report the
                        result. The farm is closed for now.

                        On 5 Oct 2013 at 14:06, "Claudio Martella" <[email protected]> wrote:

                            Oh, right, -vof is in trunk. Anyway, it looks
                            like you built Giraph for the wrong profile.
                            You mentioned you're running on 2.0, but
                            your Giraph is built for 0.20.203. Try
                            building with a profile for your Hadoop
                            version.
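
A rough sketch of how such a mismatch might be spotted, assuming the jar keeps the "-for-hadoop-<version>-jar-with-dependencies.jar" naming seen in this thread. The function name jar_hadoop_version is made up for illustration. The final line only prints a possible rebuild command: the profile name is the one mentioned elsewhere in this thread, and -DskipTests is an optional assumption.

```shell
#!/bin/sh
# Extract the Hadoop version a Giraph jar was built for from its file name.
# Relies on the "-for-hadoop-<version>-jar-with-dependencies.jar" naming seen in this thread.
jar_hadoop_version() {
  basename "$1" | sed -n 's/.*-for-hadoop-\(.*\)-jar-with-dependencies\.jar$/\1/p'
}

JAR="giraph-examples-1.0.0-for-hadoop-0.20.203.0-jar-with-dependencies.jar"
BUILT_FOR="$(jar_hadoop_version "$JAR")"
echo "jar built for Hadoop $BUILT_FOR"

# On the cluster, compare this against the first line of `hadoop version`
# (reported as "Hadoop 2.0.0-cdh4.3.0" in this thread).
# If they differ, a rebuild with a matching profile might look like this
# (printed only, not executed; -DskipTests is an assumption):
echo "mvn -Phadoop_cdh4.1.2 -DskipTests clean package"
```

The name-based check is only a hint; the authoritative answer is whichever Maven profile the jar was actually built with.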


                            On Fri, Oct 4, 2013 at 2:35 PM, Silvio Di gregorio <[email protected]> wrote:


                                
                                org.apache.commons.cli.UnrecognizedOptionException: Unrecognized option: -vof

                                In version 1.0 the options are:
                                  -of,--outputFormat <arg>   Vertex output format
                                  -op,--outputPath <arg>     Vertex output path



                                2013/10/4 Claudio Martella <[email protected]>

                                    Did you try the argument (-vof) I
                                    suggested?


                                    On Fri, Oct 4, 2013 at 2:13 PM, Silvio Di gregorio <[email protected]> wrote:


                                        I've specified -of org.apache.giraph.io.formats.IdWithValueTextOutputFormat

                                        but the same error was produced:

                                        Exception in thread "main" java.lang.IncompatibleClassChangeError: Found interface org.apache.hadoop.mapreduce.JobContext, but class was expected
                                                at org.apache.giraph.bsp.BspOutputFormat.checkOutputSpecs(BspOutputFormat.java:43)
                                                at org.apache.hadoop.mapred.JobClient$2.run(JobClient.java:984)
                                                at org.apache.hadoop.mapred.JobClient$2.run(JobClient.java:945)
                                                at java.security.AccessController.doPrivileged(Native Method)
                                                at javax.security.auth.Subject.doAs(Subject.java:396)
                                                at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1408)
                                                at org.apache.hadoop.mapred.JobClient.submitJobInternal(JobClient.java:945)
                                                at org.apache.hadoop.mapreduce.Job.submit(Job.java:566)
                                                at org.apache.hadoop.mapreduce.Job.waitForCompletion(Job.java:596)
                                                at org.apache.giraph.job.GiraphJob.run(GiraphJob.java:237)
                                                at org.apache.giraph.GiraphRunner.run(GiraphRunner.java:94)
                                                at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
                                                at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:84)
                                                at org.apache.giraph.GiraphRunner.main(GiraphRunner.java:124)
                                                at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
                                                at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
                                                at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
                                                at java.lang.reflect.Method.invoke(Method.java:597)
                                                at org.apache.hadoop.util.RunJar.main(RunJar.java:208)



                                        2013/10/4 Claudio Martella <[email protected]>

                                            Hi,

                                            You need to specify the
                                            vertex output format class
                                            (-vof option), e.g.
                                            org.apache.giraph.io.formats.IdWithValueTextOutputFormat.


                                            On Fri, Oct 4, 2013 at 1:06 PM, Silvio Di gregorio <[email protected]> wrote:


                                                Hi,

                                                I hope I have sent this to
                                                the right address.

                                                I have a graph (directed
                                                and unweighted) stored
                                                in HDFS as an adjacency
                                                list (140 million edges,
                                                6 million vertices):

                                                node<tab>neighbors

                                                23   2   1    3    4    3

                                                1 999 99923 909 .......

                                                ......

                                                Hadoop version: Hadoop
                                                2.0.0-cdh4.3.0, Java 1.6


                                                I have executed the
                                                giraph-1.0 connected
                                                components example in
                                                this fashion:

                                                    > hadoop jar /usr/local/giraph/giraph-examples/target/giraph-examples-1.0.0-for-hadoop-0.20.203.0-jar-with-dependencies.jar \
                                                        org.apache.giraph.GiraphRunner \
                                                        org.apache.giraph.examples.ConnectedComponentsVertex \
                                                        -vif org.apache.giraph.io.formats.IntIntNullTextInputFormat \
                                                        -vip /user/hdfs/lista_adj_txt \
                                                        -op connectedgiraph \
                                                        --workers 4

                                                and then it fails with:

                                                13/10/04 09:28:29 INFO utils.ConfigurationUtils: No edge input format specified. Ensure your InputFormat does not require one.
                                                13/10/04 09:28:29 INFO utils.ConfigurationUtils: No output format specified. Ensure your OutputFormat does not require one.
                                                13/10/04 09:28:30 INFO job.GiraphJob: run: Since checkpointing is disabled (default), do not allow any task retries (setting mapred.map.max.attempts = 0, old value = 4)
                                                13/10/04 09:28:31 WARN mapred.JobClient: Use GenericOptionsParser for parsing the arguments. Applications should implement Tool for the same.
                                                13/10/04 09:28:31 INFO mapred.JobClient: Cleaning up the staging area hdfs://srv-bigdata-dev-01.int.sose.it:8020/user/hdfs/.staging/job_201309181636_0535

                                                Exception in thread "main" java.lang.IncompatibleClassChangeError: Found interface org.apache.hadoop.mapreduce.JobContext, but class was expected
                                                        at org.apache.giraph.bsp.BspOutputFormat.checkOutputSpecs(BspOutputFormat.java:43)

                                                ......

                                                Thanks in advance





                                            --
                                            Claudio Martella
                                            [email protected]







--
Matthew Laird
Lead Software Developer, Bioinformatics
Brinkman Laboratory
Simon Fraser University, Burnaby, BC, Canada
