Re: Nutch 2.0 updatedb and gora query

2013-01-31 Thread kiran chitturi
Hi Lewis, I am using Gora 0.2.1 and HBase 0.90.5. I started from scratch and ran a step-by-step crawl (inject, generate, fetch, parse, dbUpdate), starting from a single seed. The first four phases went well, and the metadata, outlinks, fetch, and parse fields were extracted and saved in

Re: Nutch 2.0 updatedb and gora query

2013-01-30 Thread kiran chitturi
Link to the reference (http://lucene.472066.n3.nabble.com/Inlinks-not-being-saved-in-the-database-td4037067.html) and jira (https://issues.apache.org/jira/browse/NUTCH-1524) On Wed, Jan 30, 2013 at 12:25 PM, kiran chitturi chitturikira...@gmail.com wrote: Hi, I have posted a similar issue in

Re: Nutch 2.0 updatedb and gora query

2013-01-30 Thread alxsss
I see that inlinks are saved as ol in hbase. Alex. -----Original Message----- From: kiran chitturi chitturikira...@gmail.com To: user user@nutch.apache.org Sent: Wed, Jan 30, 2013 9:31 am Subject: Re: Nutch 2.0 updatedb and gora query Link to the reference ( http://lucene.472066.n3

Re: Nutch 2.0 updatedb and gora query

2013-01-30 Thread Lewis John Mcgibbney
To: user user@nutch.apache.org Sent: Wed, Jan 30, 2013 9:31 am Subject: Re: Nutch 2.0 updatedb and gora query Link to the reference ( http://lucene.472066.n3.nabble.com/Inlinks-not-being-saved-in-the-database-td4037067.html ) and jira (https://issues.apache.org/jira/browse/NUTCH-1524

Re: Nutch 2.0 updatedb and gora query

2013-01-30 Thread kiran chitturi
, Kiran. On Wed, Jan 30, 2013 at 1:43 PM, alx...@aim.com wrote: I see that inlinks are saved as ol in hbase. Alex. -----Original Message----- From: kiran chitturi chitturikira...@gmail.com To: user user@nutch.apache.org Sent: Wed, Jan 30, 2013 9:31 am Subject: Re: Nutch 2.0 updatedb

Re: Nutch 2.0 updatedb and gora query

2013-01-30 Thread alxsss
Subject: Re: Nutch 2.0 updatedb and gora query I have checked the database after the dbupdate job has run, and I could see only markers, signature and fetch fields. The initial seed, which was crawled and parsed, has only outlinks. I notice one of the outlinks is actually the inlink. Aren't inlinks

Re: Nutch 2.0 updatedb and gora query

2013-01-30 Thread Lewis John Mcgibbney
Hi Kiran, On Wed, Jan 30, 2013 at 11:10 AM, kiran chitturi chitturikira...@gmail.com wrote: I have checked the database after the dbupdate job has run, and I could see only markers, signature and fetch fields. Which Gora artifacts are you using? We've recently fixed a bug in gora-cassandra [0]
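
For readers following the inlinks question raised in this thread: the general idea behind an update step that derives inlinks is to invert the outlinks recorded for each parsed page. The sketch below is only a toy model of that inversion in Python. It is not Nutch's or Gora's code; the function name `invert_outlinks`, the dictionary layout, and the example.org URLs are all hypothetical and chosen for illustration.

```python
from collections import defaultdict


def invert_outlinks(outlinks_by_page):
    """Build an inlink map from an outlink map (toy model, not Nutch code).

    outlinks_by_page maps a source URL to {target URL: anchor text}.
    The result maps each target URL to {source URL: anchor text}.
    """
    inlinks = defaultdict(dict)
    for source, outlinks in outlinks_by_page.items():
        for target, anchor in outlinks.items():
            # Every outlink source -> target is an inlink of target from source.
            inlinks[target][source] = anchor
    return dict(inlinks)


# Hypothetical crawl state: a single seed plus one fetched and parsed outlink.
pages = {
    "http://example.org/": {
        "http://example.org/a": "A",
        "http://example.org/b": "B",
    },
    "http://example.org/a": {
        "http://example.org/": "home",
    },
}

inlinks = invert_outlinks(pages)
for url in sorted(inlinks):
    print(url, inlinks[url])
```

In this model a page only gains inlinks once some other page that links to it has been parsed, so a lone seed has outlinks but no inlinks after the first round. Whether that explains the behaviour reported in this thread is not established here.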