Hi,

I am mostly interested in fetcher job. In this job I see this code

StorageUtils.initMapperJob(currentJob, fields, IntWritable.class,  
FetchEntry.class, FetcherMapper.class, FetchEntryPartitioner.class, false);

In storage utils this function has

DataStore<String, WebPage> store = createWebStore(job.getConfiguration(), 
String.class, WebPage.class);
    if (store==null) throw new RuntimeException("Could not create datastore");
 Query<String, WebPage> query = store.newQuery();
 query.setFields(toStringArray(fields));
 GoraMapper.initMapperJob(job, query, store, outKeyClass, outValueClass, 
mapperClass, partitionerClass, reuseObjects);

 
I followed all these functions but did not find actual code that sends query to 
hbase table.
I believe it is somewhere in gora-hbase.

Thanks.
Alex.


 

 

-----Original Message-----
From: Renato Marroquín Mog
 rovejo <[email protected]>
To: Gora Dev <[email protected]>
Sent: Tue, Feb 26, 2013 8:01 pm
Subject: Re: gora-hbase query


Hi Alex,

The Gora-HBase module is only in charge of querying and persisting
data from any where, not only Nutch. That being said, you want the
part where Nutch populates a map used in different Nutch jobs? Which
jobs are you talking about? Generator? Fetcher? You can probably get
some more lights over in NutchLand.
I am happy to go over the code with you anyways, just please be a
little bit more specific.


Renato M.

2013/2/26  <[email protected]>:
>
> Hello,
>
> Can someone point me the code in gora-hbase that queries hbase and populates 
nutch map key values for varies nutch jobs?
> I plan to use SingleColumnValueFilter to see if it selects only subset of 
records.
>
> Thanks.
> Alex.
>
>

 

Reply via email to