I think most hadoop related projects do prefer configuration, so I
think it's a good idea that Gora does this as well.
Pig prefers configuration as well, but I think studying on how to make
Gora more compatible with other projects should be discussed on
separate threads. Just my 2 cents (:

Renato M.

2012/5/2 Lewis John Mcgibbney <[email protected]>:
> Hello Everyone,
>
> Just to add to this thread, it became plain obvious to me that
> improvements/simplifications could be made here when I ran into
> problems when trying to address the issue with gora-cassandra tests.
> HBase store seems to use Configuration and hbase-site.xml, Cassandra
> store needs to use properties and cassandra.yaml, Sql store needs to
> use properties etc.
>
> I see Ferdy's suggestion to deprecate the properties object moving
> forward, instead making the Gora API prefer Configuration. Although my
> knowledge regarding Pig etc is limited at this stage, I can only
> assume that this also ties in with our future vision to improve the
> analysis aspect of the Gora API.
>
> Thanks
>
> On Tue, May 1, 2012 at 12:16 PM, Ferdy Galema <[email protected]> 
> wrote:
>> Hi,
>>
>> While Lewis and I were discussing over NUTCH-1205, we identified
>> the Properties object as the major source of trouble/confusion when
>> configuring datastores. First and foremost, it's makes no sense that we
>> have 2 ways to configure a store, namely via Configuration and Properties.
>> Besides this there seems to be some trouble with the serialization and
>> initialization of the stores. (Sometimes runtime Properties settings are
>> not correctly used). We have a few ways to solve this problem:
>>
>> -Stop supporting the adding of dynamic properties (runtime settings) and
>> only support the static gora.properties file. People wanting to use runtime
>> props shall use Configuration somehow.
>> -Completely remove the Properties object from Gora altogether. Migrate
>> existing properties to Configuration.
>> -Trying to make both Properties and Configuration work (that's what the
>> current direction seems to be). Difficult it seems.
>> -Something else?
>>
>> I think the second option is best. The advantage of Configuration is that
>> it inheritently works with mapreduce because it is automatically
>> (de)serialized and available in the mappers and reducers. What do you think?
>>
>> Ferdy.
>
>
>
> --
> Lewis

Reply via email to