Re: Delta Index

2014-12-17 Thread Dominik Wagenknecht
Hej,

I may be completely wrong, but IMHO this is not a good approach to syncing
data. A real solution for an Elasticsearch river would be to hook into
HBase cluster replication [1] rather than polling the dataset with
select *. That would be a lot more work, but it would be a real push.

*Maybe* the river on GitHub could be made smarter once native HBase
timestamps are exposed in Phoenix somehow. Each cell in HBase is fully
timestamped (and Phoenix doesn't mess with that, so it's always the time of
the last UPSERT), so this could be an approach. But in contrast to cluster
replication, this is still polling and will be slow on big datasets.
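
A minimal sketch of what such timestamp-based polling could look like, in
plain Python. Everything here is an assumption for illustration only: the
`last_modified` field stands in for the HBase cell timestamp (which Phoenix
would first have to expose), `Row` and `poll_delta` are hypothetical names,
and the in-memory list stands in for test.orders. It is not how the river
currently works.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Row:
    order_id: int
    status: str
    last_modified: int  # hypothetical stand-in for the HBase cell timestamp (ms)


def poll_delta(rows, watermark):
    """Return the rows modified after `watermark`, plus the new watermark."""
    changed = [r for r in rows if r.last_modified > watermark]
    # Keep the old watermark when nothing changed since the last poll.
    new_watermark = max((r.last_modified for r in changed), default=watermark)
    return changed, new_watermark


# In-memory stand-in for the table; a real river would run a query instead.
table = [
    Row(1, "NEW", 1000),
    Row(2, "NEW", 1500),
]

first, wm = poll_delta(table, watermark=0)      # initial load: every row is "new"
table[0] = Row(1, "SHIPPED", 2000)              # simulate an UPSERT on order 1
second, wm2 = poll_delta(table, watermark=wm)   # only the updated row comes back
```

Note that the sketch still looks at every row on every poll, which is the
"slow on big datasets" part, and it has no way of noticing deleted rows.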

[1] http://hbase.apache.org/book.html#cluster_replication

Regards


On Thu, Dec 18, 2014 at 6:51 AM, Subacini B subac...@gmail.com wrote:

 Hi,

 http://lessc0de.github.io/connecting_hbase_to_elasticsearch.html

 curl -XPUT 'localhost:9200/_river/phoenix_jdbc_river/_meta' -d '{ "type"
 : "jdbc", "jdbc" : { "url" : "jdbc:phoenix:localhost", "user" : "",
 "password" : "", "sql" : "select * from test.orders" } }'

 I followed the steps and was able to successfully index the data from
 HBase into Elasticsearch. But if I update records, the changes are not
 reflected in Elasticsearch. Is it possible to automatically sync
 updated/changed data?

 Thanks



Re: Phoenix Input Format for Map Reduce

2014-11-14 Thread Dominik Wagenknecht
Yes, thanks! Up until now we just used the standard input formats together
with the type converters inside Phoenix core :-)

Sent from mobile device

 On 14.11.2014 at 09:13, Gagan Agrawal agrawalga...@gmail.com wrote:
 
 Thanks Ravi for providing required details.
 
 Gagan
 
 On Fri, Nov 14, 2014 at 1:24 PM, Ravi Kiran maghamraviki...@gmail.com 
 wrote:
 Hi Gagan,
 
 MR support is a work in progress. We are working on it, and the first draft
 of the code is at https://github.com/apache/phoenix/pull/23 . A preview of
 the API is available at https://gist.github.com/mravi/501fa00b942764eb0dca.
 I have also created a new JIRA ticket for this:
 https://issues.apache.org/jira/browse/PHOENIX-1454
 
 Regarding the PhoenixInputFormat of the phoenix-pig module: you will need
 to manually set a few configuration parameters such as the table name,
 columns, etc. Integrating MR with the PhoenixInputFormat of the pig module
 isn't smooth without a lot of customization.
 
 Regards
 Ravi
 
 On Thu, Nov 13, 2014 at 11:18 PM, Gagan Agrawal agrawalga...@gmail.com 
 wrote:
 
 Hi,
 I was looking for a Phoenix input format to read from Phoenix-powered HBase
 in my MapReduce jobs. I came across the following project, which provides a
 Pig implementation of a Phoenix input format.
 
 https://github.com/apache/phoenix/tree/master/phoenix-pig/src/main/java/org/apache/phoenix/pig/hadoop
 
 I wanted to understand whether there is anything specific to Pig in this,
 or whether we can use it in MapReduce as well. Will I need to do any
 customization to get it to work with MR? Is it stable, and has it been used
 in production? Can I use it with the latest Phoenix version, or is only a
 specific version supported?
 
 Thanks and Regards,
 Gagan Agrawal