Re: HBase Schema for IPTC News ML G2

2014-03-03 Thread James Taylor
Hi Jigar, Take a look at Apache Phoenix: http://phoenix.incubator.apache.org/ It allows you to use SQL to query over your HBase data and supports composite primary keys, so you could create a schema like this: create table news_message(guid varchar not null, version bigint not null,

Re: org.apache.hadoop.hbase.ipc.SecureRpcEngine class not found in HBase jar

2014-03-04 Thread James Taylor
Let's just target your patch for the Phoenix 4.0 release so we can rely on Maven having what we need. Thanks, James On Tue, Mar 4, 2014 at 11:29 AM, anil gupta anilgupt...@gmail.com wrote: Phoenix refers to maven artifact of HBase. If its not in Maven repo of HBase then either we add the

Re: Filters failing to compare negative numbers (int,float,double or long)

2014-03-19 Thread James Taylor
Another option is to use Apache Phoenix ( http://phoenix.incubator.apache.org/) as it takes care of all these details for you automatically. Cheers, James On Wed, Mar 19, 2014 at 7:49 AM, Ted Yu yuzhih...@gmail.com wrote: In 0.96+, extensible data type API is provided. Please take a look at

Re: how to reverse an integer for rowkey?

2014-03-27 Thread James Taylor
Another option is to use Apache Phoenix and let it do these things for you: CREATE TABLE my_table( intField INTEGER, strField VARCHAR, CONSTRAINT pk PRIMARY KEY (intField DESC, strField)); Thanks, James @JamesPlusPlus http://phoenix.incubator.apache.org/ On Thu, Mar

Re: [VOTE] The 4th HBase 0.98.1 release candidate (RC3) is available for download

2014-04-03 Thread James Taylor
I implore you to stick with releasing RC3. Phoenix 4.0 has no release it can currently run on. Phoenix doesn't use SingleColumnValueFilter, so it seems that HBASE-10850 has no impact wrt Phoenix. Can't we get these additional bugs in 0.98.2 - it's one month away [1]? James [1]

Re: [VOTE] The 4th HBase 0.98.1 release candidate (RC3) is available for download

2014-04-03 Thread James Taylor
am find with giving the next RC a bit shorter voting period. Cheers On Thu, Apr 3, 2014 at 8:57 AM, James Taylor jtay...@salesforce.com wrote: I implore you to stick with releasing RC3. Phoenix 4.0 has no release it can currently run on. Phoenix doesn't use SingleColumnValueFilter, so

Re: [VOTE] The 4th HBase 0.98.1 release candidate (RC3) is available for download

2014-04-03 Thread James Taylor
a definitive statement on if a critical/blocker bug exists for Phoenix or not? If not, we have sufficient votes at this point to carry the RC and can go forward with the release at the end of the vote period. On Apr 3, 2014, at 5:57 PM, James Taylor jtay...@salesforce.com wrote: I implore

[ANNOUNCE] Apache Phoenix releases next major version

2014-04-12 Thread James Taylor
The Apache Phoenix team is pleased to announce the release of its next major version (3.0 and 4.0) from the Apache Incubator. Phoenix is a SQL query engine for Apache HBase, a NoSQL data store. It is accessed as a JDBC driver and enables querying and managing HBase tables using SQL. Major new

Re: How to get specified rows and avoid full table scanning?

2014-04-21 Thread James Taylor
Tao, Just wanted to give you a couple of relevant pointers to Apache Phoenix for your particular problem: - Preventing hotspotting by salting your table: http://phoenix.incubator.apache.org/salted.html - Pig Integration for your map/reduce job:

Re: How to implement sorting in HBase scans for a particular column

2014-04-29 Thread James Taylor
Hi Vikram, I see you sent the Phoenix mailing list back in Dec a question on how to use Phoenix 2.1.2 with Hadoop 2 for HBase 0.94. Looks like you were having trouble building Phoenix with the hadoop2 profile. In our 3.0/4.0 we bundle the phoenix jars pre-built with both hadoop1 and hadoop2, so

Re: Questions on FuzzyRowFilter

2014-05-16 Thread James Taylor
Hi Mike, I agree with you - the way you've outlined is exactly the way Phoenix has implemented it. It's a bit of a problem with terminology, though. We call it salting: http://phoenix.incubator.apache.org/salted.html. We hash the key, mod the hash with the SALT_BUCKET value you provide, and

Re: Prefix salting pattern

2014-05-17 Thread James Taylor
No, there's nothing wrong with your thinking. That's exactly what Phoenix does - use the modulo of the hash of the key. It's important that you can calculate the prefix byte so that you can still do fast point lookups. Using a modulo that's bigger than the number of region servers can make sense

Re: Questions on FuzzyRowFilter

2014-05-18 Thread James Taylor
in the first place and just store the index… ;-) (Yes, I thought about this too.) -Mike On May 16, 2014, at 7:50 PM, James Taylor jtay...@salesforce.com wrote: Hi Mike, I agree with you - the way you've outlined is exactly the way Phoenix has implemented it. It's a bit of a problem

Re: Questions on FuzzyRowFilter

2014-05-18 Thread James Taylor
when you say salt. On May 18, 2014, at 7:16 PM, James Taylor jtay...@salesforce.com wrote: @Mike, The biggest problem is you're not listening. Please actually read my response (and you'll understand the what we're calling salting is not a random seed). Phoenix already has secondary

Re: Questions on FuzzyRowFilter

2014-05-18 Thread James Taylor
://phoenix.incubator.apache.org/secondary_indexing.html Thanks, James On Sun, May 18, 2014 at 1:56 PM, James Taylor jtay...@salesforce.comwrote: The top two hits when you Google for HBase salt are - Sematext blog describing salting as I described it in my email - Phoenix blog again describing salting

Re: Prefix salting pattern

2014-05-18 Thread James Taylor
@Software Dev - might be feasible to implement a Thrift client that speaks Phoenix JDBC. I believe this is similar to what Hive has done. Thanks, James On Sun, May 18, 2014 at 1:19 PM, Mike Axiak m...@axiak.net wrote: In our measurements, scanning is improved by performing against n range

Re: hbase key design to efficient query on base of 2 or more column

2014-05-19 Thread James Taylor
If you use Phoenix, queries would leverage our Skip Scan: http://phoenix-hbase.blogspot.com/2013/05/demystifying-skip-scan-in-phoenix.html Assuming a row key made up of a low cardinality first value (like a byte representing an enum), followed by a high cardinality second value (like a date/time

[ANNOUNCE] Apache Phoenix has graduated as a top level project

2014-05-22 Thread James Taylor
I'm pleased to announce that Apache Phoenix has graduated from the incubator to become a top level project. Thanks so much for all your help and support - we couldn't have done it without the fantastic HBase community! We're looking forward to continued collaboration. Regards, The Apache Phoenix

Re: Copy some records from Huge hbase table to another table

2014-05-23 Thread James Taylor
Hi Riyaz, You can do this with a single SQL command using Apache Phoenix, a SQL engine on top of HBase, and you'll get better performance than if you hand coded it using the HBase client APIs. Depending on your current schema, you may be able to run this command with no change to your data. Let's

Re: Region not assigned

2014-08-14 Thread James Taylor
On the first connection to the cluster when you've installed Phoenix 2.2.3 and were previously using Phoenix 2.2.2, Phoenix will upgrade your Phoenix tables to use the new coprocessor names (org.apache.phoenix.*) instead of the old coprocessor names (com.salesforce.phoenix.*). Thanks, James On

[ANNOUNCE] Apache Phoenix 3.1 and 4.1 released

2014-09-01 Thread James Taylor
Hello everyone, On behalf of the Apache Phoenix [1] project, a SQL database on top of HBase, I'm pleased to announce the immediate availability of our 3.1 and 4.1 releases [2]. These include many bug fixes along with support for nested/derived tables, tracing, and local indexing. For details of

Re: Connecting Hbase to Elasticsearch with Phoenix

2014-09-10 Thread James Taylor
+1. Thanks, Alex. I added a blog pointing folks there as well: https://blogs.apache.org/phoenix/entry/connecting_hbase_to_elasticsearch_through On Wed, Sep 10, 2014 at 2:12 PM, Andrew Purtell apurt...@apache.org wrote: Thanks for writing in with this pointer Alex! On Wed, Sep 10, 2014 at 11:11

[ANNOUNCE] Apache Phoenix 4.2.2 and 3.2.2 released

2014-12-10 Thread James Taylor
The Apache Phoenix team is pleased to announce the immediate availability of the 4.2.2/3.2.2 release. For details of the release, see our release announcement[1]. The Apache Phoenix team [1] https://blogs.apache.org/phoenix/entry/announcing_phoenix_4_2_2

[ANNOUNCE] Apache Phoenix meetup in SF on Tue, Feb 24th

2015-01-22 Thread James Taylor
I'm excited to announce the first ever Apache Phoenix meetup, hosted by salesforce.com in San Francisco on Tuesday, February 24th @ 6pm. More details here: http://www.meetup.com/San-Francisco-Apache-Phoenix-Meetup/events/220009583/ Please ping me if you're interested in presenting your companies

[ANNOUNCE] Apache Phoenix 4.3 released

2015-02-25 Thread James Taylor
The Apache Phoenix team is pleased to announce the immediate availability of the 4.3 release. Highlights include: - functional indexes [1] - map-reduce over Phoenix tables [2] - cross join support [3] - query hint to force index usage [4] - set HBase properties through ALTER TABLE - ISO-8601 date

[ANNOUNCE] Apache Phoenix 4.5 released

2015-08-05 Thread James Taylor
The Apache Phoenix team is pleased to announce the immediate availability of the 4.5 release with support for HBase 0.98/1.0/1.1. Together with the 4.4 release, highlights include: Spark Integration (4.4) [1] User Defined Functions (4.4) [2] Query Server with thin driver (4.4) [3] Pherf tool for

[ANNOUNCE] PhoenixCon 2016 on Wed, May 25th 9am-1pm

2016-04-26 Thread James Taylor
We invite you to attend the inaugural PhoenixCon on Wed, May 25th 9am-1pm (the day after HBaseCon) hosted by Salesforce.com in San Francisco. There will be two tracks: one for use cases and one for internals. Drop me a note if you're interested in giving a talk. To RSVP and for more details, see

Re: [ANNOUNCE] PhoenixCon 2016 on Wed, May 25th 9am-1pm

2016-04-27 Thread James Taylor
ement for final approval. I am assuming > there is still a slot for my talk in use case srction. I should go ahead > with my approval process. Correct? > > Thanks, > Anil Gupta > Sent from my iPhone > > > On Apr 26, 2016, at 5:56 PM, James Taylor <jamestay...@apache.org &

[ANNOUNCE] PhoenixCon the day after HBaseCon

2016-05-19 Thread James Taylor
The inaugural PhoenixCon will take place 9am-1pm on Wed, May 25th (at Salesforce @ 1 Market St, SF), the day after HBaseCon. We'll have two tracks: one for Apache Phoenix use cases and one for Apache Phoenix internals. To RSVP and for more details see here[1]. We hope you can make it! James

[ANNOUNCE] Apache Phoenix 4.9 released

2016-12-01 Thread James Taylor
Apache Phoenix enables OLTP and operational analytics for Apache Hadoop through SQL support using Apache HBase as its backing store and providing integration with other projects in the ecosystem such as Apache Spark, Apache Hive, Apache Pig, Apache Flume, and Apache MapReduce. We're pleased to

[ANNOUNCE] Apache Phoenix 4.10 released

2017-03-23 Thread James Taylor
The Apache Phoenix team is pleased to announce the immediate availability of the 4.10.0 release. Apache Phoenix enables SQL-based OLTP and operational analytics for Hadoop using Apache HBase as its backing store and providing integration with other projects in the ecosystem such as Spark, Hive,

[ANNOUNCE] PhoenixCon 2017 is a go!

2017-03-15 Thread James Taylor
I'm excited to announce that the 2nd Annual Apache Phoenix conference, PhoenixCon 2017 will take place the day after HBaseCon in San Francisco on Tuesday, June 13th from 10:30am-6pm. For more details, including to RSVP and submit a talk proposal, click here:

[ANNOUNCE] Apache Phoenix 4.11 released

2017-07-07 Thread James Taylor
The Apache Phoenix team is pleased to announce the immediate availability of the 4.11.0 release. Apache Phoenix enables SQL-based OLTP and operational analytics for Apache Hadoop using Apache HBase as its backing store and providing integration with other projects in the Apache ecosystem such as

[ANNOUNCE] Apache Phoenix 4.12 released

2017-10-11 Thread James Taylor
The Apache Phoenix team is pleased to announce the immediate availability of the 4.12.0 release [1]. Apache Phoenix enables SQL-based OLTP and operational analytics for Apache Hadoop using Apache HBase as its backing store and providing integration with other projects in the Apache ecosystem such

[ANNOUNCE] Apache Phoenix 4.13 released

2017-11-11 Thread James Taylor
The Apache Phoenix team is pleased to announce the immediate availability of the 4.13.0 release. Apache Phoenix enables SQL-based OLTP and operational analytics for Apache Hadoop using Apache HBase as its backing store and providing integration with other projects in the Apache ecosystem such as

Re: [ANNOUNCE] Apache Phoenix 4.13 released

2017-11-19 Thread James Taylor
/70cffa798d5f21ef87b02e07aeca8c7982b0b30251411b7be17fadf9@%3Cdev.phoenix.apache.org%3E On Sun, Nov 19, 2017 at 12:23 PM, Kumar Palaniappan < kpalaniap...@marinsoftware.com> wrote: > Are there any plans to release Phoenix 4.13 compatible with HBase 1.2? > > On Sat, Nov 11, 2017 at 5:57 PM, James T

[ANNOUNCE] Apache Phoenix 4.14 released

2018-06-11 Thread James Taylor
The Apache Phoenix team is pleased to announce the immediate availability of the 4.14.0 release. Apache Phoenix enables SQL-based OLTP and operational analytics for Apache Hadoop using Apache HBase as its backing store and providing integration with other projects in the Apache ecosystem such as

Re: [ANNOUNCE] Apache Phoenix 4.13.2 for CDH 5.11.2 released

2018-01-20 Thread James Taylor
On Sat, Jan 20, 2018 at 12:29 PM Pedro Boado wrote: > The Apache Phoenix team is pleased to announce the immediate availability > of the 4.13.2 release for CDH 5.11.2. Apache Phoenix enables SQL-based OLTP > and operational analytics for Apache Hadoop using Apache HBase as

<    1   2