Hello Luke, Jamie, others. Sorry I missed this discussion; here's (as best I can) answers to a few of the previous questions about the contributed indexing:
0. Work has been slow on hbase-trx (which includes both transaction and indexing contrib) due to recent changes here. I'm doing my best to dive in/catch up. Definitely fork it, download it, file issues, and the usual. Best way to ping us directly is via github or [email protected] - I do try and monitor the hbase lists, but (obviously) don't catch everything. 1. The current code at hbase-trx relies on an in-progress patch to this issue: https://issues.apache.org/jira/browse/HBASE-2641 Which factors log splitting out of HLog, and adds instance methods (which we need to override for the transactional log). This is only a requirement for the transactional aspect and indexing should work using calls to the existing static methods in trunk, but I have not tested this approach. If this would be of immediate use, I could probably push a branch to github in pretty short order; ping me directly if so. (I'd be happy to take a patch, too) 2. Controlling keys: you should be able to implement IndexKeyGenerator for a significant degree of control over, er, key generation. The default implementation (SimpleIndexKeyGenerator) functions as Samuru describes here: http://search-hadoop.com/m/U1H6O1M6dpr/Secondary+indexes+in+0.89 best regards, Matthew Smillie Troove Technologies, Canada.
