RE: HBase data model question

Hegner, Travis Mon, 24 Aug 2009 13:41:56 -0700

If you need to access the data from either perspective, then you'll probably
have to create two separate tables: one with the indexing as described before,
and one whose row keys look like <word><doc id><pos>, so that you can scan per
word. Your application would have to write to both tables and keep them in
sync itself.
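
A minimal sketch of the two-table idea, not tied to any HBase client API. It
uses an in-memory sorted table as a stand-in for HBase's lexicographically
ordered row keys. The first table's layout (<doc id><pos>) is an assumption,
since the earlier indexing scheme is not quoted in this message; the separator,
the zero-padding width, and all class and function names are hypothetical.

```python
import bisect

SEP = "\x00"  # assumed separator; sorts before any printable character


def doc_key(doc_id, pos):
    # Zero-pad numbers so lexicographic order matches numeric order.
    return f"{doc_id:08d}{SEP}{pos:08d}"


def word_key(word, doc_id, pos):
    # Row key shaped like <word><doc id><pos>, as suggested above.
    return f"{word}{SEP}{doc_id:08d}{SEP}{pos:08d}"


class SortedTable:
    """Toy stand-in for an HBase table: rows are kept sorted by key."""

    def __init__(self):
        self._keys = []
        self._values = {}

    def put(self, key, value):
        if key not in self._values:
            bisect.insort(self._keys, key)
        self._values[key] = value

    def scan_prefix(self, prefix):
        # Start at the first key >= prefix and stop at the first non-match,
        # which is what a start-row/stop-row scan does on sorted keys.
        start = bisect.bisect_left(self._keys, prefix)
        for key in self._keys[start:]:
            if not key.startswith(prefix):
                break
            yield key, self._values[key]


def index_token(by_doc, by_word, doc_id, pos, word):
    # The application writes to both tables; nothing syncs them for you.
    by_doc.put(doc_key(doc_id, pos), word)
    by_word.put(word_key(word, doc_id, pos), "")


def occurrences(by_word, word):
    # Including the separator in the prefix keeps "run" from matching "runner".
    result = []
    for key, _ in by_word.scan_prefix(word + SEP):
        _, doc_id, pos = key.split(SEP)
        result.append((int(doc_id), int(pos)))
    return result
```

With this layout, occurrences(by_word, "run") walks one contiguous range of
rows, and the per-document table can still be scanned by doc id prefix.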


There are some indexing tools available to automate some of this for HBase, but
I'm not very versed in how to use them. I believe one is called "IndexTable".

Maybe someone with more experience there could jump in and offer a possible 
solution?

Travis Hegner
http://www.travishegner.com/

-----Original Message-----
From: llpind [mailto:[email protected]]
Sent: Monday, August 24, 2009 4:30 PM
To: [email protected]
Subject: RE: HBase data model question


Thanks, I think that's a good starting point.  Along the lines I was thinking,
but I couldn't figure out how to get all entries for a given lemma (not by doc
id, WP).  Looking at scanners again to see if I can pull that off.

