On Wed, Feb 22, 2012 at 8:40 AM, Jacques <[email protected]> wrote:
> We have a crawl table and here are a couple quick thoughts:
>
> - I'd suggest that you use reverse url as your primary key.  Specifically,
> reversed host name but normal path and query string.

Maybe this utility in hbase helps do what Jacque suggests above?
http://hbase.apache.org/apidocs/org/apache/hadoop/hbase/util/Keying.html#createKey(java.lang.String)

St.Ack

Reply via email to