Hi All,
I am using HBase for storing the data, which is around 100 GBs in an HBase
Table, which I am trying to query.
Also, there are around 100 Regions, for the table, each region having 1 GB
of Data. The region, I am trying to query is around 850 mb. But when I do a
scan on the part of the key using the Prefix Filter, it is taking more than
10 mins. Please find below the query result.
Could anybody help me answering the question, why is the scan taking so much
time????
======================================================
hbase(main):005:0> scan 'store',
{FILTER=>org.apache.hadoop.hbase.filter.PrefixFilter.new(org.apache.hadoop.hbase.util.Bytes.toBytes('U_mykey|AABrCwADAAABP/IQXolH'))}
ROW COLUMN+CELL
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in
[jar:file:/usr/lib/hbase/lib/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: Found binding in
[jar:file:/usr/lib/hadoop/lib/slf4j-log4j12-1.4.3.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: Found binding in
[jar:file:/usr/lib/zookeeper/lib/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an
explanation.
U_mykey|AABrCwADAAABP/IQXolH.aliases
column=datafile:value, timestamp=1394275248787, value=[PHONE|991xxxxxx9]
U_mykey|AABrCwADAAABP/IQXolH.birthdate
column=datafile:value, timestamp=1393418222358, value=null
U_mykey|AABrCwADAAABP/IQXolH.created
column=datafile:value, timestamp=1392098938946, value=1392098938955
U_mykey|AABrCwADAAABP/IQXolH.email
column=datafile:value, timestamp=1393418222292, value=null
U_mykey|AABrCwADAAABP/IQXolH.fname
column=datafile:value, timestamp=1393418222303, value=null
U_mykey|AABrCwADAAABP/IQXolH.friends
column=datafile:value, timestamp=1374156544788, value=[]
U_mykey|AABrCwADAAABP/IQXolH.gender
column=datafile:value, timestamp=1393418221863, value=null
U_mykey|AABrCwADAAABP/IQXolH.image
column=datafile:value, timestamp=1393418222313, value=null
U_mykey|AABrCwADAAABP/IQXolH.lname
column=datafile:value, timestamp=1393418221873, value=null
U_mykey|AABrCwADAAABP/IQXolH.mname
column=datafile:value, timestamp=1393418222344, value=null
U_mykey|AABrCwADAAABP/IQXolH.nick
column=datafile:value, timestamp=1393418221795, value=null
U_mykey|AABrCwADAAABP/IQXolH.nid
column=datafile:value, timestamp=1393418221809,
value=mykey|AABrCwADAAABP/IQXolH
U_mykey|AABrCwADAAABP/IQXolH.phone
column=datafile:value, timestamp=1384419197956, value=991xxxxxx9
U_mykey|AABrCwADAAABP/IQXolH.phone2
column=datafile:value, timestamp=1393510696484, value=953xxxxxx5
U_mykey|AABrCwADAAABP/IQXolH.token
column=datafile:value, timestamp=1392135639003, value=[]
15 row(s) in 880.4260 seconds
======================================================
Regards,
Parkirat
--
View this message in context:
http://apache-hbase.679495.n3.nabble.com/HBase-Rowkey-Scan-Taking-more-than-10-minutes-tp4056816.html
Sent from the HBase User mailing list archive at Nabble.com.