[ 
https://issues.apache.org/jira/browse/PHOENIX-877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13990955#comment-13990955
 ] 

Sean Huo commented on PHOENIX-877:
----------------------------------

I still see the exception when i run a distinct count of a large size. I am 
using phoenix-3.0.0 release build.
java.lang.RuntimeException: native snappy library not available
        at 
org.apache.hadoop.io.compress.SnappyCodec.getDecompressorType(SnappyCodec.java:189)
        at 
org.apache.hadoop.io.compress.CodecPool.getDecompressor(CodecPool.java:125)
        at 
org.apache.hadoop.hbase.io.hfile.Compression$Algorithm.getDecompressor(Compression.java:331)
        at 
org.apache.phoenix.expression.aggregator.DistinctValueWithCountClientAggregator.aggregate(DistinctValueWithCountClientAggregator.java:66)
        at 
org.apache.phoenix.expression.aggregator.ClientAggregators.aggregate(ClientAggregators.java:63)
        at 
org.apache.phoenix.iterate.GroupedAggregatingResultIterator.next(GroupedAggregatingResultIterator.java:75)
        at 
org.apache.phoenix.iterate.UngroupedAggregatingResultIterator.next(UngroupedAggregatingResultIterator.java:39)
        at 
org.apache.phoenix.jdbc.PhoenixResultSet.next(PhoenixResultSet.java:732)

Why is it so hard to write a unit test case for this?

To simulate a large query size, all we need to do is to set 
QueryServices.DISTINCT_VALUE_COMPRESS_THRESHOLD_ATTRIB to a small value,
the default value for the property is 1024 * 1024 ( 1MB).


> Snappy native library is not available
> --------------------------------------
>
>                 Key: PHOENIX-877
>                 URL: https://issues.apache.org/jira/browse/PHOENIX-877
>             Project: Phoenix
>          Issue Type: Bug
>    Affects Versions: 3.0.0, 4.0.0
>            Reporter: alex kamil
>            Assignee: Mujtaba Chohan
>            Priority: Minor
>
> still getting this error with the most recent phoenix v3.0  (i think it has 
> been fixed in 2.2.3)
> "Snappy native library is not available" when running SELECT DISTINCT on 
> large table (>300k rows) in sqlline, on linux 64bit (intel) 
> in order to fix had to add to incubator-phoenix/bin/sqlline.py:
>     ' -Djava.library.path= /var/lib/hadoop/lib/native/Linux-amd64-64'+\
> snappy binaries were installed 
> sudo yum install snappy snappy-devel
> ln -sf /usr/lib64/libsnappy.so /var/lib/hadoop/lib/native/Linux-amd64-64/.
> ln -sf /usr/lib64/libsnappy.so /var/lib/hbase/lib/native/Linux-amd64-64/.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to