[ https://issues.apache.org/jira/browse/CARBONDATA-3234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
charles horrell updated CARBONDATA-3234:
----------------------------------------
    Description:
We are unable to use presto to query a carbondata table stored in S3.
{code:java}
presto:default> select count(*) from test_table;
Query 20190107_135333_00026_8r2c8 failed: tried to access method org.apache.hadoop.metrics2.lib.MutableCounterLong.<init>(Lorg/apache/hadoop/metrics2/MetricsInfo;J)V from class org.apache.hadoop.fs.s3a.S3AInstrumentation

presto:default> select * from test_table;
Query 20190107_135610_00028_8r2c8 failed: tried to access method org.apache.hadoop.metrics2.lib.MutableCounterLong.<init>(Lorg/apache/hadoop/metrics2/MetricsInfo;J)V from class org.apache.hadoop.fs.s3a.S3AInstrumentation
{code}
The catalog appears to have been picked up okay as show tables works as expected as does describing the table it is just when actually trying to access the data that we see the error.

We configured presto as per the examples here:
[http://carbondata.apache.org/quick-start-guide.html]

Querying from Spark works okay however it is vital for our use case that presto also works and with S3.

Amazon EMR version 5.19
Spark 2.3.2
Hadoop 2.8.5
Presto 0.212

  was:
We are unable to use presto to query a carbondata table stored in S3.
{code:java}
presto:default> select count(*) from test_table;
Query 20190107_135333_00026_8r2c8 failed: tried to access method org.apache.hadoop.metrics2.lib.MutableCounterLong.<init>(Lorg/apache/hadoop/metrics2/MetricsInfo;J)V from class org.apache.hadoop.fs.s3a.S3AInstrumentation

presto:default> select * test_table;
Query 20190107_135610_00028_8r2c8 failed: tried to access method org.apache.hadoop.metrics2.lib.MutableCounterLong.<init>(Lorg/apache/hadoop/metrics2/MetricsInfo;J)V from class org.apache.hadoop.fs.s3a.S3AInstrumentation
{code}
The catalog appears to have been picked up okay as show tables works as expected as does describing the table it is just when actually trying to access the data that we see the error.

We configured presto as per the examples here:
[http://carbondata.apache.org/quick-start-guide.html]

Querying from Spark works okay however it is vital for our use case that presto also works and with S3.

Amazon EMR version 5.19
Spark 2.3.2
Hadoop 2.8.5
Presto 0.212

> Unable to read data from carbondata table stored in S3 with Presto running on
> EMR
> ---------------------------------------------------------------------------------
>
>          Key: CARBONDATA-3234
>          URL: https://issues.apache.org/jira/browse/CARBONDATA-3234
>      Project: CarbonData
>   Issue Type: Bug
>   Components: presto-integration
>  Environment: Amazon EMR 5.19
>     Reporter: charles horrell
>     Priority: Major
>
> We are unable to use presto to query a carbondata table stored in S3.
> {code:java}
> presto:default> select count(*) from test_table;
> Query 20190107_135333_00026_8r2c8 failed: tried to access method
> org.apache.hadoop.metrics2.lib.MutableCounterLong.<init>(Lorg/apache/hadoop/metrics2/MetricsInfo;J)V
> from class org.apache.hadoop.fs.s3a.S3AInstrumentation
>
> presto:default> select * from test_table;
> Query 20190107_135610_00028_8r2c8 failed: tried to access method
> org.apache.hadoop.metrics2.lib.MutableCounterLong.<init>(Lorg/apache/hadoop/metrics2/MetricsInfo;J)V
> from class org.apache.hadoop.fs.s3a.S3AInstrumentation
> {code}
> The catalog appears to have been picked up okay as show tables works as
> expected as does describing the table it is just when actually trying to
> access the data that we see the error.
> We configured presto as per the examples here:
> [http://carbondata.apache.org/quick-start-guide.html]
> Querying from Spark works okay however it is vital for our use case that
> presto also works and with S3.
> Amazon EMR version 5.19
> Spark 2.3.2
> Hadoop 2.8.5
> Presto 0.212

--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
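
The "tried to access method ... from class ..." wording in the failed queries is the message format of a java.lang.IllegalAccessError. One possible cause, which is an assumption and is not confirmed anywhere in this report, is that the Presto plugin classpath mixes Hadoop jars from different releases, so that S3AInstrumentation calls a MutableCounterLong constructor that is not public in the hadoop-common jar actually loaded. A minimal diagnostic sketch for checking this is shown below. The class name ClasspathCheck and both helper methods are hypothetical names introduced only for illustration; it would need to be run with the same jars that the Presto carbondata plugin directory contains.

```java
import java.lang.reflect.Constructor;
import java.lang.reflect.Modifier;
import java.security.CodeSource;

// Hypothetical diagnostic helper; not part of CarbonData, Presto, or Hadoop.
public class ClasspathCheck {

    // Returns the jar or directory a class was loaded from, if the JVM exposes it.
    public static String codeSourceOf(Class<?> cls) {
        CodeSource src = cls.getProtectionDomain().getCodeSource();
        return src == null ? "(bootstrap or unknown)" : String.valueOf(src.getLocation());
    }

    // True only if a constructor with exactly these parameter types exists and is public.
    public static boolean hasPublicConstructor(Class<?> cls, Class<?>... paramTypes) {
        try {
            Constructor<?> ctor = cls.getDeclaredConstructor(paramTypes);
            return Modifier.isPublic(ctor.getModifiers());
        } catch (NoSuchMethodException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        ClassLoader loader = ClassLoader.getSystemClassLoader();
        try {
            // Class names are taken from the error message in the report.
            Class<?> counter = Class.forName(
                    "org.apache.hadoop.metrics2.lib.MutableCounterLong", false, loader);
            Class<?> info = Class.forName(
                    "org.apache.hadoop.metrics2.MetricsInfo", false, loader);
            Class<?> s3a = Class.forName(
                    "org.apache.hadoop.fs.s3a.S3AInstrumentation", false, loader);

            System.out.println("MutableCounterLong loaded from: " + codeSourceOf(counter));
            System.out.println("S3AInstrumentation loaded from: " + codeSourceOf(s3a));
            // Signature (MetricsInfo, long) corresponds to (Lorg/.../MetricsInfo;J)V.
            System.out.println("Constructor (MetricsInfo, long) is public: "
                    + hasPublicConstructor(counter, info, long.class));
        } catch (ClassNotFoundException e) {
            System.out.println("Hadoop class not on classpath: " + e.getMessage());
        }
    }
}
```

If the two classes turn out to be loaded from jars of different Hadoop versions, or the constructor is reported as not public, that would be consistent with the version-mismatch assumption above; it would not by itself prove that this is the cause of the failure on EMR.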