[jira] [Updated] (CARBONDATA-3234) Unable to read data from carbondata table stored in S3 with Presto running on EMR

charles horrell (JIRA) Mon, 07 Jan 2019 07:20:40 -0800


     [ 
https://issues.apache.org/jira/browse/CARBONDATA-3234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


charles horrell updated CARBONDATA-3234:
----------------------------------------
    Description: 
We are unable to use presto to query a carbondata table stored in S3.
{code:java}
presto:default> select count(*) from test_table;
Query 20190107_135333_00026_8r2c8 failed: tried to access method 
org.apache.hadoop.metrics2.lib.MutableCounterLong.<init>(Lorg/apache/hadoop/metrics2/MetricsInfo;J)V
 from class org.apache.hadoop.fs.s3a.S3AInstrumentation

 

presto:default> select * from test_table;
Query 20190107_135610_00028_8r2c8 failed: tried to access method 
org.apache.hadoop.metrics2.lib.MutableCounterLong.<init>(Lorg/apache/hadoop/metrics2/MetricsInfo;J)V
 from class org.apache.hadoop.fs.s3a.S3AInstrumentation
{code}
The catalog appears to have been picked up okay as show tables works as 
expected as does describing the table it is just when actually trying to access 
the data that we see the error.

We configured presto as per the examples here: 

[http://carbondata.apache.org/quick-start-guide.html]

Querying from Spark works okay however it is vital for our use case that presto 
also works and with S3.

Amazon EMR version 5.19
 Spark 2.3.2
 Hadoop 2.8.5
 Presto 0.212

  was:
We are unable to use presto to query a carbondata table stored in S3.
{code:java}
presto:default> select count(*) from test_table;
Query 20190107_135333_00026_8r2c8 failed: tried to access method 
org.apache.hadoop.metrics2.lib.MutableCounterLong.<init>(Lorg/apache/hadoop/metrics2/MetricsInfo;J)V
 from class org.apache.hadoop.fs.s3a.S3AInstrumentation

 

presto:default> select * test_table;
Query 20190107_135610_00028_8r2c8 failed: tried to access method 
org.apache.hadoop.metrics2.lib.MutableCounterLong.<init>(Lorg/apache/hadoop/metrics2/MetricsInfo;J)V
 from class org.apache.hadoop.fs.s3a.S3AInstrumentation
{code}
The catalog appears to have been picked up okay as show tables works as 
expected as does describing the table it is just when actually trying to access 
the data that we see the error.

We configured presto as per the examples here: 

[http://carbondata.apache.org/quick-start-guide.html]

Querying from Spark works okay however it is vital for our use case that presto 
also works and with S3.

Amazon EMR version 5.19
 Spark 2.3.2
 Hadoop 2.8.5
 Presto 0.212


> Unable to read data from carbondata table stored in S3 with Presto running on 
> EMR
> ---------------------------------------------------------------------------------
>
>                 Key: CARBONDATA-3234
>                 URL: https://issues.apache.org/jira/browse/CARBONDATA-3234
>             Project: CarbonData
>          Issue Type: Bug
>          Components: presto-integration
>         Environment: Amazon EMR 5.19
>            Reporter: charles horrell
>            Priority: Major
>
> We are unable to use presto to query a carbondata table stored in S3.
> {code:java}
> presto:default> select count(*) from test_table;
> Query 20190107_135333_00026_8r2c8 failed: tried to access method 
> org.apache.hadoop.metrics2.lib.MutableCounterLong.<init>(Lorg/apache/hadoop/metrics2/MetricsInfo;J)V
>  from class org.apache.hadoop.fs.s3a.S3AInstrumentation
>  
> presto:default> select * from test_table;
> Query 20190107_135610_00028_8r2c8 failed: tried to access method 
> org.apache.hadoop.metrics2.lib.MutableCounterLong.<init>(Lorg/apache/hadoop/metrics2/MetricsInfo;J)V
>  from class org.apache.hadoop.fs.s3a.S3AInstrumentation
> {code}
> The catalog appears to have been picked up okay as show tables works as 
> expected as does describing the table it is just when actually trying to 
> access the data that we see the error.
> We configured presto as per the examples here: 
> [http://carbondata.apache.org/quick-start-guide.html]
> Querying from Spark works okay however it is vital for our use case that 
> presto also works and with S3.
> Amazon EMR version 5.19
>  Spark 2.3.2
>  Hadoop 2.8.5
>  Presto 0.212



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

[jira] [Updated] (CARBONDATA-3234) Unable to read data from carbondata table stored in S3 with Presto running on EMR

Reply via email to