anyone explain to me what exactly is needed
>>> to support a new data type in SparkSQL's Parquet storage engine?
>>>
>>> Thanks.
>>>
>>> Alex
>>>
>>> On Mon, Dec 29, 2014 at 10:20 PM, Wang, Daoyuan
>>> wrote:
>>>
>>
29, 2014 at 10:20 PM, Wang, Daoyuan
>> wrote:
>>
>>> By adding a flag in SQLContext, I have modified #3822 to include
>>> nanoseconds now. Since passing too many flags is ugly, now I need the whole
>>> SQLContext, so that we can put more flags there.
>&g
nanoseconds now. Since passing too many flags is ugly, now I need the whole
>> SQLContext, so that we can put more flags there.
>>
>>
>>
>> Thanks,
>>
>> Daoyuan
>>
>>
>>
>> *From:* Michael Armbrust [mailto:mich...@databricks.com]
>> *Se
2014 10:43 AM
> *To:* Alessandro Baretta
> *Cc:* Wang, Daoyuan; dev@spark.apache.org
> *Subject:* Re: Unsupported Catalyst types in Parquet
>
>
>
> Yeah, I saw those. The problem is that #3822 truncates timestamps that
> include nanoseconds.
>
>
>
> On Mon, Dec 29, 2014 a
10:43 AM
To: Alessandro Baretta
Cc: Wang, Daoyuan; dev@spark.apache.org
Subject: Re: Unsupported Catalyst types in Parquet
Yeah, I saw those. The problem is that #3822 truncates timestamps that include
nanoseconds.
On Mon, Dec 29, 2014 at 5:14 PM, Alessandro Baretta
mailto:alexbare...@gmail.com
Yeah, I saw those. The problem is that #3822 truncates timestamps that
include nanoseconds.
On Mon, Dec 29, 2014 at 5:14 PM, Alessandro Baretta
wrote:
> Michael,
>
> Actually, Adrian Wang already created pull requests for these issues.
>
> https://github.com/apache/spark/pull/3820
> https://git
Michael,
Actually, Adrian Wang already created pull requests for these issues.
https://github.com/apache/spark/pull/3820
https://github.com/apache/spark/pull/3822
What do you think?
Alex
On Mon, Dec 29, 2014 at 3:07 PM, Michael Armbrust
wrote:
> I'd love to get both of these in. There is so
I'd love to get both of these in. There is some trickiness that I talk
about on the JIRA for timestamps since the SQL timestamp class can support
nano seconds and I don't think parquet has a type for this. Other systems
(impala) seem to use INT96. It would be great to maybe ask on the parquet
ma
Daoyuan,
Thanks for creating the jiras. I need these features by... last week, so
I'd be happy to take care of this myself, if only you or someone more
experienced than me in the SparkSQL codebase could provide some guidance.
Alex
On Dec 29, 2014 12:06 AM, "Wang, Daoyuan" wrote:
> Hi Alex,
>
>
Hi Alex,
I'll create JIRA SPARK-4985 for date type support in parquet, and SPARK-4987
for timestamp type support. For decimal type, I think we only support decimals
that fits in a long.
Thanks,
Daoyuan
-Original Message-
From: Alessandro Baretta [mailto:alexbare...@gmail.com]
Sent: Sa
10 matches
Mail list logo