[ 
https://issues.apache.org/jira/browse/METAMODEL-82?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14152082#comment-14152082
 ] 

Kasper Sørensen commented on METAMODEL-82:
------------------------------------------

Well yes and no. It's a tough one...

The trouble is that the data here can never be guaranteed to be of any type 
other than string. If you put a number in a CSV file or an Excel sheet, the 
column can only be treated as numeric if every other row also has a number in 
that column. And how do we guarantee that, short of eagerly reading all 
records (which isn't acceptable IMO)?

If we change this, I would suggest making "type detection" optional (and 
disabled by default for compatibility). Then we could eagerly read up to 1000 
records or so and use the schema inference utils to determine a fairly precise 
type.
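
To make the idea concrete, here is a rough sketch of sample-based type 
detection. It is written in Python purely for illustration and does not use 
MetaModel's API or its schema inference utils. The function name, the 
sample_size parameter and the three type labels are all assumptions made for 
this example.

```python
import re
from itertools import islice

# Hypothetical type labels, ordered from narrowest to widest.
TYPES = ["INTEGER", "DOUBLE", "VARCHAR"]

_INT = re.compile(r"[+-]?\d+")
_DECIMAL = re.compile(r"[+-]?(\d+\.\d*|\.\d+|\d+)([eE][+-]?\d+)?")


def _rank(value):
    """Return the index into TYPES of the narrowest type that fits value."""
    if _INT.fullmatch(value):
        return 0
    if _DECIMAL.fullmatch(value):
        return 1
    return 2


def detect_column_types(rows, sample_size=1000):
    """Guess a type per column from at most sample_size rows.

    rows is any iterable of lists of strings (for example a csv.reader).
    Empty cells are ignored. A column with no non-empty cells in the
    sample stays VARCHAR.
    """
    ranks = []  # widest rank seen so far, per column
    seen = []   # whether the column had any non-empty cell
    for row in islice(rows, sample_size):
        # Rows may be ragged, so grow the bookkeeping lists on demand.
        while len(ranks) < len(row):
            ranks.append(0)
            seen.append(False)
        for i, value in enumerate(row):
            value = value.strip()
            if not value:
                continue
            seen[i] = True
            ranks[i] = max(ranks[i], _rank(value))
    return [TYPES[r] if s else "VARCHAR" for r, s in zip(ranks, seen)]
```

Note that this only looks at the sampled rows, so a non-numeric value further 
down the file can still contradict the detected type. That is one reason to 
keep the feature optional and off by default.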

> Metadata type is always VARCHAR for CSV & Excel modules
> -------------------------------------------------------
>
>                 Key: METAMODEL-82
>                 URL: https://issues.apache.org/jira/browse/METAMODEL-82
>             Project: Metamodel
>          Issue Type: Bug
>            Reporter: Alberto Rodriguez
>
> The metadata types fetched from CSV and Excel DataContexts are always VARCHAR 
> although the data of some columns are not a string.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
