subject:"\[Discussion\]read latest schema in case of external table and file format"

Re: [Discussion]read latest schema in case of external table and file format

2019-02-04 Thread akashrn5

Hi rahul, Actually we are not skipping the old file, currently we are just listing the carbondata files in the location and then take first one to infer the schema, but now i just take the latest carbon data file to infer schema, and while giving the data, if the column is not present in

Re: [Discussion]read latest schema in case of external table and file format

2019-02-04 Thread akashrn5

Hi Liang, When we create a table using location in file format case or when i create an external table from a location, user can place multiple carbondata files with different schema in that location and want to read the data at once, in that scenario we can expect the above condition. So

Re: [Discussion]read latest schema in case of external table and file format

2019-02-04 Thread Rahul Kumar

hi akash, I have one concern related to this change: *Concern*: why we are skipping old datafile? if user is not giving the schema then also i think we should read old data file. we can fill column *d* and *e* with *None* value. i guess *if data file is present at given location it

Re: [Discussion]read latest schema in case of external table and file format

2019-02-04 Thread Liang Chen

Hi Can you explain which scenario will generate two carbondata files with different schema? Regards Liang akashrn5 wrote > Hi dev, > > Currently we have a validation that if there are two carbondata files in a > location with different schema, then we fail the query. I think there is > no >