[ https://issues.apache.org/jira/browse/HIVE-17984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Syam updated HIVE-17984: ------------------------ Summary: getMaxLength is not returning the correct lengths for Char/Varchar types while reading the ORC file from WebHDFS file system (was: getMaxLength is not returning the previously set length in ORC file) > getMaxLength is not returning the correct lengths for Char/Varchar types > while reading the ORC file from WebHDFS file system > ---------------------------------------------------------------------------------------------------------------------------- > > Key: HIVE-17984 > URL: https://issues.apache.org/jira/browse/HIVE-17984 > Project: Hive > Issue Type: Bug > Components: Hive, ORC > Environment: tested it against hive-exec 2.1 > Reporter: Syam > Original Estimate: 24h > Remaining Estimate: 24h > > getMaxLength is not returning the correct length for char/varchar datatypes. > I see that getMaxLength is returning 255 for CHAR type and 65535 for VARCHAR > type. > When I checked the same file using orcfiledump utility, I could see the > correct lengths. > Here is the snippet of the code: > Reader _reader = OrcFile.createReader(new > Path(_fileName),OrcFile.readerOptions(conf).filesystem(fs)) ; > TypeDescription metarec = _reader.getSchema() ; > List <TypeDescription> cols = metarec.getChildren(); > List <String> colNames = metarec.getFieldNames(); > for (int i=0; i < cols.size(); i++) > { > TypeDescription fieldSchema = cols.get(i); > switch (fieldSchema.getCategory()) > { > case CHAR: > header += "char(" + fieldSchema.getMaxLength() + ")" ; > break; > ---------- > ---------- > } > } > Please let me know your pointers please. -- This message was sent by Atlassian JIRA (v6.4.14#64029)