[ https://issues.apache.org/jira/browse/DRILL-6970?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16741288#comment-16741288 ]
jean-claude commented on DRILL-6970: ------------------------------------ I fixed the issue by using the setSafe method instead of the set. {code:java} private static class VarCharDefn extends ColumnDefn { private NullableVarCharVector.Mutator mutator; @Override public void load(int rowIndex, String value) { byte[] bytes = value.getBytes(); //mutator.setS(rowIndex, value.getBytes()); mutator.setSafe(rowIndex, bytes, 0, bytes.length); } } {code} I will create a pull request soon. > LogRegex format plugin > ---------------------- > > Key: DRILL-6970 > URL: https://issues.apache.org/jira/browse/DRILL-6970 > Project: Apache Drill > Issue Type: Bug > Affects Versions: 1.15.0 > Reporter: jean-claude > Priority: Major > > The log format plugin does re-allocate the drillbuf when it fills up. You can > query small log files but larger ones will fail with this error: > 0: jdbc:drill:zk=local> select * from dfs.root.`/prog/test.log`; > Error: INTERNAL_ERROR ERROR: index: 32724, length: 108 (expected: range(0, > 32768)) > Fragment 0:0 > Please, refer to logs for more information. > > I'm running drill-embeded. The log storage plugin is configured like so > {code:java} > "log": { > "type": "logRegex", > "regex": "(.+)", > "extension": "log", > "maxErrors": 10, > "schema": [ > { > "fieldName": "line" > } > ] > }, > {code} > The log files is very simple > {code:java} > jdsaljfldaksjfldsajfldasjflkjdsfldsjfljsdalfk > jdsaljfldaksjfldsajfldasjflkjdsfldsjfljsdalfk > jdsaljfldaksjfldsajfldasjflkjdsfldsjfljsdalfk > jdsaljfldaksjfldsajfldasjflkjdsfldsjfljsdalfk > jdsaljfldaksjfldsajfldasjflkjdsfldsjfljsdalfk > jdsaljfldaksjfldsajfldasjflkjdsfldsjfljsdalfk > ...{code} > > -- This message was sent by Atlassian JIRA (v7.6.3#76005)