[ 
https://issues.apache.org/jira/browse/DRILL-5590?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16052536#comment-16052536
 ] 

ASF GitHub Bot commented on DRILL-5590:
---------------------------------------

GitHub user paul-rogers opened a pull request:

    https://github.com/apache/drill/pull/855

    DRILL-5590: Bugs in CSV field matching, null columns

    Please see the problem and solution descriptions in DRILL-5590.
    
    Also cleaned up some dead code left over from DRILL-5498.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/paul-rogers/drill DRILL-5590

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/drill/pull/855.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #855
    
----
commit e2a234776817933467400116ef2987db965ce479
Author: Paul Rogers <prog...@maprtech.com>
Date:   2017-06-16T05:46:56Z

    DRILL-5590: Bugs in CSV field matching, null columns
    
    Please see the problem and solution descriptions in DRILL-5590.
    
    Also cleaned up some dead code left over from DRILL-5498.

----


> Drill return IndexOutOfBoundsException when a (Text) file > 4096 rows
> ---------------------------------------------------------------------
>
>                 Key: DRILL-5590
>                 URL: https://issues.apache.org/jira/browse/DRILL-5590
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Storage - Text & CSV
>    Affects Versions: 1.10.0
>         Environment: OS: Oracle Linux Enterprise 7, OSX 10.10.1
> JVM: 1.8
> Drill Installation type: Embebed or distributed(Cluster 2 Nodes)
>            Reporter: Victor Garcia
>            Assignee: Paul Rogers
>         Attachments: xaa_19.txt
>
>
> I describe below, the storage (name lco):
> {
>   "type": "file",
>   "enabled": true,
>   "connection": "file:///",
>   "config": null,
>   "workspaces": {
>     "root": {
>       "location": "/data/source/lco",
>       "writable": false,
>       "defaultInputFormat": "psv"
>     }
>   },
>   "formats": {
>     "psv": {
>       "type": "text",
>       "extensions": [
>         "txt"
>       ],
>       "extractHeader": true,
>       "delimiter": "|"
>     }
>   }
> }
> Querying a CSV file with 3 columns and when the file have > 4096 (including 
> the header), Drill return a error, but when i reduce the rows to 4095 rows 
> the query work.
> Query used: (Select count(1) from lco.root.* as lc where lc.rfc like 
> 'CUBA7706%')
> The original file have 35M of rows, but i test reducing the rows until that 
> find the number of rows that produce the error.
> The original source file is in this URL 
> (http://cfdisat.blob.core.windows.net/lco/l_RFC_2017_05_11_2.txt.gz)
> First part of error:
> at 
> org.apache.drill.exec.rpc.user.QueryResultHandler.resultArrived(QueryResultHandler.java:123)
>  [drill-java-exec-1.10.0.jar:1.10.0]
>       at 
> org.apache.drill.exec.rpc.user.UserClient.handle(UserClient.java:343) 
> [drill-java-exec-1.10.0.jar:1.10.0]
>       at org.apache.drill.exec.rpc.user.UserClient.handle(UserClient.java:88) 
> [drill-java-exec-1.10.0.jar:1.10.0]
>       at 
> org.apache.drill.exec.rpc.RpcBus$InboundHandler.decode(RpcBus.java:274) 
> [drill-rpc-1.10.0.jar:1.10.0]
>       at 
> org.apache.drill.exec.rpc.RpcBus$InboundHandler.decode(RpcBus.java:244) 
> [drill-rpc-1.10.0.jar:1.10.0]
>       at 
> io.netty.handler.codec.MessageToMessageDecoder.channelRead(MessageToMessageDecoder.java:89)
>  [netty-codec-4.0.27.Final.jar:4.0.27.Final]
>       at 
> io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:339)
>  [netty-transport-4.0.27.Final.jar:4.0.27.Final]
>       at 
> io.netty.channel.AbstractChannelHandlerContext.fireChannelRead(AbstractChannelHandlerContext.java:324)
>  [netty-transport-4.0.27.Final.jar:4.0.27.Final]
>       at 
> io.netty.handler.timeout.IdleStateHandler.channelRead(IdleStateHandler.java:254)
>  [netty-handler-4.0.27.Final.jar:4.0.27.Final]
>       at 
> io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:339)
>  [netty-transport-4.0.27.Final.jar:4.0.27.Final]
>       at 
> io.netty.channel.AbstractChannelHandlerContext.fireChannelRead(AbstractChannelHandlerContext.java:324)
>  [netty-transport-4.0.27.Final.jar:4.0.27.Final]
>       at 
> io.netty.handler.codec.MessageToMessageDecoder.channelRead(MessageToMessageDecoder.java:103)
>  [netty-codec-4.0.27.Final.jar:4.0.27.Final]
>       at 
> io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:339)
>  [netty-transport-4.0.27.Final.jar:4.0.27.Final]
>       at 
> io.netty.channel.AbstractChannelHandlerContext.fireChannelRead(AbstractChannelHandlerContext.java:324)
>  [netty-transport-4.0.27.Final.jar:4.0.27.Final]
>       at 
> io.netty.handler.codec.ByteToMessageDecoder.channelRead(ByteToMessageDecoder.java:242)
>  [netty-codec-4.0.27.Final.jar:4.0.27.Final]
>       at 
> io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:339)
>  [netty-transport-4.0.27.Final.jar:4.0.27.Final]
>       at 
> io.netty.channel.AbstractChannelHandlerContext.fireChannelRead(AbstractChannelHandlerContext.java:324)
>  [netty-transport-4.0.27.Final.jar:4.0.27.Final]
>       at 
> io.netty.channel.ChannelInboundHandlerAdapter.channelRead(ChannelInboundHandlerAdapter.java:86)
>  [netty-transport-4.0.27.Final.jar:4.0.27.Final]
>       at 
> io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:339)
>  [netty-transport-4.0.27.Final.jar:4.0.27.Final]
>       at 
> io.netty.channel.AbstractChannelHandlerContext.fireChannelRead(AbstractChannelHandlerContext.java:324)
>  [netty-transport-4.0.27.Final.jar:4.0.27.Final]
>       at 
> io.netty.channel.DefaultChannelPipeline.fireChannelRead(DefaultChannelPipeline.java:847)
>  [netty-transport-4.0.27.Final.jar:4.0.27.Final]
>       at 
> io.netty.channel.nio.AbstractNioByteChannel$NioByteUnsafe.read(AbstractNioByteChannel.java:131)
>  [netty-transport-4.0.27.Final.jar:4.0.27.Final]
>       at 
> io.netty.channel.nio.NioEventLoop.processSelectedKey(NioEventLoop.java:511) 
> [netty-transport-4.0.27.Final.jar:4.0.27.Final]
>       at 
> io.netty.channel.nio.NioEventLoop.processSelectedKeysOptimized(NioEventLoop.java:468)
>  [netty-transport-4.0.27.Final.jar:4.0.27.Final]
>       at 
> io.netty.channel.nio.NioEventLoop.processSelectedKeys(NioEventLoop.java:382) 
> [netty-transport-4.0.27.Final.jar:4.0.27.Final]
>       at io.netty.channel.nio.NioEventLoop.run(NioEventLoop.java:354) 
> [netty-transport-4.0.27.Final.jar:4.0.27.Final]
>       at 
> io.netty.util.concurrent.SingleThreadEventExecutor$2.run(SingleThreadEventExecutor.java:111)
>  [netty-common-4.0.27.Final.jar:4.0.27.Final]
>       at java.lang.Thread.run(Thread.java:748) [na:1.8.0_131]
> 2017-06-15 14:45:03,056 [qtp2036240117-58] ERROR 
> o.a.d.e.server.rest.QueryResources - Query from Web UI Failed
> org.apache.drill.common.exceptions.UserRemoteException: SYSTEM ERROR: 
> IndexOutOfBoundsException: index: 16384, length: 4 (expected: range(0, 16384))



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to