[
https://issues.apache.org/jira/browse/DRILL-6737?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16833051#comment-16833051
]
mehran commented on DRILL-6737:
-------------------------------
I checked the problem in 1.16 release and it is not the parquet writer issue.
there is two step at the end of plan that takes each 2 minutes to finish:
PROJECT_ALLOW_DUP, PROJECT.
h3.
{panel:bgColor=#ffffff}
{panel:bgColor=#f5f5f5}
[Overview|http://10.233.50.111:8047/profiles/23329968-0346-e658-b1c9-92fd7dc60d2a#operator-overview]{panel}
{panel}
||Operator ID||Type||Avg Setup Time||Max Setup Time||Avg Process Time||Max
Process Time||Min Wait Time||Avg Wait Time||Max Wait Time||% Fragment Time||%
Query Time||Rows||Avg Peak Memory||Max Peak Memory||
|00-xx-00|SCREEN|0.000s|0.000s|1.262s|2.510s|0.004s|0.075s|0.145s|0.82%|0.82%|111,491|10MB|20MB|
|00-xx-01|PROJECT|0.002s|0.002s|0.001s|0.001s|0.000s|0.000s|0.000s|0.00%|0.00%|1|-|-|
|00-xx-02|PARQUET_WRITER|0.293s|0.293s|50.750s|50.750s|0.000s|0.000s|0.000s|16.44%|16.44%|111,490|-|-|
|00-xx-03|PROJECT_ALLOW_DUP|0.032s|0.032s|2m0s|2m0s|0.000s|0.000s|0.000s|39.01%|39.01%|111,490|13MB|13MB|
|00-xx-04|PROJECT|16.092s|16.092s|2m15s|2m15s|0.000s|0.000s|0.000s|43.73%|43.73%|111,490|13MB|13MB|
{panel}
{panel}
I do not know what these steps do after parquet writer is finished
But it takes strangely long time to run.
> Ctas json to Parquet is very very slow
> --------------------------------------
>
> Key: DRILL-6737
> URL: https://issues.apache.org/jira/browse/DRILL-6737
> Project: Apache Drill
> Issue Type: Bug
> Components: Query Planning & Optimization
> Affects Versions: 1.14.0
> Reporter: mehran
> Assignee: salim achouche
> Priority: Critical
> Attachments: drill.bmp
>
>
> 5 minute takes to insert a json file to parquet, where in 1.13 it takes 10
> seconds. it seems to be a blocker bug.
> In plan it is Parquet writer that takes this duration.
>
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)