[ 
https://issues.apache.org/jira/browse/TAJO-1340?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14740725#comment-14740725
 ] 

ASF GitHub Bot commented on TAJO-1340:
--------------------------------------

Github user hyunsik commented on a diff in the pull request:

    https://github.com/apache/tajo/pull/671#discussion_r39267261
  
    --- Diff: tajo-common/src/main/java/org/apache/tajo/conf/TajoConf.java ---
    @@ -377,6 +381,7 @@ public static int setDateOrder(int dateOrder) {
           // ResultSet 
---------------------------------------------------------
         $RESULT_SET_FETCH_ROWNUM("tajo.resultset.fetch.rownum", 200),
    --- End diff --
    
    When we use compression, ``200`` may be too small to get some performance 
benefits from compression. Later, we may need some experiments to find the best 
row number for compression or non-compression mode.


> Change the default output file format.
> --------------------------------------
>
>                 Key: TAJO-1340
>                 URL: https://issues.apache.org/jira/browse/TAJO-1340
>             Project: Tajo
>          Issue Type: Improvement
>          Components: Java Client, JDBC Driver, Offheap, Storage
>            Reporter: Hyunsik Choi
>            Assignee: Jinho Kim
>             Fix For: 0.11.0, 0.12.0
>
>         Attachments: TAJO-1340.patch, TAJO-1340_2.patch
>
>
> Currently, the default output file is CSV. Due to its nature, CSV has mainly 
> three problems:
>  * Its line or field delimiter can be duplicated to some character included 
> in the result data.
>  * Plan text file is likely to be larger than other file formats.
>  * Its read and write performance is slow.
> We need to change the default output file format into other file formats. We 
> also need to investigate which file format is the best for it.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to