[ 
https://issues.apache.org/jira/browse/TAJO-736?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13959651#comment-13959651
 ] 

Jihoon Son commented on TAJO-736:
---------------------------------

[~hyunsik] and [~jhkim],
first of all, appreciate for your efforts.
These documents will be very useful and helpful to Tajo users.

Documents look nice, but I have some simple suggestions.
* In CSV, there are some characters which are forbidden for delimiters. For 
example, the line feed (\n) cannot be used as the delimiter, because it is used 
to distinguish each line. It would be great to add some descriptions about this.
* In RCFile, you may miss to put a period at the end of the first paragraph. 
* In Parquet, it would be great to add an example of DDL that creates a table 
with compression.
* In Column Partitioning, the "Todo" section should be removed. Also, I think 
that there is a compatibility issue with Hive. For example, can Tajo directly 
read partitioned tables of Hive? Whether it can or cannot, it would be better 
to add a simple description of the compatibility. 

In addition, I think that [~davidzchen]'s review will be very helpful for the 
Parquet document.
[~davidzchen], would you mind reviewing the Parquet document, please?

Best regards,
Jihoon Son

> Add table management documentation
> ----------------------------------
>
>                 Key: TAJO-736
>                 URL: https://issues.apache.org/jira/browse/TAJO-736
>             Project: Tajo
>          Issue Type: Sub-task
>          Components: documentation
>            Reporter: Hyunsik Choi
>            Assignee: Hyunsik Choi
>             Fix For: 0.8-incubating, 1.0-incubating
>
>         Attachments: TAJO-736.patch
>
>
> Jinho and I wrote some user documentations for file formats. This patch 
> contains documentations for CSV file, RCFile, and Parquet file.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to