[ 
https://issues.apache.org/jira/browse/TAJO-1555?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jihoon Son updated TAJO-1555:
-----------------------------
    Summary: Cleanup duplicated code of python functions  (was: Duplicated code 
cleanup)

> Cleanup duplicated code of python functions
> -------------------------------------------
>
>                 Key: TAJO-1555
>                 URL: https://issues.apache.org/jira/browse/TAJO-1555
>             Project: Tajo
>          Issue Type: Task
>          Components: function/udf
>            Reporter: Jihoon Son
>
> I'm working on supporting Python UDFs in TAJO-1344. This is still a prototype 
> and has some problems. One of them is related to the 
> serialization/deserialization protocol. For ease of implementation, I simply 
> used the CSV format to serialize/deserialize tuples. To do so, I copied a 
> chunk of code from the tajo-storage package. This duplication creates a 
> maintenance burden, in addition to the problem of low performance.
> To clean up the duplicated code, I think that we should use a well-known 
> serialization/deserialization protocol such as Protocol Buffers.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
