[ 
https://issues.apache.org/jira/browse/AURORA-1743?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15403639#comment-15403639
 ] 

Stephan Erb commented on AURORA-1743:
-------------------------------------

Do you plan on switching our python client to the new protocol, or are you only 
interested in enabling the server-side support?

> Support Thrift Binary Protocol 
> -------------------------------
>
>                 Key: AURORA-1743
>                 URL: https://issues.apache.org/jira/browse/AURORA-1743
>             Project: Aurora
>          Issue Type: Task
>          Components: Scheduler
>            Reporter: Zameer Manji
>            Assignee: Zameer Manji
>
> Currently the scheduler serves its thrift API over the TJSON protocol in 
> thrift. This has some benefits, as it allows lots of tools to consume the 
> data without understanding the IDL, the wire format is easy to understand and 
> it can power our AJAX UI.
> However the performance of the TJSON protocol in Python is abysmal. On a 
> small cluster (~2k tasks) which uses Docker and stores some metadata per 
> task, and unscoped {{getTasksWithoutConfigs}} returns about 22MB of JSON. In 
> Python, using the TJSON protocol, it takes more than 20 seconds to 
> deserialize this data. The equivalent data encoded in the binary protocol 
> takes about 3 seconds to deserialize. I suspect that similar performance 
> issues occur in other thrift implementations in other programming languages.
> I think the scheduler should support both the TJSON protocol and the binary 
> protocol over HTTP.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to