[
https://issues.apache.org/jira/browse/PIG-165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Stefan Groschupf updated PIG-165:
---------------------------------
Attachment: PIG-165_r639015_v1.patch
This patch caches the byte array in the DataAtom. My performance tests show a
25 % performance improvement for read writes.
This is an big overall performance improvement for our application.
The higher memory usage is aceptable.
> caching the byte array in DataAtom can improve performance
> ----------------------------------------------------------
>
> Key: PIG-165
> URL: https://issues.apache.org/jira/browse/PIG-165
> Project: Pig
> Issue Type: Improvement
> Reporter: Stefan Groschupf
> Assignee: Stefan Groschupf
> Priority: Critical
> Attachments: PIG-165_r639015_v1.patch
>
>
> Many fields are passed through a processing step without changing there
> values. So pig basically just read and write them.
> So read write performance is critical.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.