Hi,
I'm currently profiling the memory consumption of our Beam pipeline and have
noticed that
org.apache.beam.sdk.values.PCollectionViews$SimplePCollectionView.hashCode()
makes noticeable heap allocations. The implementation is:
return Objects.hash(tag);
Objects.hash(Object... values) in turn delegates to:
return Arrays.hashCode(values);
Because Objects.hash takes varargs, each call implicitly allocates an
Object[] at the call site, which is then passed to:
public static int Arrays.hashCode(Object a[]);
Am I right that changing the SimplePCollectionView implementation to a
simple:
return tag.hashCode();
is the right thing to do?
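For reference, here is a minimal sketch of the three variants I have in mind. The class and method names (HashSketch, viaObjectsHash, direct, nullSafe) are made up for illustration and are not Beam code; I am also assuming tag is never null in the direct variant.

```java
import java.util.Objects;

public class HashSketch {
    // Hypothetical stand-in for the current implementation.
    static int viaObjectsHash(Object tag) {
        // Varargs call: the compiler wraps tag in a new Object[1] here.
        return Objects.hash(tag);
    }

    // Proposed change: no array, but throws NullPointerException on a null tag.
    static int direct(Object tag) {
        return tag.hashCode();
    }

    // Alternative: no array either, and returns 0 for a null tag.
    static int nullSafe(Object tag) {
        return Objects.hashCode(tag);
    }

    public static void main(String[] args) {
        Object tag = "some-tag";
        // Arrays.hashCode over one element computes 31 * 1 + element.hashCode().
        System.out.println(viaObjectsHash(tag) == 31 + direct(tag));
        System.out.println(nullSafe(tag) == direct(tag));
    }
}
```

As far as I can tell the new value would differ from the old one by a constant 31, which should not matter as long as equals() stays as it is and nothing relies on the concrete hash value.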
Regards,
Vojta