Re: [I] [Python] `pyarrow.compute.skew(skip_nulls=True)` still counts NULL as an observation? [arrow]

via GitHub Tue, 11 Mar 2025 01:25:47 -0700


pitrou commented on issue #45733:
URL: https://github.com/apache/arrow/issues/45733#issuecomment-2713121793


   > If `skew` is always calculating the unbiased skew
   
   No, it's computing the biased skew. Would unbiased be more/less useful? Or 
should we just add an option to make both variants available?
   
   > with pyarrow's value being lower than pandas's value, it appears pyarrow 
might be counting `None` as an observation while pandas is not considering its 
missing value as an observation.
   
   No, it's always the same value regardless of the number of nulls:
   ```pycon
   >>> pc.skew([1.0, 2.0, 3.0, 40.0], skip_nulls=True)
   <pyarrow.DoubleScalar: 1.14831951332278>
   >>> pc.skew([1.0, 2.0, 3.0, 40.0, None], skip_nulls=True)
   <pyarrow.DoubleScalar: 1.14831951332278>
   >>> pc.skew([1.0, 2.0, 3.0, 40.0, None, None], skip_nulls=True)
   <pyarrow.DoubleScalar: 1.14831951332278>
   ```
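
For illustration, the gap between the two libraries can be reproduced without either one. The sketch below is not pyarrow's or pandas's implementation; the helper names `biased_skew` and `adjusted_skew` are hypothetical. It assumes the biased estimator is g1 = m3 / m2^1.5 (population moments) and that the unbiased variant is the adjusted Fisher-Pearson coefficient G1 = g1 * sqrt(n(n-1)) / (n-2), which is my understanding of what pandas reports. Nulls are dropped before anything is counted, so they do not affect `n`.

```python
import math


def _central_moments(values):
    """Drop None entries, then return (n, m2, m3) using population (1/n) moments."""
    xs = [x for x in values if x is not None]
    n = len(xs)
    mean = sum(xs) / n
    m2 = sum((x - mean) ** 2 for x in xs) / n
    m3 = sum((x - mean) ** 3 for x in xs) / n
    return n, m2, m3


def biased_skew(values):
    """Biased sample skewness g1 = m3 / m2**1.5 (hypothetical helper)."""
    _, m2, m3 = _central_moments(values)
    return m3 / m2 ** 1.5


def adjusted_skew(values):
    """Adjusted Fisher-Pearson skewness G1; needs at least 3 non-null values."""
    n, m2, m3 = _central_moments(values)
    return math.sqrt(n * (n - 1)) / (n - 2) * m3 / m2 ** 1.5


if __name__ == "__main__":
    data = [1.0, 2.0, 3.0, 40.0, None, None]
    print("biased:  ", biased_skew(data))    # same estimator as the values quoted above
    print("adjusted:", adjusted_skew(data))  # larger, because of the sqrt(n(n-1))/(n-2) factor
```

With four non-null values the correction factor is sqrt(12)/2, roughly 1.73, which would explain a lower pyarrow value without any nulls being counted.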


