pitrou commented on issue #45733: URL: https://github.com/apache/arrow/issues/45733#issuecomment-2713121793
> If `skew` is always calculating the unbiased skew

No, it's computing the biased skew. Would unbiased be more/less useful? Or should we just add an option to make both variants available?

> with pyarrow's value being lower than pandas's value it appears pyarrow might be counting `None` as an observation while pandas is not considering it's missing value as an observation.

No, it's always the same value regardless of the number of nulls:

```pycon
>>> pc.skew([1.0, 2.0, 3.0, 40.0], skip_nulls=True)
<pyarrow.DoubleScalar: 1.14831951332278>
>>> pc.skew([1.0, 2.0, 3.0, 40.0, None], skip_nulls=True)
<pyarrow.DoubleScalar: 1.14831951332278>
>>> pc.skew([1.0, 2.0, 3.0, 40.0, None, None], skip_nulls=True)
<pyarrow.DoubleScalar: 1.14831951332278>
```
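For reference, here is a rough pure-Python sketch of the two variants being discussed. It assumes "biased" means the population skewness `m3 / m2**1.5` and "unbiased" means the usual sample-size-adjusted Fisher-Pearson coefficient, which I believe is what pandas reports by default. The helper names `biased_skew` and `unbiased_skew` are made up for illustration and are not pyarrow or pandas APIs.

```python
import math


def biased_skew(values):
    """Population (biased) skewness g1 = m3 / m2**1.5, with None entries dropped."""
    xs = [v for v in values if v is not None]
    n = len(xs)
    mean = sum(xs) / n
    # Second and third central moments, both divided by n (not n - 1).
    m2 = sum((x - mean) ** 2 for x in xs) / n
    m3 = sum((x - mean) ** 3 for x in xs) / n
    return m3 / m2 ** 1.5


def unbiased_skew(values):
    """Adjusted Fisher-Pearson skewness: g1 * sqrt(n * (n - 1)) / (n - 2)."""
    xs = [v for v in values if v is not None]
    n = len(xs)  # requires at least 3 non-null observations
    return biased_skew(xs) * math.sqrt(n * (n - 1)) / (n - 2)


data = [1.0, 2.0, 3.0, 40.0]
print(biased_skew(data))                  # should agree with the pc.skew value above
print(biased_skew(data + [None, None]))   # nulls are dropped, so the result is unchanged
print(unbiased_skew(data))                # larger than the biased value for this input
```

If that assumption about pandas holds, the correction factor `sqrt(n * (n - 1)) / (n - 2)` alone would explain why the pandas value is higher than the pyarrow one, without nulls being counted as observations.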
