HyukjinKwon commented on pull request #33384:
URL: https://github.com/apache/spark/pull/33384#issuecomment-881153833
looks like there's a valid test failure:
```
======================================================================
ERROR [1.825s]: test_intersection
(pyspark.pandas.tests.indexes.test_category.CategoricalIndexTest)
----------------------------------------------------------------------
Traceback (most recent call last):
File
"/__w/apache-spark/apache-spark/python/pyspark/testing/pandasutils.py", line
138, in assertPandasEqual
assert_index_equal(left, right, check_exact=check_exact)
File
"/__t/Python/3.6.14/x64/lib/python3.6/site-packages/pandas/_testing.py", line
747, in assert_index_equal
_check_types(left, right, obj=obj)
File
"/__t/Python/3.6.14/x64/lib/python3.6/site-packages/pandas/_testing.py", line
717, in _check_types
assert_attr_equal("dtype", l, r, obj=obj)
AssertionError: Index are different
Attribute "dtype" are different
[left]: CategoricalDtype(categories=['z', 'y', 'x', 'w'], ordered=False)
[right]: CategoricalDtype(categories=['x', 'y'], ordered=False)
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
File
"/__w/apache-spark/apache-spark/python/pyspark/pandas/tests/indexes/test_category.py",
line 163, in test_intersection
psidx1.intersection(psidx2).sort_values(),
pidx1.intersection(pidx2).sort_values()
File
"/__w/apache-spark/apache-spark/python/pyspark/testing/pandasutils.py", line
223, in assert_eq
self.assertPandasEqual(lobj, robj, check_exact=check_exact)
File
"/__w/apache-spark/apache-spark/python/pyspark/testing/pandasutils.py", line
145, in assertPandasEqual
raise AssertionError(msg) from e
AssertionError: Index are different
Attribute "dtype" are different
[left]: CategoricalDtype(categories=['z', 'y', 'x', 'w'], ordered=False)
[right]: CategoricalDtype(categories=['x', 'y'], ordered=False)
Left:
CategoricalIndex(['y', 'x'], categories=['z', 'y', 'x', 'w'], ordered=False,
dtype='category')
category
Right:
CategoricalIndex(['x', 'y'], categories=['x', 'y'], ordered=False,
dtype='category')
category
----------------------------------------------------------------------
```
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]