Re: [PR] GH-45531: [Python] Add the `dim_names` argument to `from_numpy_ndarray` [arrow]

2025-04-18 Thread via GitHub


conbench-apache-arrow[bot] commented on PR #46170:
URL: https://github.com/apache/arrow/pull/46170#issuecomment-2814272339

   After merging your PR, Conbench analyzed the 4 benchmarking runs that have 
been run so far on merge-commit cc56a5e78eeede2715f12b6ca61ebea741f38f60.
   
   There were no benchmark performance regressions. 🎉
   
   The [full Conbench report](https://github.com/apache/arrow/runs/40761610933) 
has more details. It also includes information about 6 possible false positives 
for unstable benchmarks that are known to sometimes produce them.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] GH-45531: [Python] Add the `dim_names` argument to `from_numpy_ndarray` [arrow]

2025-04-17 Thread via GitHub


rok merged PR #46170:
URL: https://github.com/apache/arrow/pull/46170


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] GH-45531: [Python] Add the `dim_names` argument to `from_numpy_ndarray` [arrow]

2025-04-17 Thread via GitHub


yyossy5 commented on PR #46170:
URL: https://github.com/apache/arrow/pull/46170#issuecomment-2813471967

   Thank you @rok !
   
   >  A minor ask: could we add a check like:
   > 
   > ```python
   > tensor_array_from_numpy = pa.FixedShapeTensorArray.from_numpy_ndarray(arr)
   > assert tensor_array_from_numpy.dim_names() is None
   > ```
   
   You're right, I've added the assertions.
   
https://github.com/apache/arrow/pull/46170/commits/800bb03c0c905487919ac586595aa13b31422df9
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] GH-45531: [Python] Add the `dim_names` argument to `from_numpy_ndarray` [arrow]

2025-04-17 Thread via GitHub


rok commented on PR #46170:
URL: https://github.com/apache/arrow/pull/46170#issuecomment-2813377286

   Thanks for working on this @yyossy5!
   
   This looks good. A minor ask: could we add a check like:
   ```python
   tensor_array_from_numpy = pa.FixedShapeTensorArray.from_numpy_ndarray(arr)
   assert tensor_array_from_numpy.dim_names is None
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] GH-45531: [Python] Add the `dim_names` argument to `from_numpy_ndarray` [arrow]

2025-04-17 Thread via GitHub


yyossy5 commented on code in PR #46170:
URL: https://github.com/apache/arrow/pull/46170#discussion_r2049144944


##
python/pyarrow/array.pxi:
##
@@ -4655,6 +4657,17 @@ cdef class FixedShapeTensorArray(ExtensionArray):
 "Cannot convert 1D array or scalar to fixed shape tensor 
array")
 if np.prod(obj.shape) == 0:
 raise ValueError("Expected a non-empty ndarray")
+if dim_names is not None:
+if not hasattr(dim_names, '__iter__'):

Review Comment:
   Thank you @raulcd !
   Fixed it in 
https://github.com/apache/arrow/pull/46170/commits/38ae0a4534d5bab2352f7a0d1bb89ea7b379e139
 and added a test case for generator input in 
https://github.com/apache/arrow/pull/46170/commits/ff2ff30f79abce1a7eef32c3f1945805ae6d6687
 .



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] GH-45531: [Python] Add the `dim_names` argument to `from_numpy_ndarray` [arrow]

2025-04-17 Thread via GitHub


raulcd commented on code in PR #46170:
URL: https://github.com/apache/arrow/pull/46170#discussion_r2048956956


##
python/pyarrow/array.pxi:
##
@@ -4655,6 +4657,17 @@ cdef class FixedShapeTensorArray(ExtensionArray):
 "Cannot convert 1D array or scalar to fixed shape tensor 
array")
 if np.prod(obj.shape) == 0:
 raise ValueError("Expected a non-empty ndarray")
+if dim_names is not None:
+if not hasattr(dim_names, '__iter__'):

Review Comment:
   This will allow us to pass a generator but `len(din_names)` would fail with:
   ```
   >>> g = (x for x in range(2))
   >>> hasattr(g, '__iter__')
   True
   >>> len(g)
   Traceback (most recent call last):
 File "", line 1, in 
   TypeError: object of type 'generator' has no len()
   ```
   Should we check for `collections.abc.Sequence` as we do in other places like:
   
https://github.com/apache/arrow/blob/0ae5c860004293d0b81bbc31198931aeae9bc277/python/pyarrow/feather.py#L259-L261



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] GH-45531: [Python] Add the `dim_names` argument to `from_numpy_ndarray` [arrow]

2025-04-17 Thread via GitHub


yyossy5 commented on PR #46170:
URL: https://github.com/apache/arrow/pull/46170#issuecomment-2812525229

   Thank you for reviewing! @AlenkaF
   
   > The document you linked will be updated automatically by the changes in 
this PR, since that section is generated from the docstrings.
   
   Oh, that's nice.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] GH-45531: [Python] Add the `dim_names` argument to `from_numpy_ndarray` [arrow]

2025-04-17 Thread via GitHub


AlenkaF commented on PR #46170:
URL: https://github.com/apache/arrow/pull/46170#issuecomment-2812296116

   > Should the document be revised as well?
   
   Thanks for the contribution, @yyossy5! The document you linked will be 
updated automatically by the changes in this PR, since that section is 
generated from the docstrings.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] GH-45531: [Python] Add the `dim_names` argument to `from_numpy_ndarray` [arrow]

2025-04-16 Thread via GitHub


yyossy5 commented on PR #46170:
URL: https://github.com/apache/arrow/pull/46170#issuecomment-2810155991

   Should the document be revised as well?
   
   
https://arrow.apache.org/docs/python/generated/pyarrow.FixedShapeTensorArray.html#pyarrow.FixedShapeTensorArray.from_numpy_ndarray


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org