jorisvandenbossche commented on pull request #8459:
URL: https://github.com/apache/arrow/pull/8459#issuecomment-823545315
One thing I noted while quickly trying it out:
```
In [6]: pc.extract_regex(pa.array(['a1', 'b2', 'c3']),
regex=r'(?P<letter>[ab])?(?P<digit>\d)')
Out[6]:
<pyarrow.lib.StructArray object at 0x7f8c1bdf9760>
-- is_valid: all not null
-- child 0 type: string
[
"a",
"b",
""
]
-- child 1 type: string
[
"1",
"2",
"3"
]
```
When having an optional group like the above, I would maybe expect a null
instead of an empty string in the first child?
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]