westonpace commented on PR #14287:
URL: https://github.com/apache/arrow/pull/14287#issuecomment-1373020186
I'd like to revive this as it has been an ask for some time and I think it
is important. The technical issues of how the parser is created are probably
more minor than the maintenance issue of making sure we come up with an
expression syntax we are willing to support and expect to last.
There was a ML discussion on this but I feel it stalled out somewhat. Part
of the challenge is that there were two alternatives proposed. Another
challenge is that it would be unfortunate to adopt one standard in Arrow only
to have Substrait adopt a different standard later. I propose the following:
* Build up a corpus of example expressions (10-20 or so) that demonstrate
the various features (different types of scalars, escaping strings, etc.)
* Create a grammar for all proposals (I believe this will help when
communicating)
* Send a message to the Substrait mailing list with the proposal
* Revive the Arrow ML discussion and point any interested parties to the
Substrait discussion
* Once the Substrait discussion reaches consensus we can merge a parser
into arrow-c++
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]