Holden Karau created SPARK-55215:
------------------------------------
Summary: Explore if any commonly used libraries have a common
subset we should map
Key: SPARK-55215
URL: https://issues.apache.org/jira/browse/SPARK-55215
Project: Spark
Issue Type: Sub-task
Components: SQL
Affects Versions: 4.3
Reporter: Holden Karau
We'd only support the very limited usage patterns that make sense (if at all)
Possibilities include:
* nltk operations
* common pandas operations (for Pandas on Spark)
* simple numpy/scipy
* re
* json
Out of scope:
* Simple web endpoind (no direct mapping but if we see it a lot explore)
* datetime (area of flux on how we handle in PySpark, not a good time to try
and optimize)
Output from here should be some additional JIRAs for things we explore
transpiling
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]