Mike Sukmanowsky created PIG-4225:
-------------------------------------
Summary: Allow users to specify Python executable for Pig streaming
Key: PIG-4225
URL: https://issues.apache.org/jira/browse/PIG-4225
Project: Pig
Issue Type: Improvement
Components: internal-udfs
Affects Versions: 0.12.0, 0.12.1
Reporter: Mike Sukmanowsky
The [current
PythonScriptEngine|https://github.com/apache/pig/blob/release-0.12.0/src/org/apache/pig/scripting/streaming/python/PythonScriptEngine.java#L69]
uses whatever {{python}} executable is currently on the {{PATH}} to execute scripts.
Python users commonly create virtual environments (virtualenvs) so that a
project's requirements can be installed without resorting to "global"
installs via, for example, {{sudo pip install <some package>}}.
Is it possible to have the Python executable specified either via the
{{DEFINE}} command syntax or in a Hadoop job configuration variable? Perhaps
{{pig.pythonstreaming.pythonpath}}?
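
To illustrate the requested behaviour, here is a minimal sketch of the lookup, written in Python for brevity rather than in the engine's Java. It assumes the job configuration is available as a plain dict and that the property is named {{pig.pythonstreaming.pythonpath}} as floated above; that property does not exist in Pig today, and the function names are hypothetical.

```python
# Hypothetical property name, taken from the suggestion in this issue.
# It is NOT an existing Pig or Hadoop setting.
PYTHON_EXECUTABLE_KEY = "pig.pythonstreaming.pythonpath"

# Mirrors the current behaviour: whatever "python" resolves to on the PATH.
DEFAULT_PYTHON = "python"


def resolve_python_executable(job_conf):
    """Return the configured interpreter, or the default if unset or blank."""
    configured = job_conf.get(PYTHON_EXECUTABLE_KEY, "").strip()
    return configured or DEFAULT_PYTHON


def build_command(job_conf, script_path):
    """Build the argv list that would launch the streaming script."""
    return [resolve_python_executable(job_conf), script_path]
```

With this in place, a user could point the property at a virtualenv interpreter (for example {{/path/to/venv/bin/python}}) and the streaming script would run with that environment's packages; jobs that leave the property unset would behave as they do now.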