[
https://issues.apache.org/jira/browse/SPARK-14744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Reynold Xin updated SPARK-14744:
--------------------------------
Issue Type: Sub-task (was: Improvement)
Parent: SPARK-11806
> Put examples packaging on a diet
> --------------------------------
>
> Key: SPARK-14744
> URL: https://issues.apache.org/jira/browse/SPARK-14744
> Project: Spark
> Issue Type: Sub-task
> Components: Examples
> Affects Versions: 2.0.0
> Reporter: Marcelo Vanzin
> Priority: Minor
>
> Currently the examples bring in a lot of external dependencies, ballooning
> the size of the Spark distribution packages.
> I'd like to propose two things to slim down these dependencies:
> - make all non-Spark, and also Spark Streaming, dependencies "provided". This
> means, especially for streaming connectors, that launching examples becomes
> more like launching real applications (where you need to figure out how to
> provide those dependencies, e.g. using {{--packages}}).
> - audit examples and remove those that don't provide a lot of value. For
> example, HBase is working on full-featured Spark bindings, based on code that
> has already been in use for a while before being merged into HBase. The HBase
> example in Spark is very bare-bones and, in comparison, not really useful and
> in fact a little misleading.
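
For illustration only, the first proposal might look roughly like the sketch below in a Maven build. This is not taken from the issue or from any patch attached to it: the choice of the Spark Streaming artifact and the property names are assumptions made for the example.

```xml
<!-- Hypothetical fragment of an examples pom.xml; coordinates and
     property names are illustrative assumptions, not from the issue. -->
<dependency>
  <groupId>org.apache.spark</groupId>
  <artifactId>spark-streaming_${scala.binary.version}</artifactId>
  <version>${project.version}</version>
  <!-- "provided": available at compile time, but not bundled into the
       packaged artifact, so it must be supplied when the example runs -->
  <scope>provided</scope>
</dependency>
```

With a dependency scoped this way, whoever launches the example has to supply it at run time, for instance by passing Maven coordinates of the form groupId:artifactId:version to {{--packages}}, as the description suggests.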
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)