[
https://issues.apache.org/jira/browse/SPARK-14300?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joseph K. Bradley updated SPARK-14300:
--------------------------------------
Description:
Duplicated code that I found in scala/examples/mllib:
* scala/mllib
** DenseGaussianMixture.scala
** StreamingLinearRegression.scala
(This is the updated list. The original list is copied below.)
h4. Original list of code examples to check
Original list:
* scala/mllib
** DecisionTreeRunner.scala
** DenseGaussianMixture.scala
** DenseKMeans.scala
** GradientBoostedTreesRunner.scala
** LDAExample.scala
** LinearRegression.scala
** SparseNaiveBayes.scala
** StreamingLinearRegression.scala
** StreamingLogisticRegression.scala
** TallSkinnyPCA.scala
** TallSkinnySVD.scala
* Unsure code duplications (need doube check)
** AbstractParams.scala
** BinaryClassification.scala
** Correlations.scala
** CosineSimilarity.scala
** DenseGaussianMixture.scala
** FPGrowthExample.scala
** MovieLensALS.scala
** MultivariateSummarizer.scala
** RandomRDDGeneration.scala
** SampledRDDs.scala
When merging and cleaning those code, be sure not disturb the previous example
on and off blocks.
was:
Duplicated code that I found in scala/examples/mllib:
* scala/mllib
** DecisionTreeRunner.scala
** DenseGaussianMixture.scala
** DenseKMeans.scala
** GradientBoostedTreesRunner.scala
** LDAExample.scala
** LinearRegression.scala
** SparseNaiveBayes.scala
** StreamingLinearRegression.scala
** StreamingLogisticRegression.scala
** TallSkinnyPCA.scala
** TallSkinnySVD.scala
* Unsure code duplications (need doube check)
** AbstractParams.scala
** BinaryClassification.scala
** Correlations.scala
** CosineSimilarity.scala
** DenseGaussianMixture.scala
** FPGrowthExample.scala
** MovieLensALS.scala
** MultivariateSummarizer.scala
** RandomRDDGeneration.scala
** SampledRDDs.scala
When merging and cleaning those code, be sure not disturb the previous example
on and off blocks.
> Scala MLlib examples code merge and clean up
> --------------------------------------------
>
> Key: SPARK-14300
> URL: https://issues.apache.org/jira/browse/SPARK-14300
> Project: Spark
> Issue Type: Sub-task
> Components: Examples
> Reporter: Xusen Yin
> Priority: Minor
> Labels: starter
>
> Duplicated code that I found in scala/examples/mllib:
> * scala/mllib
> ** DenseGaussianMixture.scala
> ** StreamingLinearRegression.scala
> (This is the updated list. The original list is copied below.)
> h4. Original list of code examples to check
> Original list:
> * scala/mllib
> ** DecisionTreeRunner.scala
> ** DenseGaussianMixture.scala
> ** DenseKMeans.scala
> ** GradientBoostedTreesRunner.scala
> ** LDAExample.scala
> ** LinearRegression.scala
> ** SparseNaiveBayes.scala
> ** StreamingLinearRegression.scala
> ** StreamingLogisticRegression.scala
> ** TallSkinnyPCA.scala
> ** TallSkinnySVD.scala
> * Unsure code duplications (need doube check)
> ** AbstractParams.scala
> ** BinaryClassification.scala
> ** Correlations.scala
> ** CosineSimilarity.scala
> ** DenseGaussianMixture.scala
> ** FPGrowthExample.scala
> ** MovieLensALS.scala
> ** MultivariateSummarizer.scala
> ** RandomRDDGeneration.scala
> ** SampledRDDs.scala
> When merging and cleaning those code, be sure not disturb the previous
> example on and off blocks.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]