[jira] [Updated] (SPARK-7119) ScriptTransform doesn't consider the output data type

2015-07-29 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7119?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-7119: Shepherd: Josh Rosen ScriptTransform doesn't consider the output data type

[jira] [Updated] (SPARK-7119) ScriptTransform doesn't consider the output data type

2015-07-29 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7119?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-7119: Priority: Critical (was: Blocker) ScriptTransform doesn't consider the output data type

[jira] [Created] (SPARK-9443) Explose sampleByKey in SparkR

2015-07-29 Thread Hossein Falaki (JIRA)
Hossein Falaki created SPARK-9443: - Summary: Explose sampleByKey in SparkR Key: SPARK-9443 URL: https://issues.apache.org/jira/browse/SPARK-9443 Project: Spark Issue Type: Bug

[jira] [Resolved] (SPARK-746) Automatically Use Avro Serialization for Avro Objects

2015-07-29 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-746?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Imran Rashid resolved SPARK-746. Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 7004

[jira] [Commented] (SPARK-5567) Add prediction methods to LDA

2015-07-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5567?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14646902#comment-14646902 ] Apache Spark commented on SPARK-5567: - User 'feynmanliang' has created a pull request

[jira] [Resolved] (SPARK-9347) spark load of existing parquet files extremely slow if large number of files

2015-07-29 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9347?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-9347. - Resolution: Duplicate Okay, thanks for clarifying. I'm still going to close this since

[jira] [Updated] (SPARK-9450) HashedRelation.get() could return an Iterator[Row] instead of Seq[Row]

2015-07-29 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-9450: - Assignee: Andrew Or HashedRelation.get() could return an Iterator[Row] instead of Seq[Row]

[jira] [Assigned] (SPARK-6486) Add BlockMatrix in PySpark

2015-07-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6486: --- Assignee: (was: Apache Spark) Add BlockMatrix in PySpark --

[jira] [Commented] (SPARK-6486) Add BlockMatrix in PySpark

2015-07-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6486?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14646927#comment-14646927 ] Apache Spark commented on SPARK-6486: - User 'dusenberrymw' has created a pull request

[jira] [Resolved] (SPARK-9448) GenerateUnsafeProjection should not share expressions across instances

2015-07-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9448?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-9448. Resolution: Fixed Fix Version/s: 1.5.0 GenerateUnsafeProjection should not share

[jira] [Commented] (SPARK-7332) RpcCallContext.sender has a different name from the original sender's name

2015-07-29 Thread Perinkulam I Ganesh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14646931#comment-14646931 ] Perinkulam I Ganesh commented on SPARK-7332: I can't compile the first

[jira] [Created] (SPARK-9461) Possibly slightly flaky PySpark StreamingLinearRegressionWithTests

2015-07-29 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-9461: Summary: Possibly slightly flaky PySpark StreamingLinearRegressionWithTests Key: SPARK-9461 URL: https://issues.apache.org/jira/browse/SPARK-9461 Project:

[jira] [Updated] (SPARK-8977) Define the RateEstimator interface, and implement the ReceiverRateController

2015-07-29 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8977?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-8977: - Assignee: Iulian Dragos Define the RateEstimator interface, and implement the

[jira] [Resolved] (SPARK-9016) Make random forest classifier extend Classifier abstraction

2015-07-29 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-9016. -- Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 7432

[jira] [Created] (SPARK-9449) inputFiles misses files from MetastoreRelation

2015-07-29 Thread Michael Armbrust (JIRA)
Michael Armbrust created SPARK-9449: --- Summary: inputFiles misses files from MetastoreRelation Key: SPARK-9449 URL: https://issues.apache.org/jira/browse/SPARK-9449 Project: Spark Issue

[jira] [Resolved] (SPARK-8921) Add @since tags to mllib.stat

2015-07-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-8921. -- Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 7730

[jira] [Updated] (SPARK-9364) Fix array out of bounds and use-after-free bugs in UnsafeExternalSorter

2015-07-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-9364: --- Issue Type: Sub-task (was: Bug) Parent: SPARK-9457 Fix array out of bounds and

[jira] [Updated] (SPARK-7078) Cache-aware binary processing in-memory sort

2015-07-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-7078: --- Issue Type: Sub-task (was: New Feature) Parent: SPARK-9457 Cache-aware binary processing

[jira] [Updated] (SPARK-7079) Cache-aware external sort

2015-07-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-7079: --- Issue Type: Sub-task (was: New Feature) Parent: SPARK-9457 Cache-aware external sort

[jira] [Updated] (SPARK-7542) Use LongArray for sort buffer in UnsafeExternalSorter

2015-07-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-7542: --- Issue Type: Sub-task (was: Improvement) Parent: SPARK-9457 Use LongArray for sort buffer in

[jira] [Created] (SPARK-9457) Sorting improvements

2015-07-29 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-9457: -- Summary: Sorting improvements Key: SPARK-9457 URL: https://issues.apache.org/jira/browse/SPARK-9457 Project: Spark Issue Type: Umbrella Components:

[jira] [Updated] (SPARK-8160) Tungsten style external aggregation

2015-07-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-8160: --- Assignee: Yin Huai Tungsten style external aggregation ---

[jira] [Updated] (SPARK-7542) Use off-heap sort buffer in UnsafeExternalSorter

2015-07-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-7542: --- Summary: Use off-heap sort buffer in UnsafeExternalSorter (was: Use LongArray for sort buffer in

[jira] [Assigned] (SPARK-9290) DateExpressionsSuite is slow to run

2015-07-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9290?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9290: --- Assignee: (was: Apache Spark) DateExpressionsSuite is slow to run

[jira] [Resolved] (SPARK-6793) Implement perplexity for LDA

2015-07-29 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6793?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-6793. -- Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 7705

[jira] [Assigned] (SPARK-6486) Add BlockMatrix in PySpark

2015-07-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6486: --- Assignee: Apache Spark Add BlockMatrix in PySpark --

[jira] [Assigned] (SPARK-9440) LocalLDAModel should save docConcentration, topicConcentration, and gammaShap

2015-07-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9440?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9440: --- Assignee: Apache Spark LocalLDAModel should save docConcentration, topicConcentration, and

[jira] [Commented] (SPARK-9440) LocalLDAModel should save docConcentration, topicConcentration, and gammaShap

2015-07-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9440?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14646947#comment-14646947 ] Apache Spark commented on SPARK-9440: - User 'feynmanliang' has created a pull request

[jira] [Created] (SPARK-9451) Support records larger than default page size in BytesToBytesMap

2015-07-29 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-9451: - Summary: Support records larger than default page size in BytesToBytesMap Key: SPARK-9451 URL: https://issues.apache.org/jira/browse/SPARK-9451 Project: Spark

[jira] [Updated] (SPARK-9440) LocalLDAModel should save docConcentration, topicConcentration, and gammaShap

2015-07-29 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9440?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-9440: - Assignee: Feynman Liang LocalLDAModel should save docConcentration, topicConcentration,

[jira] [Created] (SPARK-9455) Remove InternalRow's generic getter (the one without data type)

2015-07-29 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-9455: -- Summary: Remove InternalRow's generic getter (the one without data type) Key: SPARK-9455 URL: https://issues.apache.org/jira/browse/SPARK-9455 Project: Spark

[jira] [Commented] (SPARK-9458) Avoid object allocation in prefix generation

2015-07-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14646995#comment-14646995 ] Apache Spark commented on SPARK-9458: - User 'rxin' has created a pull request for this

[jira] [Assigned] (SPARK-9458) Avoid object allocation in prefix generation

2015-07-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9458: --- Assignee: Apache Spark (was: Reynold Xin) Avoid object allocation in prefix generation

[jira] [Created] (SPARK-9460) Avoid byte array allocation in StringPrefixComparator

2015-07-29 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-9460: -- Summary: Avoid byte array allocation in StringPrefixComparator Key: SPARK-9460 URL: https://issues.apache.org/jira/browse/SPARK-9460 Project: Spark Issue Type:

[jira] [Commented] (SPARK-8921) Add @since tags to mllib.stat

2015-07-29 Thread bimal tandel (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8921?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14647016#comment-14647016 ] bimal tandel commented on SPARK-8921: - Xiangrui, can you please assign the ticket to

[jira] [Resolved] (SPARK-8917) Add @since tags to mllib.linalg

2015-07-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-8917. -- Resolution: Duplicate Add @since tags to mllib.linalg ---

[jira] [Assigned] (SPARK-9451) Support records larger than default page size in BytesToBytesMap

2015-07-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9451?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9451: --- Assignee: Josh Rosen (was: Apache Spark) Support records larger than default page size in

[jira] [Commented] (SPARK-9451) Support records larger than default page size in BytesToBytesMap

2015-07-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9451?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14646963#comment-14646963 ] Apache Spark commented on SPARK-9451: - User 'JoshRosen' has created a pull request for

[jira] [Assigned] (SPARK-9451) Support records larger than default page size in BytesToBytesMap

2015-07-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9451?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9451: --- Assignee: Apache Spark (was: Josh Rosen) Support records larger than default page size in

[jira] [Created] (SPARK-9456) Remove InternalRow.toSeq

2015-07-29 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-9456: -- Summary: Remove InternalRow.toSeq Key: SPARK-9456 URL: https://issues.apache.org/jira/browse/SPARK-9456 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-9370) Support DecimalType in UnsafeRow

2015-07-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-9370: --- Assignee: Davies Liu Support DecimalType in UnsafeRow

[jira] [Closed] (SPARK-9387) Support comparing UnsafeRow and normal InternalRow

2015-07-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9387?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin closed SPARK-9387. -- Resolution: Won't Fix Target Version/s: (was: 1.5.0) Marking this as won't fix since it is

[jira] [Created] (SPARK-9459) avoid copy for UTF8String during unsafe sorting

2015-07-29 Thread Davies Liu (JIRA)
Davies Liu created SPARK-9459: - Summary: avoid copy for UTF8String during unsafe sorting Key: SPARK-9459 URL: https://issues.apache.org/jira/browse/SPARK-9459 Project: Spark Issue Type: Sub-task

[jira] [Assigned] (SPARK-9322) Add rbind as a synonym for `unionAll`

2015-07-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9322: --- Assignee: (was: Apache Spark) Add rbind as a synonym for `unionAll`

[jira] [Assigned] (SPARK-9321) Add nrow, ncol, dim for SparkR data frames

2015-07-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9321: --- Assignee: (was: Apache Spark) Add nrow, ncol, dim for SparkR data frames

[jira] [Commented] (SPARK-9322) Add rbind as a synonym for `unionAll`

2015-07-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9322?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14647031#comment-14647031 ] Apache Spark commented on SPARK-9322: - User 'falaki' has created a pull request for

[jira] [Assigned] (SPARK-9324) Add `unique` as a synonym for `distinct`

2015-07-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9324?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9324: --- Assignee: Apache Spark Add `unique` as a synonym for `distinct`

[jira] [Resolved] (SPARK-6129) Create MLlib metrics user guide with algorithm definitions and complete code examples.

2015-07-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-6129. -- Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 7655

[jira] [Updated] (SPARK-8921) Add @since tags to mllib.stat

2015-07-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-8921: - Assignee: bimal tandel Add @since tags to mllib.stat -

[jira] [Created] (SPARK-9463) Expose model coefficients with names in SparkR RFormula

2015-07-29 Thread Eric Liang (JIRA)
Eric Liang created SPARK-9463: - Summary: Expose model coefficients with names in SparkR RFormula Key: SPARK-9463 URL: https://issues.apache.org/jira/browse/SPARK-9463 Project: Spark Issue Type:

[jira] [Issue Comment Deleted] (SPARK-9246) DistributedLDAModel predict top docs per topic

2015-07-29 Thread Meihua Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Meihua Wu updated SPARK-9246: - Comment: was deleted (was: Got it. Thanks!) DistributedLDAModel predict top docs per topic

[jira] [Commented] (SPARK-9246) DistributedLDAModel predict top docs per topic

2015-07-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14647093#comment-14647093 ] Apache Spark commented on SPARK-9246: - User 'rotationsymmetry' has created a pull

[jira] [Assigned] (SPARK-9246) DistributedLDAModel predict top docs per topic

2015-07-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9246: --- Assignee: Apache Spark DistributedLDAModel predict top docs per topic

[jira] [Assigned] (SPARK-9246) DistributedLDAModel predict top docs per topic

2015-07-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9246: --- Assignee: (was: Apache Spark) DistributedLDAModel predict top docs per topic

[jira] [Issue Comment Deleted] (SPARK-9246) DistributedLDAModel predict top docs per topic

2015-07-29 Thread Meihua Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Meihua Wu updated SPARK-9246: - Comment: was deleted (was: Got it. Thanks!) DistributedLDAModel predict top docs per topic

[jira] [Commented] (SPARK-2356) Exception: Could not locate executable null\bin\winutils.exe in the Hadoop

2015-07-29 Thread Swapan Golla (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14647098#comment-14647098 ] Swapan Golla commented on SPARK-2356: - Same issue for me. I am on Win7/64bit and using

[jira] [Assigned] (SPARK-8735) Expose metrics for runtime memory usage

2015-07-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8735?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8735: --- Assignee: Apache Spark (was: Andrew Or) Expose metrics for runtime memory usage

[jira] [Commented] (SPARK-4352) Incorporate locality preferences in dynamic allocation requests

2015-07-29 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14647104#comment-14647104 ] Saisai Shao commented on SPARK-4352: Hi [~tgraves], location is got from DAGScheduler

[jira] [Created] (SPARK-9465) Could not read parquet table after recreating it with the same table name

2015-07-29 Thread StanZhai (JIRA)
StanZhai created SPARK-9465: --- Summary: Could not read parquet table after recreating it with the same table name Key: SPARK-9465 URL: https://issues.apache.org/jira/browse/SPARK-9465 Project: Spark

[jira] [Assigned] (SPARK-9454) LDASuite should use vector comparisons

2015-07-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9454?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9454: --- Assignee: (was: Apache Spark) LDASuite should use vector comparisons

[jira] [Resolved] (SPARK-9462) Initialize nondeterministic expressions in code gen fallback mode

2015-07-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9462?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-9462. Resolution: Fixed Fix Version/s: 1.5.0 Initialize nondeterministic expressions in code gen

[jira] [Commented] (SPARK-9461) Possibly slightly flaky PySpark StreamingLinearRegressionWithTests

2015-07-29 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14647155#comment-14647155 ] Joseph K. Bradley commented on SPARK-9461: -- Not the only one apparently. {code}

[jira] [Updated] (SPARK-9192) add initialization phase for nondeterministic expression

2015-07-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9192?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-9192: --- Issue Type: Sub-task (was: Improvement) Parent: SPARK-8599 add initialization phase for

[jira] [Updated] (SPARK-9462) Initialize nondeterministic expressions in code gen fallback mode

2015-07-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9462?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-9462: --- Issue Type: Sub-task (was: Bug) Parent: SPARK-8599 Initialize nondeterministic expressions

[jira] [Commented] (SPARK-9466) Flaky test: org.apache.spark.sql.hive.thriftserver.CliSuite

2015-07-29 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9466?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14647164#comment-14647164 ] Yin Huai commented on SPARK-9466: - Tests in this sutie have 1 minute timeout. But, seems

[jira] [Created] (SPARK-9466) Flaky test: org.apache.spark.sql.hive.thriftserver.CliSuite

2015-07-29 Thread Yin Huai (JIRA)
Yin Huai created SPARK-9466: --- Summary: Flaky test: org.apache.spark.sql.hive.thriftserver.CliSuite Key: SPARK-9466 URL: https://issues.apache.org/jira/browse/SPARK-9466 Project: Spark Issue Type:

[jira] [Commented] (SPARK-4144) Support incremental model training of Naive Bayes classifier

2015-07-29 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14647173#comment-14647173 ] Joseph K. Bradley commented on SPARK-4144: -- [~cfregly] Reassigned to you.

[jira] [Updated] (SPARK-4144) Support incremental model training of Naive Bayes classifier

2015-07-29 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-4144: - Assignee: Chris Fregly (was: Jeremy Freeman) Support incremental model training of

[jira] [Updated] (SPARK-4144) Support incremental model training of Naive Bayes classifier

2015-07-29 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-4144: - Target Version/s: (was: 1.5.0) Support incremental model training of Naive Bayes

[jira] [Commented] (SPARK-9460) Avoid byte array allocation in StringPrefixComparator

2015-07-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9460?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14647046#comment-14647046 ] Apache Spark commented on SPARK-9460: - User 'rxin' has created a pull request for this

[jira] [Commented] (SPARK-9246) DistributedLDAModel predict top docs per topic

2015-07-29 Thread Meihua Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14647096#comment-14647096 ] Meihua Wu commented on SPARK-9246: -- I have submitted a PR including the code, ScalaDoc

[jira] [Commented] (SPARK-5133) Feature Importance for Decision Tree (Ensembles)

2015-07-29 Thread Nam Ma (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14647115#comment-14647115 ] Nam Ma commented on SPARK-5133: --- As far as I know, feature importance in scikit-learn is

[jira] [Comment Edited] (SPARK-5133) Feature Importance for Decision Tree (Ensembles)

2015-07-29 Thread Nam Ma (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14647115#comment-14647115 ] Nam Ma edited comment on SPARK-5133 at 7/30/15 3:19 AM: As far as

[jira] [Updated] (SPARK-9465) Could not read parquet table after recreating it with the same table name

2015-07-29 Thread StanZhai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] StanZhai updated SPARK-9465: Description: I'am using SparkSQL in Spark 1.4.1. I encounter an error when using parquet table after

[jira] [Commented] (SPARK-9347) spark load of existing parquet files extremely slow if large number of files

2015-07-29 Thread Samphel Norden (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9347?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14647126#comment-14647126 ] Samphel Norden commented on SPARK-9347: --- No. I havent tried the latest. Assuming

[jira] [Resolved] (SPARK-9458) Avoid object allocation in prefix generation

2015-07-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-9458. Resolution: Fixed Fix Version/s: 1.5.0 Avoid object allocation in prefix generation

[jira] [Commented] (SPARK-9347) spark load of existing parquet files extremely slow if large number of files

2015-07-29 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9347?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14647134#comment-14647134 ] Liang-Chi Hsieh commented on SPARK-9347: OK. The latest development is, we will

[jira] [Commented] (SPARK-9347) spark load of existing parquet files extremely slow if large number of files

2015-07-29 Thread Samphel Norden (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9347?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14647141#comment-14647141 ] Samphel Norden commented on SPARK-9347: --- That will be ideal. spark load of

[jira] [Commented] (SPARK-9466) Flaky test: org.apache.spark.sql.hive.thriftserver.CliSuite

2015-07-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9466?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14647166#comment-14647166 ] Apache Spark commented on SPARK-9466: - User 'yhuai' has created a pull request for

[jira] [Commented] (SPARK-9461) Possibly slightly flaky PySpark StreamingLinearRegressionWithTests

2015-07-29 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14647165#comment-14647165 ] Joseph K. Bradley commented on SPARK-9461: -- Maybe from streaming?

[jira] [Created] (SPARK-9467) Specialized accumulators

2015-07-29 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-9467: --- Summary: Specialized accumulators Key: SPARK-9467 URL: https://issues.apache.org/jira/browse/SPARK-9467 Project: Spark Issue Type: Sub-task

[jira] [Assigned] (SPARK-9403) Implement code generation for In / InSet

2015-07-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9403?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9403: --- Assignee: (was: Apache Spark) Implement code generation for In / InSet

[jira] [Commented] (SPARK-9461) Possibly slightly flaky PySpark StreamingLinearRegressionWithTests

2015-07-29 Thread Jeremy Freeman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14647186#comment-14647186 ] Jeremy Freeman commented on SPARK-9461: --- A couple initial observations: - the Scala

[jira] [Assigned] (SPARK-9460) Avoid byte array allocation in StringPrefixComparator

2015-07-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9460?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9460: --- Assignee: (was: Apache Spark) Avoid byte array allocation in StringPrefixComparator

[jira] [Commented] (SPARK-8921) Add @since tags to mllib.stat

2015-07-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8921?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14647047#comment-14647047 ] Xiangrui Meng commented on SPARK-8921: -- Done. Next time if you see someone already

[jira] [Assigned] (SPARK-9462) Initialize nondeterministic expressions in code gen fallback mode

2015-07-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9462?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9462: --- Assignee: Apache Spark (was: Reynold Xin) Initialize nondeterministic expressions in code

[jira] [Commented] (SPARK-9408) Refactor mllib/linalg.py to mllib/linalg

2015-07-29 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9408?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14647082#comment-14647082 ] Joseph K. Bradley commented on SPARK-9408: -- Is this needed for 1.5? Refactor

[jira] [Commented] (SPARK-9372) For a join operator, rows with null equal join key expression can be filtered out early

2015-07-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14647083#comment-14647083 ] Apache Spark commented on SPARK-9372: - User 'JoshRosen' has created a pull request for

[jira] [Commented] (SPARK-2205) Unnecessary exchange operators in a join on multiple tables with the same join key.

2015-07-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14647108#comment-14647108 ] Apache Spark commented on SPARK-2205: - User 'JoshRosen' has created a pull request for

[jira] [Commented] (SPARK-9347) spark load of existing parquet files extremely slow if large number of files

2015-07-29 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9347?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14647136#comment-14647136 ] Liang-Chi Hsieh commented on SPARK-9347: You concern should be solved in the

[jira] [Assigned] (SPARK-8862) Add a web UI page that visualizes physical plans (SparkPlan)

2015-07-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8862?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8862: --- Assignee: Apache Spark Add a web UI page that visualizes physical plans (SparkPlan)

[jira] [Assigned] (SPARK-9461) Possibly slightly flaky PySpark StreamingLinearRegressionWithTests

2015-07-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9461?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9461: --- Assignee: Jeremy Freeman (was: Apache Spark) Possibly slightly flaky PySpark

[jira] [Commented] (SPARK-9461) Possibly slightly flaky PySpark StreamingLinearRegressionWithTests

2015-07-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14647159#comment-14647159 ] Apache Spark commented on SPARK-9461: - User 'jkbradley' has created a pull request for

[jira] [Assigned] (SPARK-9461) Possibly slightly flaky PySpark StreamingLinearRegressionWithTests

2015-07-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9461?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9461: --- Assignee: Apache Spark (was: Jeremy Freeman) Possibly slightly flaky PySpark

[jira] [Assigned] (SPARK-9466) Flaky test: org.apache.spark.sql.hive.thriftserver.CliSuite

2015-07-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9466: --- Assignee: (was: Apache Spark) Flaky test:

[jira] [Assigned] (SPARK-9466) Flaky test: org.apache.spark.sql.hive.thriftserver.CliSuite

2015-07-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9466: --- Assignee: Apache Spark Flaky test: org.apache.spark.sql.hive.thriftserver.CliSuite

[jira] [Commented] (SPARK-9403) Implement code generation for In / InSet

2015-07-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14647180#comment-14647180 ] Apache Spark commented on SPARK-9403: - User 'tarekauel' has created a pull request for

[jira] [Assigned] (SPARK-9403) Implement code generation for In / InSet

2015-07-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9403?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9403: --- Assignee: Apache Spark Implement code generation for In / InSet

[jira] [Assigned] (SPARK-9460) Avoid byte array allocation in StringPrefixComparator

2015-07-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9460?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9460: --- Assignee: Apache Spark Avoid byte array allocation in StringPrefixComparator

[jira] [Created] (SPARK-9462) Initialize nondeterministic expressions in code gen fallback mode

2015-07-29 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-9462: -- Summary: Initialize nondeterministic expressions in code gen fallback mode Key: SPARK-9462 URL: https://issues.apache.org/jira/browse/SPARK-9462 Project: Spark

<    1   2   3   4   >