[jira] [Updated] (SPARK-5268) CoarseGrainedExecutorBackend exits for irrelevant DisassociatedEvent

2015-01-15 Thread Nan Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5268?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nan Zhu updated SPARK-5268: --- Summary: CoarseGrainedExecutorBackend exits for irrelevant DisassociatedEvent (was: ExecutorBackend exits

[jira] [Commented] (SPARK-5268) ExecutorBackend exits for irrelevant DisassociatedEvent

2015-01-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5268?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14278786#comment-14278786 ] Apache Spark commented on SPARK-5268: - User 'CodingCat' has created a pull request for

[jira] [Commented] (SPARK-5012) Python API for Gaussian Mixture Model

2015-01-15 Thread Travis Galoppo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14278947#comment-14278947 ] Travis Galoppo commented on SPARK-5012: --- This will probably be affected by

[jira] [Commented] (SPARK-5185) pyspark --jars does not add classes to driver class path

2015-01-15 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5185?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14279040#comment-14279040 ] Marcelo Vanzin commented on SPARK-5185: --- BTW I talked to Uri offline about this. The

[jira] [Commented] (SPARK-5097) Adding data frame APIs to SchemaRDD

2015-01-15 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5097?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14279078#comment-14279078 ] Reynold Xin commented on SPARK-5097: [~hkothari] that is correct. It will be trivially

[jira] [Commented] (SPARK-5270) Elegantly check if RDD is empty

2015-01-15 Thread Al M (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5270?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14278983#comment-14278983 ] Al M commented on SPARK-5270: - I just noticed that rdd.partitions.size is set to 0 for empty

[jira] [Commented] (SPARK-5270) Elegantly check if RDD is empty

2015-01-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5270?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14278993#comment-14278993 ] Sean Owen commented on SPARK-5270: -- I think it's conceivable to have an RDD with no

[jira] [Created] (SPARK-5270) Elegantly check if RDD is empty

2015-01-15 Thread Al M (JIRA)
Al M created SPARK-5270: --- Summary: Elegantly check if RDD is empty Key: SPARK-5270 URL: https://issues.apache.org/jira/browse/SPARK-5270 Project: Spark Issue Type: Improvement Affects Versions:

[jira] [Updated] (SPARK-5267) Add a streaming module to ingest Apache Camel Messages from a configured endpoints

2015-01-15 Thread Steve Brewin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steve Brewin updated SPARK-5267: Description: The number of input stream protocols supported by Spark Streaming is quite limited,

[jira] [Updated] (SPARK-5271) PySpark History Web UI issues

2015-01-15 Thread Andrey Zimovnov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5271?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrey Zimovnov updated SPARK-5271: --- Component/s: Web UI PySpark History Web UI issues -

[jira] [Created] (SPARK-5271) PySpark History Web UI issues

2015-01-15 Thread Andrey Zimovnov (JIRA)
Andrey Zimovnov created SPARK-5271: -- Summary: PySpark History Web UI issues Key: SPARK-5271 URL: https://issues.apache.org/jira/browse/SPARK-5271 Project: Spark Issue Type: Bug Affects

[jira] [Updated] (SPARK-5270) Elegantly check if RDD is empty

2015-01-15 Thread Al M (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Al M updated SPARK-5270: Description: Right now there is no clean way to check if an RDD is empty. As discussed here:

[jira] [Commented] (SPARK-5246) spark/spark-ec2.py cannot start Spark master in VPC if local DNS name does not resolve

2015-01-15 Thread Vladimir Grigor (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14278780#comment-14278780 ] Vladimir Grigor commented on SPARK-5246:

[jira] [Created] (SPARK-5268) ExecutorBackend exits for irrelevant DisassociatedEvent

2015-01-15 Thread Nan Zhu (JIRA)
Nan Zhu created SPARK-5268: -- Summary: ExecutorBackend exits for irrelevant DisassociatedEvent Key: SPARK-5268 URL: https://issues.apache.org/jira/browse/SPARK-5268 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-5267) Add a streaming module to ingest Apache Camel Messages from a configured endpoints

2015-01-15 Thread Steve Brewin (JIRA)
Steve Brewin created SPARK-5267: --- Summary: Add a streaming module to ingest Apache Camel Messages from a configured endpoints Key: SPARK-5267 URL: https://issues.apache.org/jira/browse/SPARK-5267

[jira] [Created] (SPARK-5269) BlockManager.dataDeserialize always creates a new serializer instance

2015-01-15 Thread Ivan Vergiliev (JIRA)
Ivan Vergiliev created SPARK-5269: - Summary: BlockManager.dataDeserialize always creates a new serializer instance Key: SPARK-5269 URL: https://issues.apache.org/jira/browse/SPARK-5269 Project: Spark

[jira] [Commented] (SPARK-5097) Adding data frame APIs to SchemaRDD

2015-01-15 Thread Hamel Ajay Kothari (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5097?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14278819#comment-14278819 ] Hamel Ajay Kothari commented on SPARK-5097: --- Am I correct in interpreting that

[jira] [Comment Edited] (SPARK-5226) Add DBSCAN Clustering Algorithm to MLlib

2015-01-15 Thread Muhammad-Ali A'rabi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14279170#comment-14279170 ] Muhammad-Ali A'rabi edited comment on SPARK-5226 at 1/15/15 7:33 PM:

[jira] [Created] (SPARK-5273) Improve documentation examples for LinearRegression

2015-01-15 Thread Dev Lakhani (JIRA)
Dev Lakhani created SPARK-5273: -- Summary: Improve documentation examples for LinearRegression Key: SPARK-5273 URL: https://issues.apache.org/jira/browse/SPARK-5273 Project: Spark Issue Type:

[jira] [Commented] (SPARK-5012) Python API for Gaussian Mixture Model

2015-01-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14279251#comment-14279251 ] Joseph K. Bradley commented on SPARK-5012: -- [~MeethuMathew], [~tgaloppo] makes a

[jira] [Commented] (SPARK-5274) Stabilize UDFRegistration API

2015-01-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14279352#comment-14279352 ] Apache Spark commented on SPARK-5274: - User 'rxin' has created a pull request for this

[jira] [Created] (SPARK-5274) Stabilize UDFRegistration API

2015-01-15 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-5274: -- Summary: Stabilize UDFRegistration API Key: SPARK-5274 URL: https://issues.apache.org/jira/browse/SPARK-5274 Project: Spark Issue Type: Sub-task

[jira] [Comment Edited] (SPARK-5272) Refactor NaiveBayes to support discrete and continuous labels,features

2015-01-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5272?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14279235#comment-14279235 ] Joseph K. Bradley edited comment on SPARK-5272 at 1/15/15 8:13 PM:

[jira] [Commented] (SPARK-4894) Add Bernoulli-variant of Naive Bayes

2015-01-15 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14279250#comment-14279250 ] RJ Nowling commented on SPARK-4894: --- Thanks, [~josephkb]! I'd be happy to help with the

[jira] [Commented] (SPARK-5272) Refactor NaiveBayes to support discrete and continuous labels,features

2015-01-15 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5272?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14279258#comment-14279258 ] RJ Nowling commented on SPARK-5272: --- Hi [~josephkb], I can see benefits to your

[jira] [Comment Edited] (SPARK-1405) parallel Latent Dirichlet Allocation (LDA) atop of spark in MLlib

2015-01-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14279274#comment-14279274 ] Joseph K. Bradley edited comment on SPARK-1405 at 1/15/15 9:29 PM:

[jira] [Resolved] (SPARK-5224) parallelize list/ndarray is really slow

2015-01-15 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5224?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-5224. --- Resolution: Fixed Fix Version/s: 1.2.1 1.3.0 Issue resolved by pull request

[jira] [Updated] (SPARK-5224) parallelize list/ndarray is really slow

2015-01-15 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5224?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-5224: -- Assignee: Davies Liu parallelize list/ndarray is really slow ---

[jira] [Created] (SPARK-5272) Refactor NaiveBayes to support discrete and continuous labels,features

2015-01-15 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-5272: Summary: Refactor NaiveBayes to support discrete and continuous labels,features Key: SPARK-5272 URL: https://issues.apache.org/jira/browse/SPARK-5272

[jira] [Commented] (SPARK-4894) Add Bernoulli-variant of Naive Bayes

2015-01-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14279241#comment-14279241 ] Joseph K. Bradley commented on SPARK-4894: -- [~rnowling] I too don't want to hold

[jira] [Commented] (SPARK-5111) HiveContext and Thriftserver cannot work in secure cluster beyond hadoop2.5

2015-01-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14279216#comment-14279216 ] Apache Spark commented on SPARK-5111: - User 'zhzhan' has created a pull request for

[jira] [Commented] (SPARK-4746) integration tests should be separated from faster unit tests

2015-01-15 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4746?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14279524#comment-14279524 ] Imran Rashid commented on SPARK-4746: - This doesn't work as well as I thought -- all

[jira] [Resolved] (SPARK-5274) Stabilize UDFRegistration API

2015-01-15 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-5274. Resolution: Fixed Fix Version/s: 1.3.0 Stabilize UDFRegistration API

[jira] [Commented] (SPARK-5144) spark-yarn module should be published

2015-01-15 Thread Matthew Sanders (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14279457#comment-14279457 ] Matthew Sanders commented on SPARK-5144: +1 -- I am in a similar situation and

[jira] [Commented] (SPARK-4879) Missing output partitions after job completes with speculative execution

2015-01-15 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4879?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14279406#comment-14279406 ] Josh Rosen commented on SPARK-4879: --- I'm not sure that SparkHadoopWriter's use of

[jira] [Created] (SPARK-5275) pyspark.streaming is not included in assembly jar

2015-01-15 Thread Davies Liu (JIRA)
Davies Liu created SPARK-5275: - Summary: pyspark.streaming is not included in assembly jar Key: SPARK-5275 URL: https://issues.apache.org/jira/browse/SPARK-5275 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-5276) pyspark.streaming is not included in assembly jar

2015-01-15 Thread Davies Liu (JIRA)
Davies Liu created SPARK-5276: - Summary: pyspark.streaming is not included in assembly jar Key: SPARK-5276 URL: https://issues.apache.org/jira/browse/SPARK-5276 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-1405) parallel Latent Dirichlet Allocation (LDA) atop of spark in MLlib

2015-01-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14279274#comment-14279274 ] Joseph K. Bradley commented on SPARK-1405: -- I'll try out the statmt dataset if

[jira] [Commented] (SPARK-3622) Provide a custom transformation that can output multiple RDDs

2015-01-15 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3622?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14279602#comment-14279602 ] Imran Rashid commented on SPARK-3622: - In some ways this kinda reminds of the problem

[jira] [Updated] (SPARK-5277) SparkSqlSerializer does not register user specified KryoRegistrators

2015-01-15 Thread Max Seiden (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5277?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Max Seiden updated SPARK-5277: -- Remaining Estimate: (was: 24h) Original Estimate: (was: 24h) SparkSqlSerializer does not

[jira] [Created] (SPARK-5277) SparkSqlSerializer does not register user specified KryoRegistrators

2015-01-15 Thread Max Seiden (JIRA)
Max Seiden created SPARK-5277: - Summary: SparkSqlSerializer does not register user specified KryoRegistrators Key: SPARK-5277 URL: https://issues.apache.org/jira/browse/SPARK-5277 Project: Spark

[jira] [Closed] (SPARK-5011) Add support for WITH SERDEPROPERTIES, TBLPROPERTIES in CREATE TEMPORARY TABLE

2015-01-15 Thread shengli (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] shengli closed SPARK-5011. -- Resolution: Later Add support for WITH SERDEPROPERTIES, TBLPROPERTIES in CREATE TEMPORARY TABLE

[jira] [Resolved] (SPARK-4857) Add Executor Events to SparkListener

2015-01-15 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4857?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-4857. Resolution: Fixed Fix Version/s: 1.3.0 Add Executor Events to SparkListener

[jira] [Commented] (SPARK-4879) Missing output partitions after job completes with speculative execution

2015-01-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4879?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14279711#comment-14279711 ] Apache Spark commented on SPARK-4879: - User 'JoshRosen' has created a pull request for

[jira] [Commented] (SPARK-4874) Report number of records read/written in a task

2015-01-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14279712#comment-14279712 ] Apache Spark commented on SPARK-4874: - User 'ksakellis' has created a pull request for

[jira] [Commented] (SPARK-5012) Python API for Gaussian Mixture Model

2015-01-15 Thread Meethu Mathew (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14279811#comment-14279811 ] Meethu Mathew commented on SPARK-5012: -- Once SPARK-5019 is resolved, we will make the

[jira] [Created] (SPARK-5278) ambiguous reference to fields in Spark SQL is incompleted

2015-01-15 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-5278: -- Summary: ambiguous reference to fields in Spark SQL is incompleted Key: SPARK-5278 URL: https://issues.apache.org/jira/browse/SPARK-5278 Project: Spark Issue

[jira] [Updated] (SPARK-5278) ambiguous reference to fields in Spark SQL is incompleted

2015-01-15 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5278?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-5278: --- Description: for json string like {a: {b: 1, B: 2}} The SQL `SELECT a.b from t` will report error for

[jira] [Updated] (SPARK-5278) ambiguous reference to fields in Spark SQL is incompleted

2015-01-15 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5278?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-5278: --- Description: for json string like {a: {b: 1, B: 2}} The SQL `SELECT a.b from t` will report error for

[jira] [Resolved] (SPARK-2630) Input data size of CoalescedRDD is incorrect

2015-01-15 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-2630. Resolution: Duplicate I think this is a dup of SPARK-4092. Input data size of

[jira] [Updated] (SPARK-4955) Dynamic allocation doesn't work in YARN cluster mode

2015-01-15 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4955?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4955: --- Priority: Blocker (was: Critical) Dynamic allocation doesn't work in YARN cluster mode

[jira] [Updated] (SPARK-4955) Dynamic allocation doesn't work in YARN cluster mode

2015-01-15 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4955?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4955: --- Target Version/s: 1.3.0 Dynamic allocation doesn't work in YARN cluster mode

[jira] [Commented] (SPARK-5216) Spark Ui should report estimated time remaining for each stage.

2015-01-15 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5216?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14279863#comment-14279863 ] Patrick Wendell commented on SPARK-5216: This has been proposed before, but in the

[jira] [Created] (SPARK-5279) Use java.math.BigDecimal as the exposed Decimal type

2015-01-15 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-5279: -- Summary: Use java.math.BigDecimal as the exposed Decimal type Key: SPARK-5279 URL: https://issues.apache.org/jira/browse/SPARK-5279 Project: Spark Issue Type:

[jira] [Updated] (SPARK-5278) ambiguous reference to fields in Spark SQL is incompleted

2015-01-15 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5278?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-5278: --- Description: for json string like {a: {b: 1, B: 2}} The SQL `SELECT a.b from t` will report error for

[jira] [Updated] (SPARK-5176) Thrift server fails with confusing error message when deploy-mode is cluster

2015-01-15 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5176?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5176: --- Labels: starter (was: ) Thrift server fails with confusing error message when deploy-mode

[jira] [Commented] (SPARK-5176) Thrift server fails with confusing error message when deploy-mode is cluster

2015-01-15 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5176?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14279869#comment-14279869 ] Patrick Wendell commented on SPARK-5176: Yes, we should add a check here similar

[jira] [Updated] (SPARK-5278) ambiguous reference to fields in Spark SQL is incompleted

2015-01-15 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5278?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-5278: --- Description: at hive context for json string like {a: {b: 1, B: 2}} The SQL `SELECT a.b from t` will

[jira] [Comment Edited] (SPARK-5176) Thrift server fails with confusing error message when deploy-mode is cluster

2015-01-15 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5176?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14279869#comment-14279869 ] Patrick Wendell edited comment on SPARK-5176 at 1/16/15 6:28 AM:

[jira] [Updated] (SPARK-5278) ambiguous reference to fields in Spark SQL is incompleted

2015-01-15 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5278?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-5278: --- Description: at hive context for json string like {code}{a: {b: 1, B: 2}}{code} The SQL `SELECT a.b

[jira] [Commented] (SPARK-5260) Expose JsonRDD.allKeysWithValueTypes() in a utility class

2015-01-15 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14279883#comment-14279883 ] Yin Huai commented on SPARK-5260: - [~sonixbp] If you like, you can make the change and

[jira] [Commented] (SPARK-5278) ambiguous reference to fields in Spark SQL is incompleted

2015-01-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5278?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14279892#comment-14279892 ] Apache Spark commented on SPARK-5278: - User 'cloud-fan' has created a pull request for

[jira] [Updated] (SPARK-5251) Using `tableIdentifier` in hive metastore

2015-01-15 Thread wangfei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] wangfei updated SPARK-5251: --- Target Version/s: 1.3.0 Using `tableIdentifier` in hive metastore

[jira] [Updated] (SPARK-5251) Using `tableIdentifier` in hive metastore

2015-01-15 Thread wangfei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] wangfei updated SPARK-5251: --- Target Version/s: (was: 1.3.0) Using `tableIdentifier` in hive metastore

[jira] [Commented] (SPARK-2686) Add Length support to Spark SQL and HQL and Strlen support to SQL

2015-01-15 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2686?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14279914#comment-14279914 ] Reynold Xin commented on SPARK-2686: Do you mind closing the pull request? I will

[jira] [Reopened] (SPARK-2686) Add Length support to Spark SQL and HQL and Strlen support to SQL

2015-01-15 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin reopened SPARK-2686: Add Length support to Spark SQL and HQL and Strlen support to SQL

[jira] [Commented] (SPARK-2686) Add Length support to Spark SQL and HQL and Strlen support to SQL

2015-01-15 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2686?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14279913#comment-14279913 ] Reynold Xin commented on SPARK-2686: [~javadba] I think Michael meant closing the pull

[jira] [Updated] (SPARK-4867) UDF clean up

2015-01-15 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4867?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-4867: --- Issue Type: Sub-task (was: Bug) Parent: SPARK-5166 UDF clean up

[jira] [Resolved] (SPARK-5211) Restore HiveMetastoreTypes.toDataType

2015-01-15 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5211?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-5211. Resolution: Fixed Fix Version/s: 1.3.0 Assignee: Yin Huai Restore

[jira] [Commented] (SPARK-2686) Add Length support to Spark SQL and HQL and Strlen support to SQL

2015-01-15 Thread Stephen Boesch (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2686?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14279920#comment-14279920 ] Stephen Boesch commented on SPARK-2686: --- ok closed Add Length support to Spark

[jira] [Commented] (SPARK-2686) Add Length support to Spark SQL and HQL and Strlen support to SQL

2015-01-15 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2686?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14279923#comment-14279923 ] Reynold Xin commented on SPARK-2686: Thanks. Let's pull it in once SPARK-4867 is

[jira] [Created] (SPARK-5262) coalesce should allow NullType and 1 another type in parameters

2015-01-15 Thread Adrian Wang (JIRA)
Adrian Wang created SPARK-5262: -- Summary: coalesce should allow NullType and 1 another type in parameters Key: SPARK-5262 URL: https://issues.apache.org/jira/browse/SPARK-5262 Project: Spark

[jira] [Commented] (SPARK-5262) coalesce should allow NullType and 1 another type in parameters

2015-01-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5262?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14278416#comment-14278416 ] Apache Spark commented on SPARK-5262: - User 'adrian-wang' has created a pull request

[jira] [Updated] (SPARK-1084) Fix most build warnings

2015-01-15 Thread Tony Stevenson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tony Stevenson updated SPARK-1084: -- Reporter: Sean Owen (was: Sean Owen) Fix most build warnings ---

[jira] [Updated] (SPARK-1181) 'mvn test' fails out of the box since sbt assembly does not necessarily exist

2015-01-15 Thread Tony Stevenson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1181?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tony Stevenson updated SPARK-1181: -- Reporter: Sean Owen (was: Sean Owen) 'mvn test' fails out of the box since sbt assembly does

[jira] [Updated] (SPARK-1315) spark on yarn-alpha with mvn on master branch won't build

2015-01-15 Thread Tony Stevenson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1315?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tony Stevenson updated SPARK-1315: -- Assignee: Sean Owen (was: Sean Owen) spark on yarn-alpha with mvn on master branch won't

[jira] [Updated] (SPARK-2879) Use HTTPS to access Maven Central and other repos

2015-01-15 Thread Tony Stevenson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2879?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tony Stevenson updated SPARK-2879: -- Assignee: Sean Owen (was: Sean Owen) Use HTTPS to access Maven Central and other repos

[jira] [Updated] (SPARK-3803) ArrayIndexOutOfBoundsException found in executing computePrincipalComponents

2015-01-15 Thread Tony Stevenson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3803?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tony Stevenson updated SPARK-3803: -- Assignee: Sean Owen (was: Sean Owen) ArrayIndexOutOfBoundsException found in executing

[jira] [Updated] (SPARK-2749) Spark SQL Java tests aren't compiling in Jenkins' Maven builds; missing junit:junit dep

2015-01-15 Thread Tony Stevenson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2749?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tony Stevenson updated SPARK-2749: -- Assignee: Sean Owen (was: Sean Owen) Spark SQL Java tests aren't compiling in Jenkins' Maven

[jira] [Updated] (SPARK-1556) jets3t dep doesn't update properly with newer Hadoop versions

2015-01-15 Thread Tony Stevenson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tony Stevenson updated SPARK-1556: -- Assignee: Sean Owen (was: Sean Owen) jets3t dep doesn't update properly with newer Hadoop

[jira] [Updated] (SPARK-1071) Tidy logging strategy and use of log4j

2015-01-15 Thread Tony Stevenson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tony Stevenson updated SPARK-1071: -- Reporter: Sean Owen (was: Sean Owen) Tidy logging strategy and use of log4j

[jira] [Updated] (SPARK-1254) Consolidate, order, and harmonize repository declarations in Maven/SBT builds

2015-01-15 Thread Tony Stevenson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1254?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tony Stevenson updated SPARK-1254: -- Reporter: Sean Owen (was: Sean Owen) Consolidate, order, and harmonize repository

[jira] [Updated] (SPARK-1335) Also increase perm gen / code cache for scalatest when invoked via Maven build

2015-01-15 Thread Tony Stevenson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1335?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tony Stevenson updated SPARK-1335: -- Reporter: Sean Owen (was: Sean Owen) Also increase perm gen / code cache for scalatest when

[jira] [Updated] (SPARK-1316) Remove use of Commons IO

2015-01-15 Thread Tony Stevenson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1316?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tony Stevenson updated SPARK-1316: -- Reporter: Sean Owen (was: Sean Owen) Remove use of Commons IO

[jira] [Updated] (SPARK-2341) loadLibSVMFile doesn't handle regression datasets

2015-01-15 Thread Tony Stevenson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2341?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tony Stevenson updated SPARK-2341: -- Assignee: Sean Owen (was: Sean Owen) loadLibSVMFile doesn't handle regression datasets

[jira] [Updated] (SPARK-1071) Tidy logging strategy and use of log4j

2015-01-15 Thread Tony Stevenson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tony Stevenson updated SPARK-1071: -- Assignee: Sean Owen (was: Sean Owen) Tidy logging strategy and use of log4j

[jira] [Updated] (SPARK-2798) Correct several small errors in Flume module pom.xml files

2015-01-15 Thread Tony Stevenson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2798?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tony Stevenson updated SPARK-2798: -- Assignee: Sean Owen (was: Sean Owen) Correct several small errors in Flume module pom.xml

[jira] [Created] (SPARK-5263) `create table` DDL need to check if table exists first

2015-01-15 Thread shengli (JIRA)
shengli created SPARK-5263: -- Summary: `create table` DDL need to check if table exists first Key: SPARK-5263 URL: https://issues.apache.org/jira/browse/SPARK-5263 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-5246) spark/spark-ec2.py cannot start Spark master in VPC if local DNS name does not resolve

2015-01-15 Thread Vladimir Grigor (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vladimir Grigor updated SPARK-5246: --- Description: ##How to reproduce: 1)

[jira] [Updated] (SPARK-5246) spark/spark-ec2.py cannot start Spark master in VPC if local DNS name does not resolve

2015-01-15 Thread Vladimir Grigor (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vladimir Grigor updated SPARK-5246: --- Description: How to reproduce: 1)

[jira] [Created] (SPARK-5264) support `drop table` DDL command

2015-01-15 Thread shengli (JIRA)
shengli created SPARK-5264: -- Summary: support `drop table` DDL command Key: SPARK-5264 URL: https://issues.apache.org/jira/browse/SPARK-5264 Project: Spark Issue Type: Bug Components:

[jira] [Commented] (SPARK-5263) `create table` DDL need to check if table exists first

2015-01-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5263?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14278476#comment-14278476 ] Apache Spark commented on SPARK-5263: - User 'OopsOutOfMemory' has created a pull

[jira] [Commented] (SPARK-5243) Spark will hang if (driver memory + executor memory) exceeds limit on a 1-worker cluster

2015-01-15 Thread Takumi Yoshida (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14278502#comment-14278502 ] Takumi Yoshida commented on SPARK-5243: --- Hi! I found, Spark hangs with following

[jira] [Updated] (SPARK-1727) Correct small compile errors, typos, and markdown issues in (primarly) MLlib docs

2015-01-15 Thread Tony Stevenson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1727?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tony Stevenson updated SPARK-1727: -- Assignee: Sean Owen (was: Sean Owen) Correct small compile errors, typos, and markdown issues

[jira] [Updated] (SPARK-1789) Multiple versions of Netty dependencies cause FlumeStreamSuite failure

2015-01-15 Thread Tony Stevenson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tony Stevenson updated SPARK-1789: -- Assignee: Sean Owen (was: Sean Owen) Multiple versions of Netty dependencies cause

[jira] [Updated] (SPARK-1802) Audit dependency graph when Spark is built with -Phive

2015-01-15 Thread Tony Stevenson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1802?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tony Stevenson updated SPARK-1802: -- Assignee: Sean Owen (was: Sean Owen) Audit dependency graph when Spark is built with -Phive

[jira] [Updated] (SPARK-1248) Spark build error with Apache Hadoop(Cloudera CDH4)

2015-01-15 Thread Tony Stevenson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1248?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tony Stevenson updated SPARK-1248: -- Assignee: Sean Owen (was: Sean Owen) Spark build error with Apache Hadoop(Cloudera CDH4)

[jira] [Updated] (SPARK-1120) Send all dependency logging through slf4j

2015-01-15 Thread Tony Stevenson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tony Stevenson updated SPARK-1120: -- Assignee: Sean Owen (was: Sean Owen) Send all dependency logging through slf4j

[jira] [Updated] (SPARK-2363) Clean MLlib's sample data files

2015-01-15 Thread Tony Stevenson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tony Stevenson updated SPARK-2363: -- Assignee: Sean Owen (was: Sean Owen) Clean MLlib's sample data files

[jira] [Updated] (SPARK-1254) Consolidate, order, and harmonize repository declarations in Maven/SBT builds

2015-01-15 Thread Tony Stevenson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1254?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tony Stevenson updated SPARK-1254: -- Assignee: Sean Owen (was: Sean Owen) Consolidate, order, and harmonize repository

  1   2   >