[jira] [Reopened] (SPARK-17969) I think it's user unfriendly to process standard json file with DataFrame

2016-11-07 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin reopened SPARK-17969: - > I think it's user unfriendly to process standard json file with DataFrame >

[jira] [Closed] (SPARK-10840) SparkSQL doesn't work well with JSON

2016-11-07 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10840?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin closed SPARK-10840. --- Resolution: Duplicate > SparkSQL doesn't work well with JSON >

[jira] [Closed] (SPARK-17969) I think it's user unfriendly to process standard json file with DataFrame

2016-11-07 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin closed SPARK-17969. --- Resolution: Duplicate > I think it's user unfriendly to process standard json file with DataFrame >

[jira] [Reopened] (SPARK-7366) Support multi-line JSON objects

2016-11-07 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin reopened SPARK-7366: > Support multi-line JSON objects > --- > > Key: SPARK-7366

[jira] [Updated] (SPARK-18352) Parse normal, multi-line JSON files (not just JSON Lines)

2016-11-07 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18352?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-18352: Summary: Parse normal, multi-line JSON files (not just JSON Lines) (was: Parse normal JSON files

[jira] [Closed] (SPARK-7366) Support multi-line JSON objects

2016-11-07 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin closed SPARK-7366. -- Resolution: Duplicate > Support multi-line JSON objects > --- > >

[jira] [Closed] (SPARK-7366) Support multi-line JSON objects

2016-11-07 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin closed SPARK-7366. -- Resolution: Fixed I'm closing this in favor of https://issues.apache.org/jira/browse/SPARK-18352 In

[jira] [Created] (SPARK-18352) Parse normal JSON files (not just JSON Lines)

2016-11-07 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-18352: --- Summary: Parse normal JSON files (not just JSON Lines) Key: SPARK-18352 URL: https://issues.apache.org/jira/browse/SPARK-18352 Project: Spark Issue Type: New

[jira] [Resolved] (SPARK-18351) from_json and to_json for parsing JSON for string columns

2016-11-07 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18351?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-18351. - Resolution: Fixed Fix Version/s: 2.1.0 > from_json and to_json for parsing JSON for

[jira] [Updated] (SPARK-18295) Match up to_json to from_json in null safety

2016-11-07 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18295?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-18295: Issue Type: Sub-task (was: Bug) Parent: SPARK-18351 > Match up to_json to from_json in

[jira] [Updated] (SPARK-18295) Match up to_json to from_json in null safety

2016-11-07 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18295?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-18295: Assignee: Hyukjin Kwon > Match up to_json to from_json in null safety >

[jira] [Updated] (SPARK-17764) to_json function for parsing Structs to json Strings

2016-11-07 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-17764: Issue Type: Sub-task (was: Improvement) Parent: SPARK-18351 > to_json function for

[jira] [Created] (SPARK-18351) from_json and to_json for parsing JSON for string columns

2016-11-07 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-18351: --- Summary: from_json and to_json for parsing JSON for string columns Key: SPARK-18351 URL: https://issues.apache.org/jira/browse/SPARK-18351 Project: Spark

[jira] [Updated] (SPARK-18260) from_json can throw a better exception when it can't find the column or be nullSafe

2016-11-07 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18260?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-18260: Issue Type: Sub-task (was: Bug) Parent: SPARK-18351 > from_json can throw a better

[jira] [Updated] (SPARK-17699) from_json function for parsing json Strings into Structs

2016-11-07 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17699?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-17699: Issue Type: Sub-task (was: New Feature) Parent: SPARK-18351 > from_json function for

[jira] [Commented] (SPARK-18350) Support session local timezone

2016-11-07 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15646704#comment-15646704 ] Reynold Xin commented on SPARK-18350: - I'm guessing the easiest way to do this is to change all the

[jira] [Created] (SPARK-18350) Support session local timezone

2016-11-07 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-18350: --- Summary: Support session local timezone Key: SPARK-18350 URL: https://issues.apache.org/jira/browse/SPARK-18350 Project: Spark Issue Type: New Feature

[jira] [Resolved] (SPARK-16575) partition calculation mismatch with sc.binaryFiles

2016-11-07 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16575?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-16575. - Resolution: Fixed Assignee: Tarun Kumar Fix Version/s: 2.1.0 > partition

[jira] [Resolved] (SPARK-18217) Disallow creating permanent views based on temporary views or UDFs

2016-11-07 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-18217. - Resolution: Fixed Fix Version/s: 2.1.0 > Disallow creating permanent views based on

[jira] [Updated] (SPARK-16609) Single function for parsing timestamps/dates

2016-11-07 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16609?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-16609: Target Version/s: 2.2.0 (was: 2.1.0) > Single function for parsing timestamps/dates >

[jira] [Updated] (SPARK-17019) Expose off-heap memory usage in various places

2016-11-07 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-17019: Target Version/s: 2.2.0 (was: 2.1.0) > Expose off-heap memory usage in various places >

[jira] [Updated] (SPARK-16317) Add file filtering interface for FileFormat

2016-11-07 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-16317: Target Version/s: 2.2.0 (was: 2.1.0) > Add file filtering interface for FileFormat >

[jira] [Resolved] (SPARK-18261) Add statistics to MemorySink for joining

2016-11-07 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18261?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-18261. - Resolution: Fixed Assignee: Liwei Lin Fix Version/s: 2.1.0 > Add statistics to

[jira] [Resolved] (SPARK-18086) Regression: Hive variables no longer work in Spark 2.0

2016-11-07 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-18086. - Resolution: Fixed Assignee: Ryan Blue Fix Version/s: 2.1.0 > Regression: Hive

[jira] [Updated] (SPARK-17993) Spark spews a slew of harmless but annoying warning messages from Parquet when reading parquet files written by older versions of Parquet-mr

2016-11-07 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-17993: Target Version/s: 2.1.0 > Spark spews a slew of harmless but annoying warning messages from

[jira] [Resolved] (SPARK-16904) Removal of Hive Built-in Hash Functions and TestHiveFunctionRegistry

2016-11-07 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-16904. - Resolution: Fixed Assignee: Xiao Li Fix Version/s: 2.1.0 > Removal of Hive

[jira] [Resolved] (SPARK-18296) Use consistent naming for expression test suites

2016-11-06 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-18296. - Resolution: Fixed Fix Version/s: 2.1.0 > Use consistent naming for expression test suites

[jira] [Reopened] (SPARK-18167) Flaky test when hive partition pruning is enabled

2016-11-06 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18167?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin reopened SPARK-18167: - Assignee: (was: Eric Liang) This is unfortunately still flaky. Reopening it. > Flaky

[jira] [Created] (SPARK-18296) Use consistent naming for expression test suites

2016-11-06 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-18296: --- Summary: Use consistent naming for expression test suites Key: SPARK-18296 URL: https://issues.apache.org/jira/browse/SPARK-18296 Project: Spark Issue Type:

[jira] [Updated] (SPARK-18278) Support native submission of spark jobs to a kubernetes cluster

2016-11-06 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18278?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-18278: Flags: (was: Important) > Support native submission of spark jobs to a kubernetes cluster >

[jira] [Updated] (SPARK-18278) Support native submission of spark jobs to a kubernetes cluster

2016-11-06 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18278?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-18278: Target Version/s: (was: 2.2.0) > Support native submission of spark jobs to a kubernetes cluster

[jira] [Updated] (SPARK-18278) Support native submission of spark jobs to a kubernetes cluster

2016-11-06 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18278?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-18278: Affects Version/s: (was: 2.2.0) > Support native submission of spark jobs to a kubernetes

[jira] [Resolved] (SPARK-18173) data source tables should support truncating partition

2016-11-06 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18173?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-18173. - Resolution: Fixed Fix Version/s: 2.1.0 > data source tables should support truncating

[jira] [Resolved] (SPARK-18269) NumberFormatException when reading csv for a nullable column

2016-11-06 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-18269. - Resolution: Fixed Assignee: Hyukjin Kwon Fix Version/s: 2.1.0 >

[jira] [Updated] (SPARK-17990) ALTER TABLE ... ADD PARTITION does not play nice with mixed-case partition column names

2016-11-05 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17990?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-17990: Issue Type: Sub-task (was: Bug) Parent: SPARK-17861 > ALTER TABLE ... ADD PARTITION does

[jira] [Resolved] (SPARK-17183) put hive serde table schema to table properties like data source table

2016-11-05 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17183?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-17183. - Resolution: Fixed Fix Version/s: 2.1.0 > put hive serde table schema to table properties

[jira] [Resolved] (SPARK-17983) Can't filter over mixed case parquet columns of converted Hive tables

2016-11-05 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17983?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-17983. - Resolution: Fixed Assignee: Wenchen Fan Fix Version/s: 2.1.0 > Can't filter over

[jira] [Resolved] (SPARK-18101) ExternalCatalogSuite should test with mixed case fields

2016-11-05 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18101?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-18101. - Resolution: Fixed Assignee: Wenchen Fan Fix Version/s: 2.1.0 >

[jira] [Commented] (SPARK-18258) Sinks need access to offset representation

2016-11-05 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15638887#comment-15638887 ] Reynold Xin commented on SPARK-18258: - What is OffsetSeq? > Sinks need access to offset

[jira] [Updated] (SPARK-18269) NumberFormatException when reading csv for a nullable column

2016-11-05 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-18269: Description: Having a schema with a nullable column thrown an java.lang.NumberFormatException:

[jira] [Resolved] (SPARK-18260) from_json can throw a better exception when it can't find the column or be nullSafe

2016-11-05 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18260?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-18260. - Resolution: Fixed Assignee: Burak Yavuz Fix Version/s: 2.1.0 > from_json can

[jira] [Created] (SPARK-18287) Move hash expressions from misc.scala into hash.scala

2016-11-05 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-18287: --- Summary: Move hash expressions from misc.scala into hash.scala Key: SPARK-18287 URL: https://issues.apache.org/jira/browse/SPARK-18287 Project: Spark Issue

[jira] [Commented] (SPARK-18258) Sinks need access to offset representation

2016-11-04 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15638470#comment-15638470 ] Reynold Xin commented on SPARK-18258: - This makes sense. It's just extra information you want to be

[jira] [Updated] (SPARK-16804) Correlated subqueries containing non-deterministic operators return incorrect results

2016-11-04 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16804?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-16804: Fix Version/s: 2.0.2 > Correlated subqueries containing non-deterministic operators return

[jira] [Updated] (SPARK-17337) Incomplete algorithm for name resolution in Catalyst paser may lead to incorrect result

2016-11-04 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-17337: Fix Version/s: 2.0.2 > Incomplete algorithm for name resolution in Catalyst paser may lead to >

[jira] [Resolved] (SPARK-18197) Optimise AppendOnlyMap implementation

2016-11-04 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-18197. - Resolution: Fixed Assignee: Adam Roberts Fix Version/s: 2.1.0 > Optimise

[jira] [Updated] (SPARK-18000) Aggregation function for computing endpoints for histograms

2016-11-04 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18000?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-18000: Issue Type: Sub-task (was: New Feature) Parent: SPARK-16026 > Aggregation function for

[jira] [Created] (SPARK-18267) Distribute PySpark via Python Package Index (pypi)

2016-11-04 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-18267: --- Summary: Distribute PySpark via Python Package Index (pypi) Key: SPARK-18267 URL: https://issues.apache.org/jira/browse/SPARK-18267 Project: Spark Issue Type:

[jira] [Updated] (SPARK-1267) Add a pip installer for PySpark

2016-11-04 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-1267: --- Issue Type: Sub-task (was: Improvement) Parent: SPARK-18267 > Add a pip installer for

[jira] [Updated] (SPARK-18129) Sign pip artifacts

2016-11-04 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-18129: Issue Type: Sub-task (was: Improvement) Parent: SPARK-18267 > Sign pip artifacts >

[jira] [Updated] (SPARK-16026) Cost-based Optimizer framework

2016-11-04 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16026?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-16026: Labels: releasenotes (was: ) > Cost-based Optimizer framework > -- >

[jira] [Resolved] (SPARK-18259) QueryExecution should not catch Throwable

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18259?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-18259. - Resolution: Fixed Fix Version/s: 2.1.0 > QueryExecution should not catch Throwable >

[jira] [Resolved] (SPARK-18138) More officially deprecate support for Python 2.6, Java 7, and Scala 2.10

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18138?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-18138. - Resolution: Fixed Assignee: Sean Owen Fix Version/s: 2.1.0 > More officially

[jira] [Updated] (SPARK-18138) More officially deprecate support for Python 2.6, Java 7, and Scala 2.10

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18138?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-18138: Target Version/s: 2.1.0 (was: 2.2.0) > More officially deprecate support for Python 2.6, Java 7,

[jira] [Updated] (SPARK-18138) More officially deprecate support for Python 2.6, Java 7, and Scala 2.10

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18138?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-18138: Summary: More officially deprecate support for Python 2.6, Java 7, and Scala 2.10 (was: Remove

[jira] [Resolved] (SPARK-18257) Improve error reporting for FileStressSuite in streaming

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18257?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-18257. - Resolution: Fixed Fix Version/s: 2.1.0 > Improve error reporting for FileStressSuite in

[jira] [Commented] (SPARK-18086) Regression: Hive variables no longer work in Spark 2.0

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15634172#comment-15634172 ] Reynold Xin commented on SPARK-18086: - [~rdblue] Does my explanation make sense? Can you change the

[jira] [Created] (SPARK-18257) Improve error reporting for FileStressSuite in streaming

2016-11-03 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-18257: --- Summary: Improve error reporting for FileStressSuite in streaming Key: SPARK-18257 URL: https://issues.apache.org/jira/browse/SPARK-18257 Project: Spark Issue

[jira] [Updated] (SPARK-18237) hive.exec.stagingdir have no effect in spark2.0.1

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18237?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-18237: Fix Version/s: (was: 2.0.3) > hive.exec.stagingdir have no effect in spark2.0.1 >

[jira] [Resolved] (SPARK-18237) hive.exec.stagingdir have no effect in spark2.0.1

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18237?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-18237. - Resolution: Fixed Assignee: ClassNotFoundExp Fix Version/s: 2.1.0

[jira] [Resolved] (SPARK-18244) Rename partitionProviderIsHive -> tracksPartitionsInCatalog

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18244?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-18244. - Resolution: Fixed Fix Version/s: 2.1.0 > Rename partitionProviderIsHive ->

[jira] [Updated] (SPARK-14220) Build and test Spark against Scala 2.12

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14220?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-14220: Target Version/s: (was: 2.2.0) > Build and test Spark against Scala 2.12 >

[jira] [Commented] (SPARK-14220) Build and test Spark against Scala 2.12

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15633773#comment-15633773 ] Reynold Xin commented on SPARK-14220: - Yea in reality it's going to be really painful to upgrade. >

[jira] [Resolved] (SPARK-18219) Move commit protocol API from sql to core module

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-18219. - Resolution: Fixed Fix Version/s: 2.1.0 > Move commit protocol API from sql to core module

[jira] [Updated] (SPARK-18067) SortMergeJoin adds shuffle if join predicates have non partitioned columns

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18067?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-18067: Issue Type: Sub-task (was: Bug) Parent: SPARK-18245 > SortMergeJoin adds shuffle if join

[jira] [Updated] (SPARK-16904) Removal of Hive Built-in Hash Functions and TestHiveFunctionRegistry

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-16904: Issue Type: Sub-task (was: Bug) Parent: SPARK-15691 > Removal of Hive Built-in Hash

[jira] [Commented] (SPARK-17495) Hive hash implementation

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17495?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15632141#comment-15632141 ] Reynold Xin commented on SPARK-17495: - Yes! > Hive hash implementation > >

[jira] [Updated] (SPARK-17495) Hive hash implementation

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-17495: Issue Type: Sub-task (was: Improvement) Parent: SPARK-18245 > Hive hash implementation >

[jira] [Updated] (SPARK-17495) Hive hash implementation

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-17495: Fix Version/s: (was: 2.1.0) > Hive hash implementation > > >

[jira] [Updated] (SPARK-17487) Configurable bucketing info extraction

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17487?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-17487: Issue Type: Sub-task (was: Bug) Parent: SPARK-18245 > Configurable bucketing info

[jira] [Updated] (SPARK-15453) FileSourceScanExec to extract `outputOrdering` information

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15453?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-15453: Issue Type: Sub-task (was: New Feature) Parent: SPARK-18245 > FileSourceScanExec to

[jira] [Updated] (SPARK-17254) Filter operator should have “stop if false” semantics for sorted data

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17254?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-17254: Issue Type: Sub-task (was: Improvement) Parent: SPARK-18245 > Filter operator should have

[jira] [Updated] (SPARK-17570) Avoid Hash and Exchange in Sort Merge join if bucketing factor is multiple for tables

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17570?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-17570: Issue Type: Sub-task (was: Bug) Parent: SPARK-18245 > Avoid Hash and Exchange in Sort

[jira] [Updated] (SPARK-15867) Use bucket files for TABLESAMPLE BUCKET

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15867?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-15867: Issue Type: Sub-task (was: Bug) Parent: SPARK-18245 > Use bucket files for TABLESAMPLE

[jira] [Updated] (SPARK-17497) Preserve order when scanning ordered buckets over multiple partitions

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17497?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-17497: Issue Type: Sub-task (was: Improvement) Parent: SPARK-18245 > Preserve order when

[jira] [Updated] (SPARK-17729) Enable creating hive bucketed tables

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17729?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-17729: Issue Type: Sub-task (was: Improvement) Parent: SPARK-18245 > Enable creating hive

[jira] [Updated] (SPARK-15867) Use bucket files for TABLESAMPLE BUCKET

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15867?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-15867: Summary: Use bucket files for TABLESAMPLE BUCKET (was: TABLESAMPLE BUCKET semantics don't match

[jira] [Updated] (SPARK-18245) Improving support for bucketed table

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-18245: Target Version/s: 2.2.0 > Improving support for bucketed table >

[jira] [Created] (SPARK-18245) Improving support for bucketed table

2016-11-03 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-18245: --- Summary: Improving support for bucketed table Key: SPARK-18245 URL: https://issues.apache.org/jira/browse/SPARK-18245 Project: Spark Issue Type: New Feature

[jira] [Updated] (SPARK-18245) Improving support for bucketed table

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-18245: Description: This is an umbrella ticket for improving various execution planning for bucketed

[jira] [Closed] (SPARK-12752) Can Thrift Server connect to Hive Metastore?

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12752?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin closed SPARK-12752. --- Resolution: Not A Problem > Can Thrift Server connect to Hive Metastore? >

[jira] [Updated] (SPARK-18244) Rename partitionProviderIsHive -> tracksPartitionsInCatalog

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18244?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-18244: Summary: Rename partitionProviderIsHive -> tracksPartitionsInCatalog (was:

[jira] [Created] (SPARK-18244) partitionProviderIsHive should be called tracksPartitionsInCatalog

2016-11-03 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-18244: --- Summary: partitionProviderIsHive should be called tracksPartitionsInCatalog Key: SPARK-18244 URL: https://issues.apache.org/jira/browse/SPARK-18244 Project: Spark

[jira] [Created] (SPARK-18243) Converge the insert path of Hive tables with data source tables

2016-11-03 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-18243: --- Summary: Converge the insert path of Hive tables with data source tables Key: SPARK-18243 URL: https://issues.apache.org/jira/browse/SPARK-18243 Project: Spark

[jira] [Updated] (SPARK-17861) Store data source partitions in metastore and push partition pruning into metastore

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17861?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-17861: Assignee: Eric Liang > Store data source partitions in metastore and push partition pruning into

[jira] [Updated] (SPARK-18209) More robust view canonicalization without full SQL expansion

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18209?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-18209: Description: Spark SQL currently stores views by analyzing the provided SQL and then generating

[jira] [Updated] (SPARK-18209) More robust view canonicalization without full SQL expansion

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18209?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-18209: Description: Spark SQL currently stores views by analyzing the provided SQL and then generating

[jira] [Commented] (SPARK-18209) More robust view canonicalization without full SQL expansion

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18209?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15631946#comment-15631946 ] Reynold Xin commented on SPARK-18209: - No they shouldn't be allowed. I added that to SPARK-18217. >

[jira] [Commented] (SPARK-16475) Broadcast Hint for SQL Queries

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16475?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15631940#comment-15631940 ] Reynold Xin commented on SPARK-16475: - As discussed in SPARK-18209, we will merge this once

[jira] [Resolved] (SPARK-18200) GraphX Invalid initial capacity when running triangleCount

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-18200. - Resolution: Fixed Assignee: Dongjoon Hyun Fix Version/s: 2.1.0

[jira] [Commented] (SPARK-15507) ClassCastException: SomeCaseClass cannot be cast to org.apache.spark.sql.Row

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15507?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15631810#comment-15631810 ] Reynold Xin commented on SPARK-15507: - That's expected isn't that? > ClassCastException:

[jira] [Commented] (SPARK-18086) Regression: Hive variables no longer work in Spark 2.0

2016-11-02 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15630855#comment-15630855 ] Reynold Xin commented on SPARK-18086: - Because the execution code no longer depends on Hive's

[jira] [Updated] (SPARK-18024) Introduce an internal commit protocol API along with OutputCommitter implementation

2016-11-02 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18024?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-18024: Summary: Introduce an internal commit protocol API along with OutputCommitter implementation

[jira] [Updated] (SPARK-18024) Introduce a commit protocol API along with OutputCommitter implementation

2016-11-02 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18024?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-18024: Description: This commit protocol API should wrap around Hadoop's output committer. Later we can

[jira] [Updated] (SPARK-18200) GraphX Invalid initial capacity when running triangleCount

2016-11-02 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-18200: Target Version/s: 2.0.3, 2.1.0 > GraphX Invalid initial capacity when running triangleCount >

[jira] [Resolved] (SPARK-18214) Simplify RuntimeReplaceable type coercion

2016-11-02 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18214?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-18214. - Resolution: Fixed Fix Version/s: 2.1.0 > Simplify RuntimeReplaceable type coercion >

[jira] [Updated] (SPARK-18234) Update mode in structured streaming

2016-11-02 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-18234: Summary: Update mode in structured streaming (was: Update mode) > Update mode in structured

[jira] [Commented] (SPARK-18086) Regression: Hive variables no longer work in Spark 2.0

2016-11-02 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15630718#comment-15630718 ] Reynold Xin commented on SPARK-18086: - The thing is that we don't really propagate Hive's session

[jira] [Updated] (SPARK-11879) Checkpoint support for DataFrame/Dataset

2016-11-02 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11879?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-11879: Fix Version/s: 2.1.0 > Checkpoint support for DataFrame/Dataset >

[jira] [Updated] (SPARK-11879) Checkpoint support for DataFrame/Dataset

2016-11-02 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11879?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-11879: Assignee: Cheng Lian > Checkpoint support for DataFrame/Dataset >

<    3   4   5   6   7   8   9   10   11   12   >