[jira] [Created] (SPARK-2355) Check for the number of clusters to avoid ArrayIndexOutOfBoundsException

2014-07-03 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-2355: -- Summary: Check for the number of clusters to avoid ArrayIndexOutOfBoundsException Key: SPARK-2355 URL: https://issues.apache.org/jira/browse/SPARK-2355 Project:

[jira] [Created] (SPARK-2490) StackOverflowError when RDD dependencies are too long

2014-07-15 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-2490: -- Summary: StackOverflowError when RDD dependencies are too long Key: SPARK-2490 URL: https://issues.apache.org/jira/browse/SPARK-2490 Project: Spark

[jira] [Created] (SPARK-3083) Unnecessarily removing sendingConnection twice when closing connections

2014-08-16 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-3083: -- Summary: Unnecessarily removing sendingConnection twice when closing connections Key: SPARK-3083 URL: https://issues.apache.org/jira/browse/SPARK-3083 Project:

[jira] [Created] (SPARK-3252) Add missing condition in one SQL test

2014-08-27 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-3252: -- Summary: Add missing condition in one SQL test Key: SPARK-3252 URL: https://issues.apache.org/jira/browse/SPARK-3252 Project: Spark Issue Type: Test

[jira] [Created] (SPARK-3300) The old buffer in ColumnBuilder should clear after copying its content

2014-08-29 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-3300: -- Summary: The old buffer in ColumnBuilder should clear after copying its content Key: SPARK-3300 URL: https://issues.apache.org/jira/browse/SPARK-3300 Project:

[jira] [Updated] (SPARK-3300) No need to call clear in ensureFreeSpace and shorten build() in ColumnBuilder

2014-08-29 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3300?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh updated SPARK-3300: --- Description: The function ensureFreeSpace in object ColumnBuilder doesn't need to call

[jira] [Updated] (SPARK-3300) No need to call clear() in ensureFreeSpace and shorten build() in ColumnBuilder

2014-08-29 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3300?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh updated SPARK-3300: --- Summary: No need to call clear() in ensureFreeSpace and shorten build() in ColumnBuilder

[jira] [Created] (SPARK-3310) Directly use currentTable without unnecessary implicit conversion

2014-08-29 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-3310: -- Summary: Directly use currentTable without unnecessary implicit conversion Key: SPARK-3310 URL: https://issues.apache.org/jira/browse/SPARK-3310 Project: Spark

[jira] [Created] (SPARK-3327) Make broadcasted value mutable for caching useful information

2014-08-30 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-3327: -- Summary: Make broadcasted value mutable for caching useful information Key: SPARK-3327 URL: https://issues.apache.org/jira/browse/SPARK-3327 Project: Spark

[jira] [Created] (SPARK-3345) Do correct parameters for ShuffleFileGroup

2014-09-02 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-3345: -- Summary: Do correct parameters for ShuffleFileGroup Key: SPARK-3345 URL: https://issues.apache.org/jira/browse/SPARK-3345 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-3801) Make app dir cleanup more efficient

2014-10-05 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-3801: -- Summary: Make app dir cleanup more efficient Key: SPARK-3801 URL: https://issues.apache.org/jira/browse/SPARK-3801 Project: Spark Issue Type:

[jira] [Closed] (SPARK-3801) Make app dir cleanup more efficient

2014-10-05 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3801?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh closed SPARK-3801. -- Resolution: Duplicate Make app dir cleanup more efficient ---

[jira] [Commented] (SPARK-3801) Make app dir cleanup more efficient

2014-10-05 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3801?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14159561#comment-14159561 ] Liang-Chi Hsieh commented on SPARK-3801: I think so. Thanks. Make app dir

[jira] [Created] (SPARK-3925) Considering the ordering of qualifiers when comparison

2014-10-13 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-3925: -- Summary: Considering the ordering of qualifiers when comparison Key: SPARK-3925 URL: https://issues.apache.org/jira/browse/SPARK-3925 Project: Spark

[jira] [Updated] (SPARK-3925) Considering the ordering of qualifiers during comparison

2014-10-13 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3925?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh updated SPARK-3925: --- Summary: Considering the ordering of qualifiers during comparison (was: Considering the

[jira] [Updated] (SPARK-3925) Do not consider the ordering of qualifiers during comparison

2014-10-13 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3925?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh updated SPARK-3925: --- Description: The qualifiers orderings should not be considered during the comparison between

[jira] [Updated] (SPARK-3925) Do not considering the ordering of qualifiers during comparison

2014-10-13 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3925?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh updated SPARK-3925: --- Summary: Do not considering the ordering of qualifiers during comparison (was: Considering

[jira] [Updated] (SPARK-3925) Do not consider the ordering of qualifiers during comparison

2014-10-13 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3925?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh updated SPARK-3925: --- Summary: Do not consider the ordering of qualifiers during comparison (was: Do not

[jira] [Created] (SPARK-3970) Remove duplicate removal of local dirs

2014-10-16 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-3970: -- Summary: Remove duplicate removal of local dirs Key: SPARK-3970 URL: https://issues.apache.org/jira/browse/SPARK-3970 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-4358) Parsing NumericLit with more specified types

2014-11-11 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-4358: -- Summary: Parsing NumericLit with more specified types Key: SPARK-4358 URL: https://issues.apache.org/jira/browse/SPARK-4358 Project: Spark Issue Type:

[jira] [Created] (SPARK-4382) Add locations parameter to Twitter Stream

2014-11-13 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-4382: -- Summary: Add locations parameter to Twitter Stream Key: SPARK-4382 URL: https://issues.apache.org/jira/browse/SPARK-4382 Project: Spark Issue Type:

[jira] [Created] (SPARK-4597) Use proper exception and reset variable

2014-11-25 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-4597: -- Summary: Use proper exception and reset variable Key: SPARK-4597 URL: https://issues.apache.org/jira/browse/SPARK-4597 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-4674) Refactor getCallSite in Utils

2014-12-01 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-4674: -- Summary: Refactor getCallSite in Utils Key: SPARK-4674 URL: https://issues.apache.org/jira/browse/SPARK-4674 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-4741) Do not destroy and re-create FileInputStream

2014-12-04 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-4741: -- Summary: Do not destroy and re-create FileInputStream Key: SPARK-4741 URL: https://issues.apache.org/jira/browse/SPARK-4741 Project: Spark Issue Type:

[jira] [Created] (SPARK-4797) Replace breezeSquaredDistance

2014-12-09 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-4797: -- Summary: Replace breezeSquaredDistance Key: SPARK-4797 URL: https://issues.apache.org/jira/browse/SPARK-4797 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-4797) Replace breezeSquaredDistance

2014-12-09 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4797?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh updated SPARK-4797: --- Description: This PR replaces slow breezeSquaredDistance. A simple calculation involving

[jira] [Created] (SPARK-4913) Fix incorrect event log path

2014-12-21 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-4913: -- Summary: Fix incorrect event log path Key: SPARK-4913 URL: https://issues.apache.org/jira/browse/SPARK-4913 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-5050) Add unit test for sqdist

2015-01-01 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-5050: -- Summary: Add unit test for sqdist Key: SPARK-5050 URL: https://issues.apache.org/jira/browse/SPARK-5050 Project: Spark Issue Type: Test

[jira] [Created] (SPARK-5664) Restore stty settings when exiting for launching spark-shell from SBT

2015-02-07 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-5664: -- Summary: Restore stty settings when exiting for launching spark-shell from SBT Key: SPARK-5664 URL: https://issues.apache.org/jira/browse/SPARK-5664 Project:

[jira] [Created] (SPARK-5615) Fix testPackage in StreamingContextSuite

2015-02-05 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-5615: -- Summary: Fix testPackage in StreamingContextSuite Key: SPARK-5615 URL: https://issues.apache.org/jira/browse/SPARK-5615 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-5652) Use broadcasted weights and fix compilation error

2015-02-06 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-5652: -- Summary: Use broadcasted weights and fix compilation error Key: SPARK-5652 URL: https://issues.apache.org/jira/browse/SPARK-5652 Project: Spark Issue

[jira] [Updated] (SPARK-5652) Use broadcasted weights in LogisticRegressionModel and fix compilation error

2015-02-06 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5652?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh updated SPARK-5652: --- Summary: Use broadcasted weights in LogisticRegressionModel and fix compilation error (was:

[jira] [Created] (SPARK-5681) Calling graceful stop() immediately after start() on StreamingContext should not get stuck indefinitely

2015-02-08 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-5681: -- Summary: Calling graceful stop() immediately after start() on StreamingContext should not get stuck indefinitely Key: SPARK-5681 URL:

[jira] [Created] (SPARK-5332) Efficient way to deal with ExecutorLost

2015-01-20 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-5332: -- Summary: Efficient way to deal with ExecutorLost Key: SPARK-5332 URL: https://issues.apache.org/jira/browse/SPARK-5332 Project: Spark Issue Type:

[jira] [Created] (SPARK-5365) Refactor KMeans to reduce redundant data

2015-01-22 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-5365: -- Summary: Refactor KMeans to reduce redundant data Key: SPARK-5365 URL: https://issues.apache.org/jira/browse/SPARK-5365 Project: Spark Issue Type:

[jira] [Updated] (SPARK-5212) Add support of schema-less transformation

2015-01-19 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5212?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh updated SPARK-5212: --- Description: This pr adds the support of schema-less syntax, custom field delimiter and SerDe

[jira] [Updated] (SPARK-5212) Add support of schema-less, custom field delimiter and SerDe for HiveQL transform

2015-01-19 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5212?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh updated SPARK-5212: --- Summary: Add support of schema-less, custom field delimiter and SerDe for HiveQL transform

[jira] [Created] (SPARK-5793) Add explode to Column

2015-02-12 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-5793: -- Summary: Add explode to Column Key: SPARK-5793 URL: https://issues.apache.org/jira/browse/SPARK-5793 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-5862) Only transformUp the given plan once in HiveMetastoreCatalog

2015-02-17 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-5862: -- Summary: Only transformUp the given plan once in HiveMetastoreCatalog Key: SPARK-5862 URL: https://issues.apache.org/jira/browse/SPARK-5862 Project: Spark

[jira] [Created] (SPARK-5907) Selected column from DataFrame should not re-analyze logical plan

2015-02-19 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-5907: -- Summary: Selected column from DataFrame should not re-analyze logical plan Key: SPARK-5907 URL: https://issues.apache.org/jira/browse/SPARK-5907 Project: Spark

[jira] [Created] (SPARK-5908) Hive udtf with single alias should be resolved correctly

2015-02-19 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-5908: -- Summary: Hive udtf with single alias should be resolved correctly Key: SPARK-5908 URL: https://issues.apache.org/jira/browse/SPARK-5908 Project: Spark

[jira] [Closed] (SPARK-5907) Selected column from DataFrame should not re-analyze logical plan

2015-02-19 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5907?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh closed SPARK-5907. -- Resolution: Duplicate Selected column from DataFrame should not re-analyze logical plan

[jira] [Commented] (SPARK-5832) Add Affinity Propagation clustering algorithm

2015-02-17 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5832?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14323983#comment-14323983 ] Liang-Chi Hsieh commented on SPARK-5832: I would like to add the Affinity

[jira] [Updated] (SPARK-5512) Run the PIC algorithm with degree vector

2015-02-01 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5512?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh updated SPARK-5512: --- Priority: Minor (was: Major) Run the PIC algorithm with degree vector

[jira] [Created] (SPARK-5512) Run the PIC algorithm with degree vector

2015-02-01 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-5512: -- Summary: Run the PIC algorithm with degree vector Key: SPARK-5512 URL: https://issues.apache.org/jira/browse/SPARK-5512 Project: Spark Issue Type:

[jira] [Created] (SPARK-5429) Can't generate Hive golden answer on Hive 0.13.1

2015-01-27 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-5429: -- Summary: Can't generate Hive golden answer on Hive 0.13.1 Key: SPARK-5429 URL: https://issues.apache.org/jira/browse/SPARK-5429 Project: Spark Issue

[jira] [Created] (SPARK-5099) Simplify logistic loss function and fix deviance loss function

2015-01-06 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-5099: -- Summary: Simplify logistic loss function and fix deviance loss function Key: SPARK-5099 URL: https://issues.apache.org/jira/browse/SPARK-5099 Project: Spark

[jira] [Updated] (SPARK-5099) Simplify logistic loss function

2015-01-06 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh updated SPARK-5099: --- Description: This is a minor pr where I think that we can simply take minus of margin,

[jira] [Updated] (SPARK-5099) Simplify logistic loss function

2015-01-06 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh updated SPARK-5099: --- Summary: Simplify logistic loss function (was: Simplify logistic loss function and fix

[jira] [Updated] (SPARK-5099) Simplify logistic loss function

2015-01-06 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh updated SPARK-5099: --- Issue Type: Improvement (was: Bug) Simplify logistic loss function

[jira] [Created] (SPARK-5212) Add support of schema-less transformation

2015-01-12 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-5212: -- Summary: Add support of schema-less transformation Key: SPARK-5212 URL: https://issues.apache.org/jira/browse/SPARK-5212 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-5212) Add support of schema-less transformation

2015-01-12 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5212?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh updated SPARK-5212: --- Issue Type: Improvement (was: Bug) Add support of schema-less transformation

[jira] [Created] (SPARK-5714) Refactor initial step of LDA to remove redundant operations

2015-02-10 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-5714: -- Summary: Refactor initial step of LDA to remove redundant operations Key: SPARK-5714 URL: https://issues.apache.org/jira/browse/SPARK-5714 Project: Spark

[jira] [Created] (SPARK-5799) Compute aggregation function on specified numeric columns

2015-02-13 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-5799: -- Summary: Compute aggregation function on specified numeric columns Key: SPARK-5799 URL: https://issues.apache.org/jira/browse/SPARK-5799 Project: Spark

[jira] [Created] (SPARK-5773) Further optimize sparse syr

2015-02-12 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-5773: -- Summary: Further optimize sparse syr Key: SPARK-5773 URL: https://issues.apache.org/jira/browse/SPARK-5773 Project: Spark Issue Type: Improvement

[jira] [Closed] (SPARK-5773) Further optimize sparse syr

2015-02-12 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5773?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh closed SPARK-5773. -- Resolution: Not a Problem Further optimize sparse syr ---

[jira] [Created] (SPARK-5832) Add Affinity Propagation clustering algorithm

2015-02-16 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-5832: -- Summary: Add Affinity Propagation clustering algorithm Key: SPARK-5832 URL: https://issues.apache.org/jira/browse/SPARK-5832 Project: Spark Issue Type:

[jira] [Created] (SPARK-5145) Add BLAS.dsyr and use it in GaussianMixtureEM

2015-01-08 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-5145: -- Summary: Add BLAS.dsyr and use it in GaussianMixtureEM Key: SPARK-5145 URL: https://issues.apache.org/jira/browse/SPARK-5145 Project: Spark Issue Type:

[jira] [Commented] (SPARK-5832) Add Affinity Propagation clustering algorithm

2015-02-18 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5832?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14326951#comment-14326951 ] Liang-Chi Hsieh commented on SPARK-5832: For clustering algorithm, O(N^2)

[jira] [Created] (SPARK-6354) Replace the plan which is part of cached query

2015-03-16 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-6354: -- Summary: Replace the plan which is part of cached query Key: SPARK-6354 URL: https://issues.apache.org/jira/browse/SPARK-6354 Project: Spark Issue Type:

[jira] [Comment Edited] (SPARK-6354) Replace the plan which is part of cached query

2015-03-19 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6354?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14368649#comment-14368649 ] Liang-Chi Hsieh edited comment on SPARK-6354 at 3/19/15 7:31 AM:

[jira] [Comment Edited] (SPARK-6354) Replace the plan which is part of cached query

2015-03-19 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6354?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14368649#comment-14368649 ] Liang-Chi Hsieh edited comment on SPARK-6354 at 3/19/15 7:28 AM:

[jira] [Commented] (SPARK-6354) Replace the plan which is part of cached query

2015-03-19 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6354?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14368649#comment-14368649 ] Liang-Chi Hsieh commented on SPARK-6354: h2. Introduction Currently we use the

[jira] [Comment Edited] (SPARK-6354) Replace the plan which is part of cached query

2015-03-20 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6354?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14368649#comment-14368649 ] Liang-Chi Hsieh edited comment on SPARK-6354 at 3/20/15 5:47 PM:

[jira] [Comment Edited] (SPARK-6354) Replace the plan which is part of cached query

2015-03-20 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6354?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14368649#comment-14368649 ] Liang-Chi Hsieh edited comment on SPARK-6354 at 3/20/15 4:58 PM:

[jira] [Comment Edited] (SPARK-6354) Replace the plan which is part of cached query

2015-03-20 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6354?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14368649#comment-14368649 ] Liang-Chi Hsieh edited comment on SPARK-6354 at 3/20/15 4:59 PM:

[jira] [Comment Edited] (SPARK-6354) Replace the plan which is part of cached query

2015-03-20 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6354?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14368649#comment-14368649 ] Liang-Chi Hsieh edited comment on SPARK-6354 at 3/20/15 5:05 PM:

[jira] [Closed] (SPARK-6224) Also collect NamedExpressions in PhysicalOperation

2015-03-09 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6224?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh closed SPARK-6224. -- Resolution: Not a Problem Also collect NamedExpressions in PhysicalOperation

[jira] [Created] (SPARK-6224) Also collect NamedExpressions in PhysicalOperation

2015-03-09 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-6224: -- Summary: Also collect NamedExpressions in PhysicalOperation Key: SPARK-6224 URL: https://issues.apache.org/jira/browse/SPARK-6224 Project: Spark Issue

[jira] [Created] (SPARK-6302) GeneratedAggregate uses wrong schema on updateProjection

2015-03-12 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-6302: -- Summary: GeneratedAggregate uses wrong schema on updateProjection Key: SPARK-6302 URL: https://issues.apache.org/jira/browse/SPARK-6302 Project: Spark

[jira] [Created] (SPARK-6204) GenerateProjection's equals should check length equality

2015-03-06 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-6204: -- Summary: GenerateProjection's equals should check length equality Key: SPARK-6204 URL: https://issues.apache.org/jira/browse/SPARK-6204 Project: Spark

[jira] [Closed] (SPARK-6159) Distinguish between inprogress and abnormal event log history

2015-03-06 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh closed SPARK-6159. -- Resolution: Won't Fix Distinguish between inprogress and abnormal event log history

[jira] [Created] (SPARK-6215) Shorten apply and update funcs in GenerateProjection

2015-03-08 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-6215: -- Summary: Shorten apply and update funcs in GenerateProjection Key: SPARK-6215 URL: https://issues.apache.org/jira/browse/SPARK-6215 Project: Spark Issue

[jira] [Updated] (SPARK-6326) Improve castStruct to be faster

2015-03-13 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6326?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh updated SPARK-6326: --- Summary: Improve castStruct to be faster (was: Make castStruct faster) Improve castStruct

[jira] [Created] (SPARK-6326) Make castStruct faster

2015-03-13 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-6326: -- Summary: Make castStruct faster Key: SPARK-6326 URL: https://issues.apache.org/jira/browse/SPARK-6326 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-6322) CTAS should consider the case where no file format or storage handler is given

2015-03-13 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-6322: -- Summary: CTAS should consider the case where no file format or storage handler is given Key: SPARK-6322 URL: https://issues.apache.org/jira/browse/SPARK-6322

[jira] [Created] (SPARK-6303) Average should be in canBeCodeGened list

2015-03-12 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-6303: -- Summary: Average should be in canBeCodeGened list Key: SPARK-6303 URL: https://issues.apache.org/jira/browse/SPARK-6303 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-6550) Add PreAnalyzer to keep logical plan consistent across DataFrame

2015-03-26 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6550?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh updated SPARK-6550: --- Description: h2. Problems In some cases, the expressions in a logical plan will be modified

[jira] [Updated] (SPARK-6550) Add PreAnalyzer to keep logical plan consistent across DataFrame

2015-03-26 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6550?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh updated SPARK-6550: --- Description: h2. Problems In some cases, the expressions in a logical plan will be modified

[jira] [Created] (SPARK-6550) Add PreAnalyzer to keep logical plan consistent across DataFrame

2015-03-26 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-6550: -- Summary: Add PreAnalyzer to keep logical plan consistent across DataFrame Key: SPARK-6550 URL: https://issues.apache.org/jira/browse/SPARK-6550 Project: Spark

[jira] [Closed] (SPARK-6586) Add the capability of retrieving original logical plan of DataFrame

2015-03-29 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6586?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh closed SPARK-6586. -- Resolution: Not a Problem Add the capability of retrieving original logical plan of DataFrame

[jira] [Commented] (SPARK-6586) Add the capability of retrieving original logical plan of DataFrame

2015-03-29 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6586?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386209#comment-14386209 ] Liang-Chi Hsieh commented on SPARK-6586: I am no problem with your opinion. But If

[jira] [Commented] (SPARK-6586) Add the capability of retrieving original logical plan of DataFrame

2015-03-29 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6586?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386068#comment-14386068 ] Liang-Chi Hsieh commented on SPARK-6586: Even just for debuging purpose, I think

[jira] [Created] (SPARK-6607) Aggregation attribute name including special chars '(' and ')' should be replaced before generating Parquet schema

2015-03-30 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-6607: -- Summary: Aggregation attribute name including special chars '(' and ')' should be replaced before generating Parquet schema Key: SPARK-6607 URL:

[jira] [Created] (SPARK-6647) Make trait StringComparison as BinaryPredicate and throw error when Predicate can't translate to data source Filter

2015-04-01 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-6647: -- Summary: Make trait StringComparison as BinaryPredicate and throw error when Predicate can't translate to data source Filter Key: SPARK-6647 URL:

[jira] [Created] (SPARK-6633) Should be Contains instead of EndsWith when constructing sources.StringContains

2015-03-31 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-6633: -- Summary: Should be Contains instead of EndsWith when constructing sources.StringContains Key: SPARK-6633 URL: https://issues.apache.org/jira/browse/SPARK-6633

[jira] [Created] (SPARK-6586) Add the capability of retrieving original logical plan of DataFrame

2015-03-28 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-6586: -- Summary: Add the capability of retrieving original logical plan of DataFrame Key: SPARK-6586 URL: https://issues.apache.org/jira/browse/SPARK-6586 Project: Spark

[jira] [Commented] (SPARK-6586) Add the capability of retrieving original logical plan of DataFrame

2015-03-29 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6586?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386009#comment-14386009 ] Liang-Chi Hsieh commented on SPARK-6586: Not true. Because DataFrame now is given

[jira] [Reopened] (SPARK-6586) Add the capability of retrieving original logical plan of DataFrame

2015-03-29 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6586?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh reopened SPARK-6586: Add the capability of retrieving original logical plan of DataFrame

[jira] [Comment Edited] (SPARK-6586) Add the capability of retrieving original logical plan of DataFrame

2015-03-29 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6586?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386009#comment-14386009 ] Liang-Chi Hsieh edited comment on SPARK-6586 at 3/29/15 11:24 PM:

[jira] [Created] (SPARK-6134) Fix wrong datatype for casting FloatType and default LongType value in defaultPrimitive

2015-03-03 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-6134: -- Summary: Fix wrong datatype for casting FloatType and default LongType value in defaultPrimitive Key: SPARK-6134 URL: https://issues.apache.org/jira/browse/SPARK-6134

[jira] [Created] (SPARK-6184) Relocate logDebug to correct location in ResolveSortReferences

2015-03-05 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-6184: -- Summary: Relocate logDebug to correct location in ResolveSortReferences Key: SPARK-6184 URL: https://issues.apache.org/jira/browse/SPARK-6184 Project: Spark

[jira] [Created] (SPARK-6159) Distinguish between inprogress and abnormal event log history

2015-03-04 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-6159: -- Summary: Distinguish between inprogress and abnormal event log history Key: SPARK-6159 URL: https://issues.apache.org/jira/browse/SPARK-6159 Project: Spark

[jira] [Created] (SPARK-5950) Insert array into table saved as parquet should work when using datasource api

2015-02-23 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-5950: -- Summary: Insert array into table saved as parquet should work when using datasource api Key: SPARK-5950 URL: https://issues.apache.org/jira/browse/SPARK-5950

[jira] [Updated] (SPARK-5942) DataFrame should not do query optimization when dataFrameEagerAnalysis is off

2015-02-22 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5942?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh updated SPARK-5942: --- Summary: DataFrame should not do query optimization when dataFrameEagerAnalysis is off (was:

[jira] [Updated] (SPARK-5942) DataFrame should not do query optimization when dataFrameEagerAnalysis is off

2015-02-22 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5942?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh updated SPARK-5942: --- Description: DataFrame will force query optimization to happen right away for the commands

[jira] [Created] (SPARK-5942) DataFrame should not do query optimization when dataFrameEagerAnalysis is on

2015-02-22 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-5942: -- Summary: DataFrame should not do query optimization when dataFrameEagerAnalysis is on Key: SPARK-5942 URL: https://issues.apache.org/jira/browse/SPARK-5942

[jira] [Created] (SPARK-6041) Compute shortest path for graph with edge distances

2015-02-26 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-6041: -- Summary: Compute shortest path for graph with edge distances Key: SPARK-6041 URL: https://issues.apache.org/jira/browse/SPARK-6041 Project: Spark Issue

[jira] [Created] (SPARK-5938) Generate row from json efficiently

2015-02-21 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-5938: -- Summary: Generate row from json efficiently Key: SPARK-5938 URL: https://issues.apache.org/jira/browse/SPARK-5938 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-5832) Add Affinity Propagation clustering algorithm

2015-02-20 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5832?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14329284#comment-14329284 ] Liang-Chi Hsieh commented on SPARK-5832: The time complexity O(nnz * K) is just

  1   2   3   4   5   6   7   8   9   10   >