GitHub user watermen opened a pull request:

    https://github.com/apache/spark/pull/4789

    [SPARK-6040][SQL] Fix the percent bug in tablesample

    A HiveQL expression such as `select count(1) from src tablesample(1 percent);` 
should sample 1% of the rows. In the current version of Spark, however, it 
samples 100%.
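
For illustration only, here is a minimal Python sketch of the arithmetic that the first commit message below describes: check that the percentage is in range, then divide it by 100 to get a sampling fraction. The function name `percent_to_fraction` is hypothetical, the inclusive 0 to 100 range is an assumption, and this is not the Spark code from the patch.

```python
def percent_to_fraction(percent: float) -> float:
    """Convert a TABLESAMPLE percentage (0-100) into a sampling fraction (0.0-1.0).

    Hypothetical helper for illustration; not part of Spark or Hive.
    """
    # Assumption: the valid range is inclusive on both ends.
    if not 0.0 <= percent <= 100.0:
        raise ValueError(f"Sampling percentage must be in [0, 100], got {percent}")
    # The step the bug report says was missing: 1 percent means 0.01, not 1.0.
    return percent / 100.0


fraction = percent_to_fraction(1)
print(fraction)  # 0.01
```

Without the division, a sampler that expects a fraction would read `1` as 1.0, which would explain the 100% behaviour reported above.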

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/watermen/spark SPARK-6040

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/4789.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #4789
    
----
commit 9f0f804977ba16d868c6ff6f827d0cf02c374e4e
Author: q00251598 <[email protected]>
Date:   2015-02-26T07:34:41Z

    add judging the interval individually and divide fraction by 100

commit 4559fa05ae95a5723e4c2ee0aed33fb4ee10cb29
Author: q00251598 <[email protected]>
Date:   2015-02-26T12:23:12Z

    add test suite

commit 92cbc4a33731f65617948faa9a79d4f0a28055ed
Author: q00251598 <[email protected]>
Date:   2015-02-26T12:39:13Z

    delete 's' in the end

----

