[
https://issues.apache.org/jira/browse/DRILL-5152?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15784490#comment-15784490
]
ASF GitHub Bot commented on DRILL-5152:
---------------------------------------
Github user cgivre commented on the issue:
https://github.com/apache/drill/pull/708
HI Paul,
Is the mock data source actually in Drill 1.9? I tried executing this
query and it threw errors.
Thx,
—C
> On Dec 27, 2016, at 21:54, Paul Rogers <[email protected]> wrote:
>
> Provides an enhanced version of the mock data source. See the JIRA entry
for motivation, package-info.java for details of operation.
>
> Allows tests to write queries of the form:
>
> select id_i, name_s50 from `mock`.`employee_1K` ...
> Where id_i is a field of random, uniformly distributed integers and
name_s50 is a VARCHAR column of width 50 of randomly generated strings. The _1K
suffix says to generate 1000 rows. The names are just for convenience, the
suffixes tell the mock data source what to generate.
>
> Examples of use will appear in a later commit that includes a revised
test framework. Existing tests that use the physical plan version of the mock
data source work as before.
>
> You can view, comment on, or merge this pull request online at:
>
> https://github.com/apache/drill/pull/708
<https://github.com/apache/drill/pull/708>
> Commit Summary
>
> DRILL-5152: Enhance the mock data source: better data, SQL access
> File Changes
>
> M
exec/java-exec/src/main/java/org/apache/drill/exec/store/AbstractRecordReader.java
<https://github.com/apache/drill/pull/708/files#diff-0> (2)
> M
exec/java-exec/src/main/java/org/apache/drill/exec/store/StoragePlugin.java
<https://github.com/apache/drill/pull/708/files#diff-1> (3)
> M
exec/java-exec/src/main/java/org/apache/drill/exec/store/StoragePluginRegistry.java
<https://github.com/apache/drill/pull/708/files#diff-2> (2)
> M
exec/java-exec/src/main/java/org/apache/drill/exec/store/StoragePluginRegistryImpl.java
<https://github.com/apache/drill/pull/708/files#diff-3> (24)
> A
exec/java-exec/src/main/java/org/apache/drill/exec/store/mock/ColumnDef.java
<https://github.com/apache/drill/pull/708/files#diff-4> (178)
> A
exec/java-exec/src/main/java/org/apache/drill/exec/store/mock/DateGen.java
<https://github.com/apache/drill/pull/708/files#diff-5> (53)
> A
exec/java-exec/src/main/java/org/apache/drill/exec/store/mock/DoubleGen.java
<https://github.com/apache/drill/pull/708/files#diff-6> (42)
> A
exec/java-exec/src/main/java/org/apache/drill/exec/store/mock/ExtendedMockRecordReader.java
<https://github.com/apache/drill/pull/708/files#diff-7> (149)
> A
exec/java-exec/src/main/java/org/apache/drill/exec/store/mock/FieldGen.java
<https://github.com/apache/drill/pull/708/files#diff-8> (37)
> A
exec/java-exec/src/main/java/org/apache/drill/exec/store/mock/IntGen.java
<https://github.com/apache/drill/pull/708/files#diff-9> (42)
> M
exec/java-exec/src/main/java/org/apache/drill/exec/store/mock/MockGroupScanPOP.java
<https://github.com/apache/drill/pull/708/files#diff-10> (127)
> M
exec/java-exec/src/main/java/org/apache/drill/exec/store/mock/MockRecordReader.java
<https://github.com/apache/drill/pull/708/files#diff-11> (8)
> M
exec/java-exec/src/main/java/org/apache/drill/exec/store/mock/MockScanBatchCreator.java
<https://github.com/apache/drill/pull/708/files#diff-12> (8)
> M
exec/java-exec/src/main/java/org/apache/drill/exec/store/mock/MockStorageEngine.java
<https://github.com/apache/drill/pull/708/files#diff-13> (79)
> M
exec/java-exec/src/main/java/org/apache/drill/exec/store/mock/MockStorageEngineConfig.java
<https://github.com/apache/drill/pull/708/files#diff-14> (9)
> M
exec/java-exec/src/main/java/org/apache/drill/exec/store/mock/MockStorePOP.java
<https://github.com/apache/drill/pull/708/files#diff-15> (3)
> M
exec/java-exec/src/main/java/org/apache/drill/exec/store/mock/MockSubScanPOP.java
<https://github.com/apache/drill/pull/708/files#diff-16> (20)
> A
exec/java-exec/src/main/java/org/apache/drill/exec/store/mock/MoneyGen.java
<https://github.com/apache/drill/pull/708/files#diff-17> (42)
> A
exec/java-exec/src/main/java/org/apache/drill/exec/store/mock/StringGen.java
<https://github.com/apache/drill/pull/708/files#diff-18> (49)
> A
exec/java-exec/src/main/java/org/apache/drill/exec/store/mock/package-info.java
<https://github.com/apache/drill/pull/708/files#diff-19> (130)
> M
exec/java-exec/src/test/java/org/apache/drill/exec/TestOpSerialization.java
<https://github.com/apache/drill/pull/708/files#diff-20> (2)
> M
exec/java-exec/src/test/java/org/apache/drill/exec/fn/interp/ExpressionInterpreterTest.java
<https://github.com/apache/drill/pull/708/files#diff-21> (4)
> Patch Links:
>
> https://github.com/apache/drill/pull/708.patch
<https://github.com/apache/drill/pull/708.patch>
> https://github.com/apache/drill/pull/708.diff
<https://github.com/apache/drill/pull/708.diff>
> —
> You are receiving this because you are subscribed to this thread.
> Reply to this email directly, view it on GitHub
<https://github.com/apache/drill/pull/708>, or mute the thread
<https://github.com/notifications/unsubscribe-auth/AFQfviitiJqMVi2vYWaYP7mAceYPFv7Gks5rMc91gaJpZM4LWo0d>.
>
> Enhance the mock data source: better data, SQL access
> -----------------------------------------------------
>
> Key: DRILL-5152
> URL: https://issues.apache.org/jira/browse/DRILL-5152
> Project: Apache Drill
> Issue Type: Improvement
> Components: Tools, Build & Test
> Affects Versions: 1.9.0
> Reporter: Paul Rogers
> Assignee: Paul Rogers
> Priority: Minor
>
> Drill provides a mock data storage engine that generates random data. The
> mock engine is used in some older unit tests that need a volume of data, but
> that are not too particular about the details of the data.
> The mock data source continues to have use even for modern tests. For
> example, the work in the external storage batch requires tests with varying
> amounts of data, but the exact form of the data is not important, just the
> quantity. For example, if we want to ensure that spilling happens at various
> trigger points, we need to read the right amount of data for that trigger.
> The existing mock data source has two limitations:
> 1. It generates only "black/white" (alternating) values, which is awkward for
> use in sorting.
> 2. The mock generator is accessible only from a physical plan, but not from
> SQL queries.
> This enhancement proposes to fix both limitations:
> 1. Generate a uniform, randomly distributed set of values.
> 2. Provide an encoding that lets a SQL query specify the data to be generated.
> Example SQL query:
> {code}
> SELECT id_i, name_s50 FROM `mock`.employee_10K;
> {code}
> The above says to generate two fields: INTEGER (the "_i" suffix) and
> VARCHAR(50) (the "_s50") suffix; and to generate 10,000 rows (the "_10K"
> suffix on the table.)
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)