[
https://issues.apache.org/jira/browse/BEAM-5969?focusedWorklogId=167417&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-167417
]
ASF GitHub Bot logged work on BEAM-5969:
----------------------------------------
Author: ASF GitHub Bot
Created on: 19/Nov/18 15:20
Start Date: 19/Nov/18 15:20
Worklog Time Spent: 10m
Work Description: echauchot closed pull request #6957: [BEAM-5969] Add
BOUNDED_SIDE_INPUT_JOIN new Query to Beam website
URL: https://github.com/apache/beam/pull/6957
This is a PR merged from a forked repository.
As GitHub hides the original diff on merge, it is displayed below for
the sake of provenance:
As this is a foreign pull request (from a fork), the diff is supplied
below (as it won't show otherwise due to GitHub magic):
diff --git a/website/src/documentation/sdks/nexmark.md
b/website/src/documentation/sdks/nexmark.md
index bcba7279ef1..9c2dc8a5bbb 100644
--- a/website/src/documentation/sdks/nexmark.md
+++ b/website/src/documentation/sdks/nexmark.md
@@ -37,44 +37,46 @@ auction system:
The queries exercise many aspects of Beam model:
-* **Query1**: What are the bid values in Euro's?
+* **Query1** or **CURRENCY_CONVERSION**: What are the bid values in Euro's?
Illustrates a simple map.
-* **Query2**: What are the auctions with particular auction numbers?
+* **Query2** or **SELECTION**: What are the auctions with particular auction
numbers?
Illustrates a simple filter.
-* **Query3**: Who is selling in particular US states?
+* **Query3** or **LOCAL_ITEM_SUGGESTION**: Who is selling in particular US
states?
Illustrates an incremental join (using per-key state and timer) and filter.
-* **Query4**: What is the average selling price for each auction
+* **Query4** or **AVERAGE_PRICE_FOR_CATEGORY**: What is the average selling
price for each auction
category?
Illustrates complex join (using custom window functions) and
aggregation.
-* **Query5**: Which auctions have seen the most bids in the last period?
+* **Query5** or **HOT_ITEMS**: Which auctions have seen the most bids in the
last period?
Illustrates sliding windows and combiners.
-* **Query6**: What is the average selling price per seller for their
+* **Query6** or **AVERAGE_SELLING_PRICE_BY_SELLER**: What is the average
selling price per seller for their
last 10 closed auctions.
Shares the same 'winning bids' core as for **Query4**, and
illustrates a specialized combiner.
-* **Query7**: What are the highest bids per period?
+* **Query7** or **HIGHEST_BID**: What are the highest bids per period?
Deliberately implemented using a side input to illustrate fanout.
-* **Query8**: Who has entered the system and created an auction in
+* **Query8** or **MONITOR_NEW_USERS**: Who has entered the system and created
an auction in
the last period?
Illustrates a simple join.
We have augmented the original queries with five more:
-* **Query0**: Pass-through.
+* **Query0** or **PASSTHROUGH**: Pass-through.
Allows us to measure the monitoring overhead.
-* **Query9**: Winning-bids.
+* **Query9** or **WINNING_BIDS**: Winning-bids.
A common sub-query shared by **Query4** and **Query6**.
-* **Query10**: Log all events to GCS files.
+* **Query10** or **LOG_TO_SHARDED_FILES**: Log all events to GCS files.
Illustrates windows with large side effects on firing.
-* **Query11**: How many bids did a user make in each session they
+* **Query11** or **USER_SESSIONS**: How many bids did a user make in each
session they
were active?
Illustrates session windows.
-* **Query12**: How many bids does a user make within a fixed
+* **Query12** or **PROCESSING_TIME_WINDOWS**: How many bids does a user make
within a fixed
processing time limit?
Illustrates working in processing time in the Global window, as
compared with event time in non-Global windows for all the other
queries.
+* **BOUNDED_SIDE_INPUT_JOIN**: Joins a stream to a bounded side input,
modeling basic stream enrichment.
+
## Benchmark workload configuration
@@ -171,10 +173,16 @@ Number of events generators:
--numEventGenerators=4
-Run query N:
+Queries can be run by their name or by their number (number is still there for
backward compatibility, only the queries 0 to 12 have a number)
+
+Run query **N**:
--query=N
+Run query called **PASSTROUGH**:
+
+ --query=PASSTHROUGH
+
### Available Suites
The suite to run can be chosen using this configuration parameter:
@@ -182,7 +190,7 @@ The suite to run can be chosen using this configuration
parameter:
Available suites are:
* DEFAULT: Test default configuration with query 0.
-* SMOKE: Run the 12 default configurations.
+ * SMOKE: Run all the queries with the default configuration.
* STRESS: Like smoke but for 1m events.
* FULL_THROTTLE: Like SMOKE but 100m events.
@@ -347,6 +355,13 @@ These tables contain statuses of the queries runs in the
different runners. Goog
<td>ok</td>
<td>ok</td>
</tr>
+ <tr>
+ <td>BOUNDED_SIDE_INPUT_JOIN</td>
+ <td>ok</td>
+ <td>ok</td>
+ <td>ok</td>
+ <td>ok</td>
+ </tr>
</table>
### Streaming / Synthetic / Local
@@ -450,6 +465,13 @@ These tables contain statuses of the queries runs in the
different runners. Goog
<td>ok</td>
<td>ok</td>
</tr>
+ <tr>
+ <td>BOUNDED_SIDE_INPUT_JOIN</td>
+ <td>ok</td>
+ <td><a
href="https://issues.apache.org/jira/browse/BEAM-2112">BEAM-2112</a></td>
+ <td>ok</td>
+ <td>ok</td>
+ </tr>
</table>
### Batch / Synthetic / Cluster
@@ -679,7 +701,7 @@ There are dashboards for these runners (others to come):
Each dashboard contains:
- graphs in batch mode
- graphs in streaming mode
-- graphs for the 13 queries.
+- graphs for all the queries.
### Performance dashboards links
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]
Issue Time Tracking
-------------------
Worklog Id: (was: 167417)
Time Spent: 2h 20m (was: 2h 10m)
> Update website with the new query
> ---------------------------------
>
> Key: BEAM-5969
> URL: https://issues.apache.org/jira/browse/BEAM-5969
> Project: Beam
> Issue Type: Sub-task
> Components: examples-nexmark
> Reporter: Etienne Chauchot
> Assignee: Etienne Chauchot
> Priority: Major
> Time Spent: 2h 20m
> Remaining Estimate: 0h
>
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)