willbarrett commented on a change in pull request #8459: [docs] User tutorial
URL: https://github.com/apache/incubator-superset/pull/8459#discussion_r342817872
 
 

 ##########
 File path: docs/usertutorial.rst
 ##########
 @@ -0,0 +1,455 @@
+..  Licensed to the Apache Software Foundation (ASF) under one
+    or more contributor license agreements.  See the NOTICE file
+    distributed with this work for additional information
+    regarding copyright ownership.  The ASF licenses this file
+    to you under the Apache License, Version 2.0 (the
+    "License"); you may not use this file except in compliance
+    with the License.  You may obtain a copy of the License at
+
+..    http://www.apache.org/licenses/LICENSE-2.0
+
+..  Unless required by applicable law or agreed to in writing,
+    software distributed under the License is distributed on an
+    "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
+    KIND, either express or implied.  See the License for the
+    specific language governing permissions and limitations
+    under the License.
+
+Exploring data with Apache Superset
+===================================
+
+In this tutorial, we introduce key concepts in Apache Superset by exploring
+a real dataset that contains the flights made by employees of a UK-based
+organization in 2011. The following information is given about each
+flight:
+
+- The traveller's department. For the purposes of this tutorial, the
+  departments have been renamed Orange, Yellow and Purple.
+- The cost of the ticket.
+- The travel class (Economy, Premium Economy, Business and First Class).
+- Whether the ticket was a single or return.
+- The date of travel.
+- Information about the origin and destination.
+- The distance between the origin and destination, in km.
+
+Obtaining and loading the data
+------------------------------
+
+Download the data for this tutorial to your computer from `GitHub
+<https://github.com/francishemingway/unofficial-superset-tutorial/blob/master/tutorial_flights.csv>`_.
+
+In the top menu, select **Sources > Upload a CSV**.
+
+.. image:: images/usertutorial/upload_a_csv.png
+
+Then, enter the **Table name** as tutorial_flights and select the **CSV file**
+from your computer.
+
+.. image:: images/usertutorial/csv_to_database_configuration.png
+
+Next, enter the text Travel Date into the **Parse Dates** field.
+
+.. image:: images/usertutorial/parse_dates_column.png
+
+Leaving all the other options in their default settings, select **Save** at
+the bottom of the page.
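As a rough offline analogue of the **Parse Dates** step above, the sketch below reads CSV text and converts one column from plain strings to dates. It uses only the Python standard library; the sample rows, the other column names, and the ``%Y-%m-%d`` date format are assumptions made for illustration, not facts about ``tutorial_flights.csv`` or about how Superset performs the upload internally.

```python
import csv
import io
from datetime import date, datetime

# Hypothetical sample rows; the real tutorial_flights.csv may use other
# column names or another date format.
SAMPLE_CSV = """Department,Cost,Travel Class,Travel Date
Orange,120.50,Economy,2011-01-15
Purple,980.00,Business,2011-02-03
"""


def load_flights(text, date_column="Travel Date", date_format="%Y-%m-%d"):
    """Read CSV text and convert one column from str to datetime.date."""
    rows = []
    for row in csv.DictReader(io.StringIO(text)):
        # Without this conversion the column would stay a plain string,
        # which is roughly what happens when no date column is declared.
        row[date_column] = datetime.strptime(row[date_column], date_format).date()
        rows.append(row)
    return rows


flights = load_flights(SAMPLE_CSV)
print(flights[0]["Travel Date"])  # prints 2011-01-15
```

Once the column holds real dates, it can be used for time filters and time grains, which is what the later sections rely on.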
+
+
+Table Visualization
+-------------------
+
+In this section, we’ll create our first visualization: a table to show the
+number of flights and cost per travel class.
+
+To create a new chart, select **New > Chart** in the top menu.
+
+.. image:: images/usertutorial/add_new_chart.png
+
+Once in the **Create a new chart** dialogue, select tutorial_flights
+from the **Choose a datasource** dropdown.
+
+.. image:: images/usertutorial/add_new_chart.png
+
+Next, select the visualization type as Table.
+
+.. image:: images/usertutorial/chose_a_datasource.png
+
+Then, select **Create new chart** to go into the chart view.
+
+By default, Apache Superset only shows the last week of data. In our example,
+we want to look at all the data in the dataset. To do this, go to the
+**Time** section and remove the filter on **Time range** by selecting
+**Last week > No filter**, then **OK** to confirm your selection.
+
+.. image:: images/usertutorial/no_filter_on_time_filter.png
+
+Now, we want to specify the rows in our table by using the **Group by**
+option. Since in this example we want to compare the different travel
+classes, we select Travel Class in this menu.
+
+Next, we can specify the metrics we would like to see in our table with
+the **Metrics** option. COUNT(*), which represents the number of rows in
+the table (in this case corresponding to the number of flights, since we
+have one row per flight), is already there. To add cost, select Cost
+within **Metrics**, then **Save** the default aggregation option, which
+sums the column.
+
+.. image:: images/usertutorial/sum_cost_column.png
+
+Finally, select **Run Query** to see the results of the table.
+
+.. image:: images/usertutorial/tutorial_table.png
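The table built above is, in effect, a group-by aggregation: one row per travel class, with a row count and a summed cost. A minimal sketch of that logic in plain Python follows; the sample rows are hypothetical, and the code illustrates the idea rather than the query Superset actually generates.

```python
from collections import defaultdict

# Hypothetical rows standing in for the tutorial_flights table.
flights = [
    {"Travel Class": "Economy", "Cost": 100.0},
    {"Travel Class": "Economy", "Cost": 150.0},
    {"Travel Class": "Business", "Cost": 900.0},
]


def summarize_by(rows, group_column, value_column):
    """Return {group: (row_count, summed_value)}, like COUNT(*) and SUM()."""
    counts = defaultdict(int)
    totals = defaultdict(float)
    for row in rows:
        key = row[group_column]
        counts[key] += 1                    # COUNT(*): one per flight row
        totals[key] += row[value_column]    # SUM(Cost)
    return {key: (counts[key], totals[key]) for key in counts}


summary = summarize_by(flights, "Travel Class", "Cost")
for travel_class, (n_flights, total_cost) in sorted(summary.items()):
    print(travel_class, n_flights, total_cost)
```

Here ``group_column`` plays the role of **Group by** and the two accumulated values correspond to the two entries under **Metrics**.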
+
+Congratulations, you have created your first visualization in Apache Superset!
+
+To save the visualization, click on **Save** in the top left of the
+screen. Select the **Save as** option, and enter the chart name as Tutorial
+Table (you will be able to find it again through the **Charts** screen,
+accessible in the top menu). Similarly, select **Add to new dashboard** and
+enter Tutorial Dashboard. Finally, select **Save & go to dashboard**.
+
+.. image:: images/usertutorial/save_tutorial_table.png
+
+Dashboard basics
+----------------
+
+Next, we are going to explore the dashboard interface. If you’ve
+followed the previous section, you should already have the dashboard
+open. Otherwise, you can navigate to the dashboard by selecting
+**Dashboards** on the top menu, then Tutorial Dashboard from the list
+of dashboards.
+
+On this dashboard you should see the table you created in the previous
+section. Select **Edit dashboard** and then hover over the table. By
+selecting the bottom right-hand corner of the table (the cursor will
+change), you can resize it by dragging and dropping.
+
+.. image:: images/usertutorial/resize_tutorial_table_on_dashboard.png
+
+Finally, save your changes by selecting **Save changes** in the top
+right.
+
+Pivot Table
+-----------
+
+In this section, we will extend our analysis using a more complex
+visualization, Pivot Table. By the end of this section, you will have
+created a table that shows the monthly spend on flights for the first
+six months, by department, by travel class.
+
+As before, create a new visualization by selecting **New > Chart** on
+the top menu. Choose tutorial_flights again as a datasource, then click
+on the visualization type to get to the visualization menu. Select the
+**Pivot Table** visualization (you can filter by entering text in the
+search box) and then **Create new chart**.
+
+In the **Time** section, keep the Time Column as Travel Date (this is
+selected automatically as we only have one time column in our dataset).
+Then set **Time Grain** to month, as daily data would be too granular to
+reveal patterns. Next, set the time range to the first six months of 2011:
+click on Last week in the **Time Range** section, then in **Custom** select
+a **Start / end** of 1\ :sup:`st` January 2011 and 30\ :sup:`th` June 2011
+respectively, either by entering the dates directly or by using the
+calendar widget (selecting the month name and then the year lets you move
+more quickly to far-away dates).
+
+.. image:: images/usertutorial/select_dates_pivot_table.png
+
+Next, within the **Query** section, remove the default COUNT(*) and add
+Cost, keeping the default SUM aggregate. Note that Apache Superset
+indicates the type of the metric with the symbol in the left-hand column
+of the list (ABC for string, # for number, a clock face for time, etc.).
+
+In **Group by** select Time: this will automatically use the Time
+Column and Time Grain selections we defined in the Time section.
+
+Within **Columns**, select first Department and then Travel Class. All set
+– let’s **Run Query** to see some data!
+
+.. image:: images/usertutorial/tutorial_pivot_table.png
+
+You should see months in the rows and Department and Travel Class in the
+columns. To get this in our dashboard, select **Save**, name the chart
+Tutorial Pivot, use **Add chart to existing dashboard** to select
+Tutorial Dashboard, and then finally select **Save & go to dashboard**.
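The pivot table described above combines three operations: filtering to a date range, truncating each date to its month, and summing cost per (Department, Travel Class) pair. The sketch below shows one way to express that in plain Python. The sample rows are hypothetical, and the end date is treated as inclusive here, which is an assumption; the tutorial does not say how Superset treats the range boundary.

```python
from collections import defaultdict
from datetime import date

# Hypothetical rows standing in for the tutorial_flights table.
flights = [
    {"Travel Date": date(2011, 1, 15), "Department": "Orange",
     "Travel Class": "Economy", "Cost": 100.0},
    {"Travel Date": date(2011, 1, 20), "Department": "Orange",
     "Travel Class": "Economy", "Cost": 50.0},
    {"Travel Date": date(2011, 2, 3), "Department": "Purple",
     "Travel Class": "Business", "Cost": 900.0},
    {"Travel Date": date(2011, 9, 1), "Department": "Yellow",
     "Travel Class": "Economy", "Cost": 75.0},
]


def pivot_monthly_cost(rows, start, end):
    """Sum Cost per month (rows) and per (Department, Travel Class) (columns).

    Only rows with start <= Travel Date <= end are included (assumed
    inclusive on both sides).
    """
    table = defaultdict(lambda: defaultdict(float))
    for row in rows:
        day = row["Travel Date"]
        if start <= day <= end:
            month = day.replace(day=1)  # monthly time grain
            column = (row["Department"], row["Travel Class"])
            table[month][column] += row["Cost"]
    return {month: dict(columns) for month, columns in table.items()}


pivot = pivot_monthly_cost(flights, date(2011, 1, 1), date(2011, 6, 30))
for month in sorted(pivot):
    print(month, pivot[month])
```

The outer keys correspond to the **Group by** selection (Time, at a monthly grain) and the inner tuple keys to the two fields chosen under **Columns**.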
+
+Line Chart
+----------
+
+In this section, we are going to create a line chart to understand the
+average price of a ticket by month across all the dataset. As before,
 
 Review comment:
   nit: "across the entire dataset" is more correct than "across all the dataset"

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services
