petermarshallio commented on code in PR #14523:
URL: https://github.com/apache/druid/pull/14523#discussion_r1257881185
##########
examples/quickstart/jupyter-notebooks/notebooks/03-query/03-approxCountDistinct.ipynb:
##########
@@ -0,0 +1,470 @@
+{
+ "cells": [
+ {
+ "cell_type": "markdown",
+ "id": "557e06e8-9b35-4b34-8322-8a8ede6de709",
+ "metadata": {},
+ "source": [
+ "# Counting distinct values\n",
+ "\n",
+ "__It's extremely common for analysts to want to count unique occurrences
of some dimension value in data. With the Druid database's history of large
volumes of data comes an advanced computer science technique to speed up this
calculation through approximation. In this tutorial, work through some examples
and see the effect of turning it on and off, and of making it even faster by
pre-generating the objects that Druid uses to execute the query.__\n",
+ "\n",
+ "## Prerequisites\n",
+ "\n",
+ "This tutorial works with Druid 26.0.0 or later.\n",
+ "\n",
+ "Launch this tutorial and all prerequisites using the `druid-jupyter`
profile of the Docker Compose file for Jupyter-based Druid tutorials. For more
information, see [Docker for Jupyter Notebook
tutorials](https://druid.apache.org/docs/latest/tutorials/tutorial-jupyter-docker.html).\n",
+ "\n",
+ "You must also have loaded the \"FlightCarrierOnTime (1 month)\" sample
data, using defaults, into the table
`On_Time_Reporting_Carrier_On_Time_Performance_(1987_present)_2005_11`.\n",
+ "\n",
+ "If you do not use the Docker Compose environment, you need the
following:\n",
+ "* A running Druid instance.\n",
+ " * Update the `druid_host` variable to point to your Router endpoint.
For example, `druid_host = \"http://localhost:8888\"`.\n",
+ "* The following Python packages:\n",
+ " * `druidapi`, a Python client for Apache Druid\n",
+ "\n",
+ "To start this tutorial, run the next cell. It defines variables for two
datasources and the Druid host the tutorial uses. The quickstart deployment
configures Druid to listen on port `8888` by default, so you'll make API calls
against `http://localhost:8888`.\n"
Review Comment:
Updated the default to `host.docker.internal`.
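
For illustration only, a minimal sketch of how a setup cell could pick up that default. The scheme and port `8888` come from the Router example in the notebook text above. The `DRUID_HOST` override and the `resolve_druid_host` helper are hypothetical names used for this sketch, not something this PR is known to define.

```python
import os

# Assumed default: the comment above only names the hostname; the scheme and
# port 8888 are borrowed from the Router example in the notebook text.
DEFAULT_DRUID_HOST = "http://host.docker.internal:8888"


def resolve_druid_host(env=None):
    """Return the Router URL, preferring a DRUID_HOST override when one is set."""
    env = os.environ if env is None else env
    # DRUID_HOST is a hypothetical variable name chosen for this sketch.
    return env.get("DRUID_HOST", DEFAULT_DRUID_HOST)


druid_host = resolve_druid_host()
print(f"Druid host: {druid_host}")
```

Readers who run Druid outside the Docker Compose environment could then set the override to `http://localhost:8888` instead of editing the cell.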
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]