[datasketches-website] branch asf-site updated: Automatic Site Publish by Buildbot

git-site-role Mon, 02 May 2022 17:40:47 -0700

This is an automated email from the ASF dual-hosted git repository.

git-site-role pushed a commit to branch asf-site
in repository https://gitbox.apache.org/repos/asf/datasketches-website.git



The following commit(s) were added to refs/heads/asf-site by this push:
     new 66c47686 Automatic Site Publish by Buildbot
66c47686 is described below

commit 66c476864ebd3b74f56ca2e5bf2017bc36a42446
Author: buildbot <[email protected]>
AuthorDate: Tue May 3 00:40:21 2022 +0000

    Automatic Site Publish by Buildbot
---
 output/docs/Architecture/SketchCriteria.html      | 6 +++---
 output/docs/Architecture/SketchesByComponent.html | 6 +++---
 2 files changed, 6 insertions(+), 6 deletions(-)

diff --git a/output/docs/Architecture/SketchCriteria.html 
b/output/docs/Architecture/SketchCriteria.html
index 460a0f8b..bf9e70ac 100644
--- a/output/docs/Architecture/SketchCriteria.html
+++ b/output/docs/Architecture/SketchCriteria.html
@@ -513,17 +513,17 @@
 
 <p>There are lots of clever and useful algorithms that are sometimes called 
“sketches”.  However, due to limited resources, in order to be included in the 
DataSketches library, we had to clearly define what we meant by the term 
“sketch”.  Otherwise, we would end up with a hodge podge of algorithms and have 
to answer: Why don’t we include algorithm X?.</p>
 
-<p>In order to be in our library, a <em>Sketch</em> must exhibit these 
properties:</p>
+<p>In order to be in our library, a <em>Sketch</em> should exhibit these 
properties:</p>
 
 <h2 id="streaming--one-touch">Streaming / One-Touch</h2>
-<p>Sketches are a class of streaming algorithms by definition, which means 
they only touch or process each item in a stream once.  This is absolutely 
required for real-time applications.</p>
+<p>Sketches are a class of streaming algorithms by definition, which means 
they only touch or process each item in a stream once.  This is absolutely 
essential for real-time applications.</p>
 
 <h2 id="small-in-size">Small in Size</h2>
 <p>One of the key properties of any sketch is that it is a synopsis or summary 
of a much larger data set.  The whole point of a small summary is that it is 
faster to read and merge.  In this context, <em>small</em> means small with 
respect to the original data.  If the original data is terabytes in size, a 
single sketch of 100KB may not seem very different from a sketch of 50KB as 
both are very small compared to the original data.</p>
 
 <p>But <em>small</em> can also be important in an systems context. If that 
original terabyte of data generates 10,000 sketches, each sketch consuming 
100KB, that amounts to a GB of storage.  Now the total memory use starts to be 
a concern.  Being able to reduce that by 50% by using a smaller (and otherwise 
equivalent) sketch can be a big deal.</p>
 
-<p>Nonetheless, <em>small</em> is relevant to the specific application. 
Sketches can very from a few bytes to many megabytes depending on the specific 
sketch and how it has been configured. Whether it is small enough is up to the 
system engineers to determine.</p>
+<p>Nonetheless, <em>small</em> is relevant to the specific application. 
Sketches can very from a few bytes to many megabytes depending on the specific 
sketch and how it has been configured. Whether it is small enough depends on 
the use-case and the specific environment.</p>
 
 <h2 id="sublinear-in-size-growth">Sublinear in Size Growth</h2>
 <p>Not only should a sketch start small, it needs to stay small as the size of 
the input stream grows.  Some sketches have an upper bound of size independent 
of the size of the input stream, which clearly makes them sublinear.  Other 
sketches may need to continue to increase their size as the stream grows.  For 
these sketches it is important that they do so very very slowly. They should 
grow sublinearly by no more then <em>O(log(n))</em> or preferrably by <em>O(k 
log(n/k))</em> or less.</p>
diff --git a/output/docs/Architecture/SketchesByComponent.html 
b/output/docs/Architecture/SketchesByComponent.html
index e3ef9cbd..120d652f 100644
--- a/output/docs/Architecture/SketchesByComponent.html
+++ b/output/docs/Architecture/SketchesByComponent.html
@@ -560,11 +560,11 @@ This code is versioned and the latest release can be 
obtained from
     </tr>
     <tr>
       <td>org.apache.datasketches.kll</td>
-      <td>Quantiles sketch with better accuracy per size than the standard 
quantiles sketch. Includes PMF, CDF funtions, for floats. Only on-heap.</td>
+      <td>Quantiles sketch with better accuracy per size than the standard 
quantiles sketch. Includes PMF, CDF functions, for floats, doubles. On-heap 
&amp; off-heap.</td>
     </tr>
     <tr>
       <td>org.apache.datasketches.quantiles</td>
-      <td>Standard Quantiles sketch, plus PMF and CDF functions, for doubles 
and generics and for heap and off-heap.</td>
+      <td>Standard Quantiles sketch, plus PMF and CDF functions, for doubles 
and generics. On-heap &amp; off-heap.</td>
     </tr>
     <tr>
       <td>org.apache.datasketches.req</td>
@@ -576,7 +576,7 @@ This code is versioned and the latest release can be 
obtained from
     </tr>
     <tr>
       <td>org.apache.datasketches.theta</td>
-      <td>Unique counting Theta Sketches for both heap and off-heap</td>
+      <td>Unique counting Theta Sketches for both on-heap &amp; off-heap</td>
     </tr>
     <tr>
       <td>org.apache.datasketches.tuple</td>


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[datasketches-website] branch asf-site updated: Automatic Site Publish by Buildbot

Reply via email to