Niall Pemberton created ZEPPELIN-6159:
-----------------------------------------

             Summary: Remove Google Analytics from Zeppelin Docs
                 Key: ZEPPELIN-6159
                 URL: https://issues.apache.org/jira/browse/ZEPPELIN-6159
             Project: Zeppelin
          Issue Type: Task
          Components: zeppelin-site-repo
            Reporter: Niall Pemberton


Hi Zeppelin Team!

There are *2,481* files in the Zeppelin [asf-site 
branch|https://github.com/apache/zeppelin-site/tree/asf-site] that reference 
{*}_Google Analytics_{*}. The table below shows counts of the number of files 
in the *docs* & *output/docs* folders:

 
||Verison||docs||output/docs||
|0.5.0-incubating|26|26|
|0.5.5-incubating|26|26|
|0.5.6-incubating|30|30|
|0.6.0|48|48|
|0.6.1|49|49|
|0.6.2|49|49|
|0.7.0|68|68|
|0.7.1|69|69|
|0.7.2|70|70|
|0.7.3|71|71|
|0.8.0|89|89|
|0.8.1|88|88|
|0.8.2|89|89|
|0.9.0|106|106|
|0.10.0|108|108|
|0.10.1|108|108|
|0.11.0| |97|
|0.11.1| |97|
|0.11.2| |97|
|TOTAL|1094|1385|

The ASF {*}_Privacy Policy_{*}[1][2] does not permit the use of *_Google 
Analytics_* on any ASF websites and the ASF Infra team will soon enforce a 
{*}_Content Security Policy_{*}(CSP) that will block access to external 
trackers:
 * [https://lists.apache.org/thread/w34sd92v4rz3j28hyddmt5tbprbdq6lc]

Please could you remove the use of the Google Analytics from the Zeppelin 
website (I will submit a PR request shortly to do that)?
 * [https://lists.apache.org/thread/k45cwf98c5r8qtmdttg2lt1h7yt05tns]

The ASF hosts its own *_Matomo_* instance to provide projects with analytics. 
Normally projects request a tracking id for their project by sending a mail to 
*privacy AT apache.org*  - I can't see any request from *_Zeppelin_*, but I did 
find that it has already been setup on {*}_Matomo_{*}. So it is ready for you 
to use & I'll will include the tracking id in the PR in case you want to use it.
 * 
[https://privacy.apache.org/faq/committers.html#can-i-use-web-analytics-matomo]

Additionally I would recommend reviewing any external resources loaded by your 
website. The Content Security Policy will prevent any resources being loaded 
from 3rd Party providers that the ASF does not have a Data Processing Agreement 
(DPA) with. On the 1st February Infra will begin a temporary "brownout" when 
the CSP will be turned on for a short period. This will allow projects to check 
which parts, if any, of their websites will stop working. The Privacy FAQ 
answers a number of questions about which external providers are permitted or 
not:
 * [https://privacy.apache.org/faq/committers.html]

Thanks
Niall

[1] [https://privacy.apache.org/policies/website-policy.html]
[2] [https://privacy.apache.org/faq/committers.html#can-i-use-google-analytics]



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to