Telemetry Experiments: experiment, A/B, and questionnaire implementation in Firefox

Benjamin Smedberg Tue, 28 Jan 2014 15:16:38 -0800

One of the things I have been looking at in some detail recently is howwe can use qualitative measurements in Firefox. This includes betterintegration of existing Telemetry and FHR systems, but also measurementswhich don't fit into those systems.

Part of my study was prompted by a request from Mozilla user research,who want to be able to run experiments and collect data from Firefoxusers, in a similar way to the mostly-defunct Test Pilot program, butwith a better sample population and more rigorous engineering and release.

I also collected examples of problems that various groups have wanted tosolve using data collection. It turns out that some of these use casescan already be solved using existing deployment and measurement systems,while others need additional features.


https://docs.google.com/document/d/19qPbV8XJQL0bDwG4ZOfFhIdwbeFAM-5VT9S8uCpzapc/edit?usp=sharing

I'm interested to know whether there are other important cases which asignificantly different from the ones which I've already collected. Inparticular, I'm looking at the following variables:


 * User population: what kind of user population is desirable/necessary
   in order to answer the question? For an early-stage UI demo, the
   desired population may be users who want to live on the bleeding
   edge and are willing to live with bugs. For some studies, we may
   want to examine user behavior in particular countries or who have
   particular addons installed.
 * Data privacy characteristics: in order to answer the question, do we
   need to collect any identifying information, such as URLs? Does
   collecting the data provide direct benefit back to users?
 * Engineering: does the measurement require changing core code, or can
   the measurement be implemented as addon code? What is the expected
   quality of the change being considered?
 * Result monitoring: what kind of result monitoring is necessary? Do
   we expect a single report to run after a while, or will this measure
   ongoing Firefox behavior? Is it important to be able to correlate
   results against other pieces of data?
 * User interactions: to what extent should users be aware that a
   measurement or experiment is in progress? Do we want to ask them
   specific questions or does the experiment require some sort of
   opt-in or opt-out (this is related to the questions about privacy
   and user population).

This quarter (in Firefox 30) my team is going to focus on building outone specific system, a way to deploy experiment code to prerelease usersin Firefox desktop builds. We're going to start out small, solving aspecific request from Gregg Lind in user research for a tool to deploysome experiments related to search behavior in Firefox.


https://docs.google.com/document/d/1GPpkIcWFNkZmXONjqBCc05U3uocOD-1jpZHdAsR0v1k/edit?usp=sharing

Each experiment will be deployed as a restartless addon, andmeasurements will be taken via some combination of existing FHR andtelemetry data collection channels. The experiment system will belimited to people with telemetry enabled(1) and each experiment willalso be able to set additional conditions, such as limiting theexperiment to users in certain release channels, locales, addons or lackof addons, etc.

After this first phase is complete, I expect to extend this system. Wewill probably want to be able to run similar experiments in Firefox forAndroid, although addons can do far less UI modification in general. Wewill also want to handle A/B testing where we don't install an addon,but simply flip various pref configurations. We also plan on extendingthis same system as a way to deploy questionnaires or surveys to users.For example, if we find an addon which appears to be malware, we mightask users whether they know the addon is installed, whether theyinstalled it intentionally, etc. I am interested if people have specifichigh-priority studies or surveys in mind that we can use to serve asmodels for future revisions.

Finally, we are considering whether and how to combine FHR and telemetrydata collection. Each system currently has weaknesses which we'd like toaddress, and it seems that the best way forward is to combine them. Thisis still in early decision-making, but I've written up a proposal herefor comment:https://docs.google.com/document/d/1JKnqejahVWMev4xUYGbRiICw0HpwopcXBqPYxco0YzU/edit?usp=sharing


Questions, concerns? Followup to firefox-dev please.

--BDS

1. Currently telemetry is enabled by default in nightly and aurorabuilds, and I have requested that it be enabled by default in allprerelease builds (including beta). Being able to run experiments onbeta users and measure the results is critical, since our beta userpopulation is much more representative of release users.

_______________________________________________
dev-platform mailing list
dev-platform@lists.mozilla.org
https://lists.mozilla.org/listinfo/dev-platform

Telemetry Experiments: experiment, A/B, and questionnaire implementation in Firefox

Reply via email to