[Ok. I started to write a simple post about how you need to talk about what you want to learn from your study before you can ask about number of participants, but then it evolved into this 1200+ word history lesson. I left that part in, but you can skip to the very end to see my point. - Jared]

We're talking about usability testing as if it's a single procedure that everyone performs exactly the same way. (This is not the only problem with this thread, but it's the one that I won't be all snarky about.)

As a field, we're not very good about making sure everyone knows about our history. In the case of usability testing methods, history is pretty important.

-> BACKGROUND - Skip to the next section if you just want to get to the IMPORTANT RELEVANT STUFF

The first usability tests (as we know them today) were conducted by cognitive psychologists in the 1970s. (You can trace usability testing back to time-and-motion studies in the '20s and '30s, but I don't think we need to go back that far for this conversation.)

When the cog. psychs. were testing, they were using the test methodology as a technique to understand human behavior and cognition: how did people react to stimuli (physical and digital)? They were looking at reaction times, memory, motor response, and other basics. A lot of this work was being done at universities and corporate research labs, like Bell Labs and Xerox PARC. NASA, DARPA, and the DOD were also involved. (Interestingly, they all discovered a lot of stuff that we take for granted today in design -- back then it was all new and controversial, like Fitts's Law.)

In the late '70s and early '80s, we started applying usability testing to engineering processes. I was part of one of the first teams (at Digital Equipment Corporation) to use usability tests in the process of developing products. Engineering teams at IBM, HP, WANG, Boeing, Siemens, GTE, and Nortel were doing similar things. (I'm sure there were others that I've forgotten or didn't know about.)

At DEC, the first engineering uses of usability testing were for either research-based prototype evaluation or very late-stage product defect determination. Meanwhile, John Gould and his team at IBM published a seminal paper about using an iterative process for designing a messaging system at the 1984 Summer Olympics. Jim Carroll's team was using testing methods for understanding documentation needs in office systems. Ron Perkins & co at WANG were doing similar things. Industrial design groups at many companies were using usability testing to study behavioral responses and ergonomic constraints for interactive system devices.

It was still a few years until we saw labs at companies like Microsoft, Word Perfect, and Apple. By the time they'd gotten involved, we'd evolved many of the methods and protocols to look at the design at a variety of points throughout the development process. But the early testing methods were too expensive and too time-consuming to use effectively within the engineering practice. It was always a special case, reserved for the most important projects.

All of these studies involved laboratory-based protocols. In the very late '80s and early '90s, many of us pushed for laboratory-less testing techniques, to lower the costs and time constraints. We also started experimenting with techniques, such as paper prototypes, which reduced the up-front cost of building the design to test it.

Others, such as those behind the participatory design movement in Scandinavia and the ethnographic/contextual design methods emerging in the US and central Europe, were looking at other methods for gleaning information. (This is when Jakob started popularizing Discount Usability Engineering, which had a huge impact on the adoption of the techniques within the design process.)

Today, we see that the cost of conducting a usability test has dropped tremendously. When I started in the '70s, a typical study would easily cost $250,000 in today's dollars. Today, a team can perform an eight-participant in-person study for much less than $5,000, and remote methods are even cheaper.

-> IMPORTANT RELEVANT STUFF (in case you decided to skip the BACKGROUND)

All this is relevant to the conversation because usability testing has morphed and changed throughout its history. When we used it for scientific behavioral and cognitive studies, we needed to pay close attention to all the details. The number of users was critical, as were the recruiting method, the moderation protocols, and the analysis methods. You couldn't report the results of a study without describing, in great detail, every aspect of how you put the study together and came to your conclusions. (You still see remnants of this today in the way CHI accepts papers.)

When we were using it for defect detection, we needed to understand the number of users problem better. That's when Nielsen & Landauer, Jim Lewis, Bob Virzi, and Will Schroeder & I started looking at the variables.
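
(For those who haven't seen that work: the model that came out of it is usually written as 1 - (1 - L)^n, the expected share of existing problems you'll have seen after n participants, where L is the average probability that a single participant exposes a given problem. Here's a minimal sketch in Python; the L = 0.31 default is the oft-cited average from Nielsen & Landauer's data, and it's an assumption you'd replace with a value measured for your own product.)

    # Problem-discovery model from that line of research:
    # expected proportion of problems found = 1 - (1 - L)^n.
    # L = 0.31 is the often-quoted average from Nielsen & Landauer's
    # data sets; treat it as an assumption, not a constant.
    def proportion_found(n_participants, l=0.31):
        """Expected share of existing problems seen after n participants."""
        return 1 - (1 - l) ** n_participants

    for n in (1, 3, 5, 10, 15):
        print(f"{n:2d} participants -> {proportion_found(n):.0%}")

(Run that and you'll see where the famous "five users find about 85% of the problems" claim comes from.)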

But we've moved past defect detection as the common usage. And along the way, usability testing has morphed into a slew of different techniques. As a result, the parameters of using the method change based on how you're using it.

Today, the primary use is for gleaning insights about who our users are and how they see our designs. It's not about finding problems in the design (though that's always a benefit). Instead, it's a tool that helps us make decisions in those thousands of moments during the design process when we don't have access to our users.

Sitting next to a single user, watching them use a design, can be, by itself, an enlightening process. When we work with teams who are watching their users for the first time (something that still happens way too often), they come out of the first session completely energized and excited about what they've just learned. And that's after seeing just one definitely-not-statistically-significant user.

Techniques like usability testing are used today to see the design through the eyes of the user. Because a lot of hard work has been done through the years to bring the costs of testing down significantly, we can use it in this way, which was never possible back when I started in this business.

But there are uses of usability testing that still need to take sample size into account. For example, when we conduct our Compelled Shopping Analysis, we typically have 50 or more participants in the study. (The largest so far had 72 participants in the main study, with 12 pilot/rehearsal participants to work the bugs out of the protocols.) These studies are very rigorous comparisons of multiple aspects of live e-commerce sites, and we need to ensure we're capturing all the data accurately. Interestingly, we regularly find show-stopping design problems in the last 5 participants that weren't seen earlier in the study.
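
(That last observation is less surprising than it sounds if you run the same discovery model with a low-frequency problem. The 5% figure below is mine, purely for illustration, not from our study data.)

    # Same model, applied to a low-frequency problem (illustrative
    # L = 0.05, i.e. only 1 in 20 participants would hit it).
    low_freq = 0.05
    for n in (5, 12, 45, 50):
        seen = 1 - (1 - low_freq) ** n
        print(f"{n:2d} participants -> {seen:.0%} chance the problem has appeared")

A problem only 1 in 20 users hits is more likely than not to stay hidden through the first dozen sessions, so you need those later participants to have a good chance of catching it.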

-> MY POINT (finally)

So, usability testing has evolved into a multi-purpose tool. You can't really talk about the minimum number of participants without talking about how you want to use the tool. And you can't talk about how you want to use the tool without talking about what you want to learn.

If you just want to gain insights about who your users are and how they'll react to your design ideas, you only need a small number of participants (1-5) to get really great insights. Other techniques (such as 5-second tests, defect detection, Compelled Shopping, and Inherent Value studies) require different numbers of participants.

And the different techniques also require different recruiting protocols, different moderating protocols, and different data analysis protocols. So, if we're talking about the number of participants, we need to talk about those differences too.

Hopefully, that will clear all this up. If you want to ask about the number of participants, tell us first about what you hope to learn.

Jared

