Archive for the 'Dialog Design' Category

Smashing Magazine just published an article about Useful Techniques for Good UI Design, that even though concentrated on Web Applications, I felt there were a few techniques that could be reviewed with an eye towards VUI design:

  1. Highlight important changes
  2. And by this I don’t mean “Please listen careful for our menu options have changed”. Here we’re talking about system status visibility - knowing where you are and that actions led to expected results. Implicit confirmations are a good example of this.

  3. Enable shortcuts in your application
  4. The key concept here is to offer users more responsive user interfaces. In the case of the phone, this can be somewhat compared to having “touch-tone equivalents” which allow expert users to breeze through applications without having to wait to be prompted for input. But here I would also argue that other things would fall under this same principle: good command selection, “shadow” options that even though not prompted for can be supported if tuning data backs up the decision, and even “parallel” choices which allow someone picking one branch to swiftly change “branches” without having to “go back” or return to a “main menu”.

  5. “Upgrade” options
  6. I feel this would be similar to having functionality available for “expert users”. Proper use of barge-in as well as multi-slot support allows someone to interact in more “advanced” ways with a system keeping such a transition simple and intuitive.

  7. Advertise features of the application
  8. And no, sending out “user manuals” for applications does not qualify for this. Here we would be talking about pointing out “new” features or even “advanced” features that are available. As with most non-task-oriented information, this should be presented only after the user has successfully completed their task.

  9. Use “color-coded” lists
  10. Items appearing together are an issue, no matter what type of interface we’re talking about. Therefore it is really important to put attention to the different tools at our disposal to avoid this issue: voice intonation/emphasis (such as making menu options clearer), word choice (avoid confusion amongst different choices), silences (natural pauses to allow mental processing of the information), etc.

  11. Offer personalization options
  12. I’m sorry to say this has been out there for quite a few years for most interfaces, yet voice user interfaces hardly ever offer them. It may be harder to do than with visual interfaces (where you can change colors, rearrange items, etc.) yet small things like offering only relevant choices based on someone profile or remembering the last tasks they’ve performed go a long way in making users feel like they “own” the system.

  13. Display help messages
  14. Help is definitively one of those topics that has been debated a lot, and like most controversial ones, whether or not it should be used and how it should be used can be summarized in two words: “it depends”. Personally I feel help should not be a stand-alone piece of a design (whether it is a tutorial or a separate collection of help prompts) but rather an integral part of design – commands offered should be helpful to a user, error messages should add relevant information to help users recover, transition messages should help users know where they are and where they are going, etc.

  15. Design feedback messages carefully
  16. Here we can think about transition messages (exit prompts) and error recovery prompts. Here, there are various things that can be done to make them more effective: control voice intonation so as to convey the right meaning, reword questions and statements so as to convey the same meaning without simply “reprompting” users (if the problem had to do with users not understanding you in the first place, simply repeating is not going to help at all)

  17. Use tabbed navigation
  18. This is one of those areas where visual design has an advantage over non-visual design. Nevertheless, good information architecture can help obtain similar results, where applications reflect user’s mental models making it easier for them to know what choices mean and how the system works. This makes users feel at ease since they feel more in control.

  19. Darken background
  20. Conversely, here non-visual design has the advantage since all interactions happen in sequence. No need to “gray out” parts of our design since by the time we move on with a new “task”, all previous information now belongs to the past.

  21. Lightboxes and Slideshows
  22. The concept here is to allow users to navigate back and forth. In the visual world there are a lot of tricks to reduce the noise and allow a user to focus on a particular item. In the case of non-visual design, there are ways to do similar things when the system has been properly architected, which should include consideration of “go back” behavior that would contextually match a user’s mental model, as well as proper “sub task” definitions so that if users need to make a change or update a certain piece of information, they don’t have to start all over again or get confused about how to do it.

  23. Short sign-up forms
  24. This is another one where non-visual design has the opportunity to leverage information. The whole notion here is to minimize the effort needed to identify a user and to speed up the process. On one hand, this means taking advantage of what technology has to offer: ANI identification, DNIS separation (ideal for language selection), CTI (please, please, please don’t loose everything that the user has accomplished so far). And on the other, there are design techniques such as the removal of “optional” elements if that allows a user to proceed (and especially if that avoids confusion and distraction from the user’s real goal).

With more people spending more time watching video on their computers than on TV, it makes sense to have a XUI – A channel for User Interface Design. Enjoy!

I’ve always been fascinated by the cross-pollination that can be achieved when various fields and disciplines work together. As you might have noticed from my various posts, I also like to keep an eye out for opportunities in the UI field – things happening in other fields that may not be working together in ours yet, but whose potential definitively makes them worth a second thought and an evaluation to see what properties and ideas can be applied to UI design.

Therefore, you can imagine my excitement after seeing this particular video from TED: Yves Behar demonstrates in a very engaging presentation how business requirements, resource limitations and the status-quo (read “fear of change”) tend to yield “us-too” type of designs and solutions that don’t really add any value to those companies or the customers of those companies.

I loved the phrase he presented: “…Advertising is the price companies pay for being un-original.”, which makes me wonder what other things aside from Advertising may be “prices” companies are paying for not trying to push the envelope, for not questioning “best practices”, and as he puts it, from not “designing from the inside out”.

For me, the biggest UI lesson contained in his words have to do with how we all do our best to deliver VALUE to our customers and to their users, but how many times have you brought YOUR VALUES to the table? When was the last time you, as he puts it, “fought like an animal” to make sure the VALUES that drove the project in the first place are maintained until the end, without compromises?

While doing some research on Multimodal Usability and User Interfaces, I ran across a list of Three Main Attributes of Multimodal User Interfaces, which on careful thought, can definitively be applied to ANY User Interface. It was a nice refresher which reminded me of why good design is so difficult to achieve, but very rewarding.

Good User Interfaces can be judged based on their:

  1. Effectiveness: Whether a user can achieve a desired goal or a certain task with a predefined degree of perceived accuracy. This one has to do directly with our users and their environment. Good questions to ask ourselves: What level of accuracy are you designing for? What completion rates can you expect from certain design strategies? How satisfied will your users be after completing a task?

  2. Efficiency: How much effort (fatigue, frustration, cognitive demand, stress, discomfort, number of tries) and resources (time) will those users need to complete a task. Good questions to ask ourselves: How long is it going to take for an average user to achieve a certain goal? How many error corrections can we expect from certain design strategies? Will users obtain something of a higher value than the one they are putting into the interaction?

  3. Learnability:Whether a user can easily discover the functionality contained in a system and quickly learn how to use it. Good questions to ask ourselves: How intuitive is my design? Based on the context on the interaction, does the funtionality offered by the system match the user’s expectations? How easy is it for users to explore and discover new functionality and advanced features?

Based on these attributes, where do your design stand? Come on, don’t be shy…

Finally had a chance to look at this month’s cover article of the SpeechTEK magazine. What a joy! I was very pleased to see that once again, the topic of Translations (which I often refer to as “Transgressions”) comes up in an attempt to raise awareness about the implications of simply taking content in once language and doing a straight 1:1 translation without any consideration for cultural issues and linguistic nuances. As the author pointed out, translations are “a quick, cheap, lousy idea.” (well said!)

As the article points out, in the particular case of IVR systems there are various syntax and grammatical issues to consider as well as code implications and underlying logic divergences derived from those structural differences – it’s not just a matter of word order!

Furthermore, the author makes a really good point about the dual nature of IVR and Speech Recognition systems in the sense that we should not only be careful about what we say and how we say it (the outgoing activity) but also about the incoming activity – what callers say, how they say it, etc.

Along with the list of Tricks of the Trade provided, things such as the use of a native speaker for reviewing localized Dialog are of utmost importance.

Finally, one of the topics that often comes up when dealing with countries where more than one variety of the same language are spoken (for example, Spanish in the US), is the issue of which “version” of the language should be spoken so as to accommodate various countries of origin – normally referred to as “neutral Spanish”. I loved the definition they used because it captures the complexity of finding that middle ground while still taking into consideration the technical/financial implications and realities which we often face in our projects – “the neutral version of any accent is the version that offends the fewest people”

Happy Reading!

It seems one of the most frequent questions that come up when designing a new user interface is “how many options should appear on any given menu?” which depending on the answer is almost always followed by “how many levels down should the system have?”

And believe me, neither one has an easy or absolute response. As witnessed by some of the heated debates on the VUIDs list, the old standing guideline of “five options or less” was recently put to the test when Patrick Commarford and James Lewis from IBM published the article entitled “A Comparison of Broad Verses Deep Auditory Menu Structure” where the results seem to indicate that when comparing broad vs. deep menu designs, giving callers a single menu with many options (providing touchtone fallback) yields better results than using fewer options supported by submenus.

Aside from all the points, concerns, comments and objections raised by both the authors, the VUI community and other blogs, in my opinion I think it’s going to be really hard to reach a consensus if the main focus of the discussion if to agree on whether *5* is the right number of choices or not. The problem I see with trying to find that ‘ideal’ number is that a number by itself doesn’t mean anything. In this case, I think the devil is in the *context*.

In other words, any attempt to generalize what a good design should look like without taking into consideration the context of the application is not very responsible. We all know there are many factors that need to be taken into account when designing any system: caller’s age, demographics, type of application, caller’s objectives, business goals, etc.

Furthermore, when evaluating possible menu choices, the frequency with which each choice is needed and used by callers should not only drive their position within the menu but also the decision of whether or not the option should be in that menu in the first place (for example, if the top 3 choices on a menu are used 45%, 31% and 20% of the time respectively, then adding one more choices that’s only used 1% of the time should not be considered – in fact those calls are likely to be better handled by agents in the first place).
When all those factors are added to the mix, then an appropriate design *strategy* can be conceived.

For example, when thinking about the context of an application, imagine a system asking callers to choose a bird species – “And which type of bird is circling around you: dendrocygna, gelochelidon, herpethotheres, or tachycineta?”. In that scenario, unless you’re a birdwatcher, you’re likely to be overwhelmed even after hearing only 3 choices. The counterpart of course would be to ask for something so common and known that the number of choices in itself wouldn’t make a difference – “And which planet are you calling from: Mercury, Venus, Earth, Mars, Jupiter, Saturn, Uranus, Neptune or Pluto?”

I definitively believe the industry should strive to find more answers to long standing questions, particularly when those responses can be backed up by real world data, but in this case, I think that unfortunately the answer to the question whether to go broad or deep on menus still is… it depends.

I recently got a link from Amy Quinn pointing out an article entitled “101 Five-Minute Fixes to Incrementally Improve Your Web Site” provided by InsideCRM, and though it might be helpful to go over some of those points in the list and hence suggest “21 Quick Fixes” (granted they might take a little more than 5 minutes) that can be adapted and applied to Voice User Interfaces and Voice Applications:

Copywriting

1. Tell callers why they should perform a task. “People are trained to follow a request, as long as you give them a good reason to do it”. Therefore, put yourself in your caller’s shoes (or ears), and make sure you’re not only offering something of value to them, but that you’re clear in why they should go through it (e.g. shorter wait times, 24×7 availability, etc.)

2. Make the most highly trafficked menus easier to listen to. If your system contains menus that are too long and you can’t reduce the number of choices, think about grouping some of them or break the menu up so it’s easier to process in short term memory.

3. Make choices meaningful. Be sure to change any vague or cutesy menu options to something more up-front, meaningful, that callers can understand (no jargon please)

4. Stay consistent. Check your prompts and terminology for consistency, or else the experience will seem unstable, unprofessional or patchy.

5. Stay simple. You just can’t beat that. Granted some processes are complex by nature, that doesn’t mean they need to be complicated. Strive for clarity!

6. Avoid making hollow promises. Including those callers are taken for granted nowadays, such as pressing 0 and expecting to talk to an agent.

7. Be concise. ‘Nuff said.

8. Go with what works. And if you don’t know what works, ask your users – that’s why Usability was invented for in the first place!

Usability

9. Make navigation consistent. Listen to your users, understand how they think and what steps they need to follow to complete a task, and then stick to that flow.

10. Never ask for more information than you need. Related to #1, this includes only requesting relevant information at appropriate times (why ask for the last 4 of the SSN when all you want is the hours of operation?). “When you get greedy for data, you’ll turn off some visitors”.

11. Add a search box. Or in the case of VUI design, think about adding a SpeakFreely/SayAnything-type of implementation where callers can say what it is they need right at the beginning of the call.

12. Use plenty of contrast. In our case, we’re looking for ’sound contrast’. You’ll need to make sure you coach the voice talent properly so as to not only get a good sounding application but also intonations that convey the right message and elicit the right responses.

13. Test it on real users. Oh, if we could just make this a law…

Accessibility

14. Modify color. In our case, we’re looking for ’sound color’. This might include having multiple personas and voice ‘sets’ to account for playback speed and other traits that are necessary to support multiple caller’s ages, audible ranges, etc.

15. Identify the language. Explore the use of alternative DNIS numbers for each language so as to avoid requiring language menus (“For Spanish, press 2…”) in which case don’t forget to make one of them the default.

16. Supplement navigational aids. Explore the use of professionally generated earcons that not only serve as branding elements but also have an impact on application usability (for example, a quick tone preceding the playback of a confirmation number).

17. Define shortcuts. Take advantage of any Usability and Tuning findings you discover that might involve real users employing certain keywords/phrases as shortcuts often enough to justify adding them as ‘hidden shortcuts’

Design

18. Place important information “above the fold”. In our case, either at the beginning of a certain task (e.g. providing account balance before attempting a transfer), or at the end of a sentence so as to facility short term memory recall, particularly if the information provided is new to the caller. (e.g. “Your due date is May 15th.”)

19. Reduce choices. If your system contains menus with too many choices, reduce the number of choices to the very minimum necessary for callers to accomplish their task and maybe add a “something else” choice for everything else.

20. Nix banners. Which in our case might also include unwanted advertisements and legal disclaimers (“This call may be recorded for quality purposes.”) whenever possible.

21. Stay consistent. Check to make sure the design and prompt recordings are not only consistent throughout the entire experience but also consistent with the system persona and user profiles.

The As if not enough discussion has been generated over this particular UI design tool on the VUIDs group, it seems the 37 signals post about how they don’t use Personas stirred yet another round of arguments on both their own site as well as in some other UI-related blogs such as Good Experience and The User Experience Soapbox.

I think they all provide a very interesting perspective and valid points both for and against the use of Personas (or Personae) as a design tool.

But for me, aside from it being just one more tool in our UI toolbox, the added extra business value that I find in them (which I didn’t see being brought up on those discussions) is how effective it is to help business owners and other business-side team members move away from thinking about “the user” in an abstract way.

In general, if you’re in the same room with five different people and you mention you want to do something to help improve the experience of their “users”, chances are each and every one of them will have a very different image in their head and idea of who this “user” is and what this “user” needs.

On the other hand, once all the information has been gathered and the right process has been followed to define a User Persona (or Archetype) for your system, then any future discussion takes a very different direction. You’ll notice how now new design decisions and arguments can be centered on your Persona.

It doesn’t have to be too complicated nor time consuming. For example, something as simple as defining a Natalie archetype might be enough: “Natalie is a real estate investor and has an ABC bank customer for about six years. She often calls her branch to get up to date information about interest rates and mortgage products. She normally calls from her home-office, so she’s in a quiet environment. She considers herself tech-savvy, so she likes automation, but time is precious to her, so if things aren’t working fast, she’d rather just talk to someone who can help her quickly.”

With this in hand, you’ll see how asking regular questions such as “how will the user be able to get her balance?” or “will the user need more information after a transaction?” turn into very relevant, in-context discussion starters such as “what’s the best way to offer Natalie the most up to date interest rates?” or “what information will Natalie need after choosing one of our mortgage products to make an investment decision?” respectively.

I understand the concern about using personas to replace talking to real people, but I like to think that they are not just the output of talking to real people, but a way for those people to keep “talking” for the duration of a project. So tell me, what do your users think?

(I found this great article by Jared M. Spool) and decided to do an adaptation)

From observation and analysis, it seems teams who focus on the long term objectives of a project and an organization are far more likely to create designs that really pay off for the organization, whereas short-term vision teams end up not only not meeting the caller’s expectations, but often having to revisit the design and redo most of the work later on.

It is important to understand that the right approach to any Redesign involves turning the system into a living, breathing entity that evolves over time and grows with the caller’s and organization’s needs.

Here are seven essential long-term components to reach a successful phone system redesign project:

1. Make Sure You Have A Vision

This can be as simple as to look five or ten years into the future and ask the question, “What will calling our system be like on that date? What experience will the user have?” Team members from the best organizations have a consistent, clear idea what the user’s experience will be like in the future. Such a vision helps drive the design as well as any future changes/enhancements (“will this change get us closer to that vision?”).

It’s critical the vision not focus on future technology but instead on future experience. What are the steps in today’s process that makes things cumbersome or frustrating? How could the experience become more delightful?

2. Spend Time With Your Users

To successfully redesign (and to design for that matter), you need to be in close contact with the source – your users. You need to know who is using your system and what they are doing with it. You need to know what works and doesn’t work for them.

Based on the number of new implementations and changes happening out there, it seems many companies finally understand the importance of the telephone channel as a critical touch point with their customers. Unfortunately, even though some of those systems are getting major face lifts (heard any new “please listen carefully as our options have recently changed” recently?), most of the time decisions are made without even looking or listening to their users.

When preparing for a redesign of any system, teams should not only focus on what the new design should do but should also spend the same amount of time listening to real caller interactions with the existing design as well as sitting down with call center agents.

3. Reduce Risk By Working In Little Bits

Going back to thinking about design as a never-ending, always-improving process, the most successful teams keep their projects small. They don’t attempt to redesign everything in a single launch; instead, they work on one small section at a time.

Otherwise, you end up with complexity at all angles: the scope of functionality is larger, the number of stakeholders is larger (each with their own concerns), the number of archetypes the team is designing for is larger, more compromises are made, and the risk is much higher. If things don’t go as planned (as it often happens), it’s a huge problem for everyone, often with more visibility in the organization than the team would like.

Teams that only focus on a small portion of the application at a time reduce all those factors and risks while being able to concentrate on those critical areas that will make or break the design. Plus of course, reducing scope and concentrating on the most used features yield much better ROI numbers and timeframes.

4. Have the Right Skills Internally

Here recommendations might slightly different from what a regular web project might require. Even though on the web the best teams are less likely to hire outsiders to do their designs, building the right speech and VUI skills in-house is not only hard and too specialized, but may get in the way of a successful redesign. Nevertheless, since the idea is to have a design that evolved over time, it is up to the company to maintain, change, update and enhance the application as user’s needs change.

Therefore here the suggestion would be to have a dedicated in-house team that would work side-by-side with someone with experience in these types of systems so they can learn not only the intricacies of the design but also the reasoning and the strategy behind it (which will also help them become advocates of it and defenders of the user experience). That way, external resources can be later on used as mentors or as a way to make larger changes faster.

5. Think ‘Standards’

The VoiceXML and SRGS standards are the successful teams’ best friends. Careful application of the standards can dramatically shorten the time it takes to make changes down the road, as well as to simplify the integration of third party components and technologies.

Even if an original application didn’t start out as standard-compliant system, it’s worth the effort of slowly converting it. As functions/modules/flows are redesigned (in little bits, remember), changing them over to be standards-compliant is an effective approach. Every new redesigned area helps improve the team’s skills in using standards, thereby making the next section even easier to convert over.

6. Have a Plan for Change

Aside from having clear implementation and maintenance plans in place (system architecture, internal processes, etc.), it is critical to also plan for how the users will experience the change. Will they just call in one day to find an entire new experience or will the change slowly happen over time, almost imperceptibly? Will they be notified in advance of what the new system is prepared to do for them? Are all other contact points (web, branches, etc.) ready to support the cross-channel interactions that will be generated by the new system? Will the agents be prepared to handle the temporary increase in the number of questions and issues that often arise as soon as a new system is implemented? (and no, asking callers to “please listen carefully as our options have recently changed” doesn’t count as a plan)

For example, one interesting approach some companies take on the web is to slowly convert users over to a new version of a system by offering the new options while still allowing users to continue to use the old functionality/version for a while (this may be particularly critical for caller populations with a wide gap between infrequent callers and expert users, specially if moving from a DTMF system to speech). That way companies can assess the risk of changing the functionality out from under these users versus the cost of supporting both interfaces.

7. Understand the Internal Processes

Unfortunately, just as it happens on the web, many teams approach the redesign process much like they’d approach the design of a brochure or a monthly statement. Designing either one has the advantage that, at some point, it is printed and delivered. Once that happens, it can’t be changed — it’s done. The only thing is to start over with a new one.

The problem of course is that if you think you can think about a new design, implement it, then pay attention to other business, not giving the system any further attention, you’ll be in trouble.

The most successful teams consider, in the planning stages of the project, what the long-term internal processes will be for updating/enhancing the system after the design changes (due to new user’s needs or business requirements). How will they add new functionality? When do they remove low usage flows? Who will edit prompts before they go live? Who will review changes? Who will decide about changes to the user’s experience?

Understanding how the organization will handle the ongoing system changes shortens the time it takes to make improvements, reducing the need for a risky major relaunches (or sometimes even worse – pulling out the new system and bringing back the old one).

Conclusion

Careful consideration of long-term factors dramatically increases the odds a team will produce ongoing results that have considerable business impact. Teams ignoring these long-term components may get a new design launched, but will likely find themselves reliving the difficult experience again in just a few months.

Considering the latest advancements and the impact of social media in our environment, I think it would make sense to borrow a page from social design and re-think who should be the center or “pivot” of our designs.

As Joshua points out in his blog, social software has changed the focus of most web applications from a thread-centered approach (the way Usenet was setup and how most Forums work so that each new thread was assigned a specific topic under which the discussion took place) to a people-centered design (the way MySpace, Ryze, LinkedIn, Friendster, and others work where the individual becomes the focal point of all interactions which then spread throughout that individual’s network of friends.

From a VUI perspective, I think there are some lessons that can be learned and applied to application design. In particular I think the point about how if we live our daily lives with ourselves as the center it would also make sense for the applications around us to behave the same way makes total sense in the context of self-service.

We all know the drill: design tends to be driven by all sorts of requirements which tend to yield solutions that are business-rule-driven, ROI-driven, containment-driven, call-length-driven and from time to time, user-driven (even though I’m glad to report that tendency seems to be changing).

But to me, user-driven is not the same as people-centered. On one hand, applications can certainly be designed around user’s needs and offer things that most users want. But on the other hand, in a true step towards personalization, designs could certainly be centered on people: who they are, what they like, how they interact, what they need, etc. at a very individual level in some areas, but yet connected to that individual’s network of friends, peers, family, and acquaintances.

And as we know, once a different “pivot” is selected during the definition of an organizational scheme, the entire interaction, steps needed and flow are likely to change dramatically.

For example, what would a financial system sound like with people as the pivot point? Is there a shared object in our systems around which people would like to connect?