<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Is multi-modality and network-based speech recognition the future?</title>
	<atom:link href="http://www.vuidesign.net/is-multi-modality-and-network-based-speech-recognition-the-future.htm/feed" rel="self" type="application/rss+xml" />
	<link>http://www.vuidesign.net/is-multi-modality-and-network-based-speech-recognition-the-future.htm</link>
	<description>Interface Design Lessons From The World Around Us</description>
	<lastBuildDate>Thu, 25 Feb 2010 15:09:02 -0500</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: eolvera</title>
		<link>http://www.vuidesign.net/is-multi-modality-and-network-based-speech-recognition-the-future.htm/comment-page-1#comment-327</link>
		<dc:creator>eolvera</dc:creator>
		<pubDate>Tue, 05 Jun 2007 16:47:06 +0000</pubDate>
		<guid isPermaLink="false">http://www.vuidesign.net/is-multi-modality-and-network-based-speech-recognition-the-future.htm#comment-327</guid>
		<description>I totally agree with you in that speech-to-text is likely to continue to be very important as speech recognition technologies continue to evolve. But one very interesting trend I&#039;ve been seeing with some of the new services that offer similar types of services where you leave a voice mail and then receive the text version of it on your cell phone (Jott, Simulscribe, etc.), is that the way the &#039;solve&#039; the accuracy issue is simply by outsourcing the transcription effort to agents in another country where the labor costs certainly justify using them as an alternative to any sort of automated technology. For example, one company taking advantage of this setup is Truemors - you can call in a rumor and by using one of these services the company transcribes the contents of your message and publishes it on the website.</description>
		<content:encoded><![CDATA[<p>I totally agree with you in that speech-to-text is likely to continue to be very important as speech recognition technologies continue to evolve. But one very interesting trend I&#8217;ve been seeing with some of the new services that offer similar types of services where you leave a voice mail and then receive the text version of it on your cell phone (Jott, Simulscribe, etc.), is that the way the &#8217;solve&#8217; the accuracy issue is simply by outsourcing the transcription effort to agents in another country where the labor costs certainly justify using them as an alternative to any sort of automated technology. For example, one company taking advantage of this setup is Truemors &#8211; you can call in a rumor and by using one of these services the company transcribes the contents of your message and publishes it on the website.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: IanRae</title>
		<link>http://www.vuidesign.net/is-multi-modality-and-network-based-speech-recognition-the-future.htm/comment-page-1#comment-325</link>
		<dc:creator>IanRae</dc:creator>
		<pubDate>Tue, 05 Jun 2007 15:09:26 +0000</pubDate>
		<guid isPermaLink="false">http://www.vuidesign.net/is-multi-modality-and-network-based-speech-recognition-the-future.htm#comment-325</guid>
		<description>Interesting ideas.  I think network-based speech is a prerequisite to any sort of wide adoption of speech rec.  Multi-modal works well in hands-free activities such as driving or field service workers.  It doesn&#039;t make as much sense for cell phones which are typically held to the ear or (with Bluetooth) worn on a belt.  That being said, one real holy grail is speech-to-text dictation of notes.  There would be a huge market for the ability to speak an e-mail.  This is not currently possible with our legacy 64 kb/s telecom network with its poor audio quality of 4KHz bandwidth.  Network-based speech breaks this boundary by capturing high-quality audio and transmitting it as data to speech rec engines on the network.

The car, because it is a private space, and the user&#039;s hands and eyes are busy, offers huge opportunities for speech apps.</description>
		<content:encoded><![CDATA[<p>Interesting ideas.  I think network-based speech is a prerequisite to any sort of wide adoption of speech rec.  Multi-modal works well in hands-free activities such as driving or field service workers.  It doesn&#8217;t make as much sense for cell phones which are typically held to the ear or (with Bluetooth) worn on a belt.  That being said, one real holy grail is speech-to-text dictation of notes.  There would be a huge market for the ability to speak an e-mail.  This is not currently possible with our legacy 64 kb/s telecom network with its poor audio quality of 4KHz bandwidth.  Network-based speech breaks this boundary by capturing high-quality audio and transmitting it as data to speech rec engines on the network.</p>
<p>The car, because it is a private space, and the user&#8217;s hands and eyes are busy, offers huge opportunities for speech apps.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
