Differences between revisions 10 and 11
Revision 10 as of 2007-09-04 18:30:30
Size: 3260
Editor: 68
Comment: flint cites IBM via voice as a code source, mostly out of complete ignorance.
Revision 11 as of 2007-11-21 14:40:32
Size: 3047
Editor: i59F71AB2
Deletions are marked like this. Additions are marked like this.
Line 21: Line 21:

---- /!\ '''Edit conflict - other version:''' ----
 * It will be great if we have a multilingual voice contolled system

---- /!\ '''Edit conflict - your version:''' ----

---- /!\ '''End of edit conflict''' ----
Line 60: Line 53:
 * Document structure inspired by the FeistyTelepathy spec wiki page, thanks to you. --kalon33  * Document structure inspired by the [:Spec/FeistyTelepathy] spec wiki page, thanks to you. --kalon33



As Ubuntu wants to be a serious alternative to M$ systems, it would be useful and interesting to provide a spoken system (which is complementary of the spoken boot) and to be able to launch current tasks using our voice.


These last years, a very important effort is done to ameliorate the user interface. In order to perpetuate and revolutionize it, we should be able to innovate, and try to include a spoken system and at least a few voice commands in order to make the users life different, and provide something that is unique and ambitious.

Use cases

  • Nicolas wants to be able to communicate more naturally with his system, hands free, and he wants his system to read for him the main outputs it feeds back.
  • Paul, a blind person, can also benefit from this system, with a more interactive OS.
  • It will be great if we have a multilingual voice controlled system


The spec aims to provide an advanced stack of tools preconfigured for doing voice control of the system and for the system to read the error and information messages. The voice control could be made of a voice recording assistant which, a bit like the "keyboard shortcuts" one, can associate a word with a system command (instead of keys combinations) but instead of it, tolerates a few differences, because voice is not always the same. The information and error messages reading could be done using espeak, festival or similar, like the system already included in kde (kttsd, ksayit and kmouth with a voice synthesis system)


  • Use packages that are already in our repositories, like espeak and festival, and try to find those which can be useful for the voice command system part.
  • Maybe use improvements and/or additional packages or programs provided by developers interested by this challenge.


* Not started

Current Status

  • Initial Drafting

Things being worked on / problems

Things feasible for Gutsy release

Things yet to come



  • Document structure inspired by the [:Spec/FeistyTelepathy] spec wiki page, thanks to you. --kalon33
  • ["Warbo"]: The only advanced Open Source voice recognition software I know of is Sphinx, which Google tells me is based here: http://cmusphinx.sourceforge.net/html/cmusphinx.php Hope it helps Smile :)

  • This also ties to the disability accomidation aspect of the product. [http://www-304.ibm.com/jct03001c/software/pervasive/embedded_viavoice/ IBM Viavoice] has been bandied about as available technology for FOSS use. Maybe some nice person from IBM could come to the Developers Conference in October and explain this to us.


SpokenSystemAndVoiceCommand (last edited 2008-08-06 16:13:55 by localhost)