Voice Gesture [ by Didier Brun ]

Do you remember Mouse Gesture ?

I am currently working on its good friend : Voice Gesture.

Well, the name is not very explicit, but it's a voice recognition library based on the Flash Player 10.1 Microphone new feature.

I just want to show you a very early demonstration :

Voice Gesture from didier.brun on Vimeo.

As you can see, it works pretty fine ( > 95% accuracy ) but I have to admit that, for now, the algorithm require these 2 points :

  • The user is the trainer (I have recorded my own voice models into the library)
  • A silent place

I have some more work to optimize the algorithm and build an AIR application to record and organize the sound library.

So keep in touch, I will publish this library soon...

PS : Voice Gesture works in a simple web-based Flash Player 10.1 (I have recorded this demo using FireFox), it is not specific to AIR 2.0.

Comments (25)

  1. Very nice ! Well done mate.

    Wednesday, December 16, 2009 at 2:00 am #
  2. Tek wrote::

    Your teaser is impressive, I’m pretty sure we will see it used in a real life project soon. I hope that the algorithm could be improved in a near future to remove defaults you found in it currently.

    Wednesday, December 16, 2009 at 2:36 am #
  3. Elliot Rock wrote::

    Interesting, apart from non-voice sounds (like the can of coke opening ) the training could be an issue for general usage.

    Great work!!

    Wednesday, December 16, 2009 at 5:02 am #
  4. Ross Phillips wrote::

    Really cool work! – looking forward to seeing the published lib. There are some really cool possibilities this will bring.

    Wednesday, December 16, 2009 at 5:59 am #
  5. jeanphilippe wrote::

    Really good stuff !
    I’m waiting for the next step and could see this library.
    Regards
    JP

    Wednesday, December 16, 2009 at 10:06 am #
  6. simonconlin wrote::

    Great work Didier, looking forward to seeing this evolve in 2010.

    Also kudos to SidLee for getting behind some R&D, I hope others take notice and follow their example.

    Wednesday, December 16, 2009 at 10:45 am #
  7. lionel wrote::

    Very nice demo! love the pschiiit end :)
    In the demo, you don’t speak a lot…Could that work during a conversation, for example, if I made a pause to launch the correct order ?

    Wednesday, December 16, 2009 at 11:07 am #
  8. Very very nice stuff !
    A new dimension in user experiences in our browser.
    We are all waiting for sources ;)

    Have a nice day !

    P.S : I spend few minutes looking for the invisible mail in my iphone with the sound at 1’14 :p

    Wednesday, December 16, 2009 at 12:46 pm #
  9. fabien wrote::

    Wow ! Nice !
    We can imagine a lot of interesting use cases for accessibility, and e-learning… If we don’t need to drink too Coca Cola for calibration :)

    Wednesday, December 16, 2009 at 1:11 pm #
  10. Didier Brun wrote::

    Thanks for your comments guys :)

    Wednesday, December 16, 2009 at 2:40 pm #
  11. Jloa wrote::

    Wow! Really nice. Especially the pretty girl that appeared when he opened the tin ^_^

    Wednesday, December 16, 2009 at 3:11 pm #
  12. Romain wrote::

    Really impressive and I have an idea for future use …
    I’m excited!

    Great work, thanks a lot ;)

    Wednesday, December 16, 2009 at 3:45 pm #
  13. Malatze wrote::

    Cool! Wonder if you used any opensource solution (ported or otherwise) like Sphinx for instance, for the voice recognition or is this all build from the ground up?

    Wednesday, December 16, 2009 at 7:28 pm #
  14. krys wrote::

    Keep up the great work Didier :)

    k

    Wednesday, December 16, 2009 at 8:12 pm #
  15. Bien fait! Ca marche super!

    Wednesday, December 16, 2009 at 9:55 pm #
  16. Fardeen wrote::

    Excellent !

    C’est pour SidLee ?

    Wednesday, December 16, 2009 at 11:16 pm #
  17. Blackiz wrote::

    Wow!!! Great!!

    Thursday, December 17, 2009 at 2:40 am #
  18. dim wrote::

    coca-cola should give you money fot this :D DD – very good work !!!

    Saturday, December 19, 2009 at 12:37 pm #
  19. Vijay.R wrote::

    Great Work…Looking forward to see the next step.

    Monday, January 18, 2010 at 11:04 am #
  20. Gaurav wrote::

    Good Stuff !!

    Thursday, February 25, 2010 at 2:53 pm #
  21. ilogyc wrote::

    cool!

    that library would be capable of recognize phrases? or just simple words and sounds?

    Thursday, April 1, 2010 at 9:31 am #
  22. Didier Brun wrote::

    @ilogyc> It could work with short phrases :)

    Thursday, April 1, 2010 at 4:06 pm #
  23. Og2t wrote::

    Hey Didier! Nice work! Are you using FFT or zero crossing frequency method? Are you thinking of sharing the source soon?

    Monday, May 24, 2010 at 1:05 pm #
  24. Guillaume wrote::

    Very interesting library !
    When do you think you will have a usable library ? Will it cost something ?

    I’m very interesting to use this for different stuff that i would do.

    Friday, July 23, 2010 at 2:48 pm #
  25. vanilla wrote::

    I recently made one :)
    http://vimeo.com/13637625

    Monday, July 26, 2010 at 10:17 am #

Trackbacks/Pingbacks (22)

  1. Voice Gesture-语音识别for FP10.1 | 熠●极光 on Wednesday, December 16, 2009 at 5:41 am

    [...] 翻译自原文:http://www.bytearray.org/?p=1151 [...]

     
  2. Hebiflux » Reconnaissance vocale sous flash ? on Wednesday, December 16, 2009 at 10:20 am

    [...] vidéo de Didier Brun chez Bytearray qui laisse sans voix (ahAHAH ! Pardon…) sur des essais de reconnaissance vocale. Basé sur [...]

     
  3. coderkind.com » Blog Archive » Flash 10.1 voice recognition demo on Wednesday, December 16, 2009 at 1:56 pm

    [...] impressive voice recognition demo here, as discovered via cisnky’s Twitter [...]

     
  4. Reconnaissance vocale par Didier Brun on Wednesday, December 16, 2009 at 2:50 pm

    [...] Didier Brun est développeur AS3, vous pouvez voir ses travaux ici : http://www.didierbrun.com et vous pouvez avoir plus d’explications sur son expérimentation ici http://www.bytearray.org/?p=1151 [...]

     
  5. Apukeittiö.fi » Blog Archive » Puheentunnistusta on Wednesday, December 16, 2009 at 4:39 pm

    [...] Kaikkea sitä Flashillä voikin tehdä > http://www.bytearray.org/?p=1151 [...]

     
  6. Business Centered Design Blog » “Voice Gesture” in Flash on Thursday, December 17, 2009 at 11:36 am

    [...] “Voice Gesture” in Flash Dezember 17th, 2009 | Category: Allgemeines via bytearray.org [...]

     
  7. [...] a good way to stay on top of any new exciting libraries that people are talking about. For example, Voice Gesture an article recently posted on ByteArray.org has been getting a lot attention on Twitter. It [...]

     
  8. Flash platform and more… « dkor on Friday, December 18, 2009 at 2:43 pm

    [...] on his blog a link to… ByteArray! about the coming project of voice recognition call ‘Voice Gesture‘ and developed by Didier Brun. This application take advantage of the new Microphone access [...]

     
  9. Adobe Flash platform and more… « dkor on Friday, December 18, 2009 at 3:04 pm

    [...] on his blog a link to… ByteArray! about the coming project of voice recognition call ‘Voice Gesture‘ and developed by Didier Brun. This application take advantage of the new Microphone access [...]

     
  10. New NUI « UI Addict on Sunday, December 20, 2009 at 10:50 pm

    [...] 20 12 2009 Recently I saw a interesting tweet from Seth Sandler regarding a AS3 speech recognition lib. The demo was very impressive. So I wondered why not for python ?. So I started looking around [...]

     
  11. [...] Voice Gesture [by Didier Brun] [...]

     
  12. Voice Gesture by Didier Brun « Willekeurigheid on Wednesday, December 23, 2009 at 2:43 am

    [...] Drool… more info [...]

     
  13. onebyoneblog » Blog Archive » FITC in Quick Review on Wednesday, February 24, 2010 at 2:19 pm

    [...] lunch came the obligatory Cool Shit hour. Here we got to see Didier Brun show off some voice recognition in Flash Player 10.1, Chris Allen of Infrared5 show off Brass Monkey, a great looking framework which will allow [...]

     
  14. » Blog Archive » Pensamientos flexeros (2010-02-14) on Friday, February 26, 2010 at 11:47 pm

    [...] reconocimiento de voz con AS3 http://www.bytearray.org/?p=1151 (via @yacaFx) [...]

     
  15. Mouse ve Voice Gesture Olayları on Monday, March 1, 2010 at 1:19 am

    [...] ve Voice gesture olayları ile ilgili aynı şahıstan (Didier Brun-ByteArray) iki farklı deneme: http://www.bytearray.org/?p=1151 http://www.bytearray.org/?p=91 >> [...]

     
  16. FITC Amsterdam » GELB der Powerflasher Blog on Tuesday, March 2, 2010 at 2:53 pm

    [...] 10 Minuten geile Scheiße. Am meisten beeindruckt hat mich dabei Didier Bruns, der seine Spracherkennung in Flash anhand eines SingStar Klones zeigte. Wirklich erstaunlich was mit dem neuen FlashPlayer 10.1 [...]

     
  17. Pitching the Microphone with Flash Player 10.1 Beta on Thursday, March 11, 2010 at 2:09 pm

    [...] http://www.bytearray.org/?p=1151 [...]

     
  18. Altes Thema, alte Frage ... Sprachsteuerung - Flashforum on Wednesday, May 5, 2010 at 5:56 pm

    [...] [...]

     
  19. Innovation and Flash | RIAgora on Wednesday, June 2, 2010 at 4:03 pm

    [...] Augmented Reality et ma boule de feu: AIR app and the source code Voice gesture recognition: http://www.bytearray.org/?p=1151 Intel8080 CPU emulation: http://www.bytearray.org/?p=622 Street Fighter CPU emulation: [...]

     
  20. Innovación y Flash | RIA212 on Wednesday, June 2, 2010 at 6:44 pm

    [...] Reconocimiento de Voz. [...]

     
  21. Innovation through Flash on Thursday, June 3, 2010 at 3:11 pm

    [...] shown above. Some of my favourites include an excellent demonstration of face recognition, an AS3 voice recognition library and a fantastic World Construction Kit that utilises the C++ Box2D physics library, running [...]

     
  22. [...] ist, der kann auf Bibliotheken zu Spracherkennung zurückgreifen. Didier Brun hat unter http://www.bytearray.org/?p=1151 eine solche Bibliothek bereits für den Flash Player 10.1 angekündigt. Oder man sendet die [...]