Do you remember Mouse Gesture ?
I am currently working on its good friend : Voice Gesture.
Well, the name is not very explicit, but it's a voice recognition library based on the Flash Player 10.1 Microphone new feature.
I just want to show you a very early demonstration :
Voice Gesture from didier.brun on Vimeo.
As you can see, it works pretty fine ( > 95% accuracy ) but I have to admit that, for now, the algorithm require these 2 points :
- The user is the trainer (I have recorded my own voice models into the library)
- A silent place
I have some more work to optimize the algorithm and build an AIR application to record and organize the sound library.
So keep in touch, I will publish this library soon...
PS : Voice Gesture works in a simple web-based Flash Player 10.1 (I have recorded this demo using FireFox), it is not specific to AIR 2.0.
Comments (25)
Very nice ! Well done mate.
Your teaser is impressive, I’m pretty sure we will see it used in a real life project soon. I hope that the algorithm could be improved in a near future to remove defaults you found in it currently.
Interesting, apart from non-voice sounds (like the can of coke opening ) the training could be an issue for general usage.
Great work!!
Really cool work! – looking forward to seeing the published lib. There are some really cool possibilities this will bring.
Really good stuff !
I’m waiting for the next step and could see this library.
Regards
JP
Great work Didier, looking forward to seeing this evolve in 2010.
Also kudos to SidLee for getting behind some R&D, I hope others take notice and follow their example.
Very nice demo! love the pschiiit end
In the demo, you don’t speak a lot…Could that work during a conversation, for example, if I made a pause to launch the correct order ?
Very very nice stuff !
A new dimension in user experiences in our browser.
We are all waiting for sources
Have a nice day !
P.S : I spend few minutes looking for the invisible mail in my iphone with the sound at 1’14 :p
Wow ! Nice !
We can imagine a lot of interesting use cases for accessibility, and e-learning… If we don’t need to drink too Coca Cola for calibration
Thanks for your comments guys
Wow! Really nice. Especially the pretty girl that appeared when he opened the tin ^_^
Really impressive and I have an idea for future use …
I’m excited!
Great work, thanks a lot
Cool! Wonder if you used any opensource solution (ported or otherwise) like Sphinx for instance, for the voice recognition or is this all build from the ground up?
Keep up the great work Didier
k
Bien fait! Ca marche super!
Excellent !
C’est pour SidLee ?
Wow!!! Great!!
coca-cola should give you money fot this
DD – very good work !!!
Great Work…Looking forward to see the next step.
Good Stuff !!
cool!
that library would be capable of recognize phrases? or just simple words and sounds?
@ilogyc> It could work with short phrases
Hey Didier! Nice work! Are you using FFT or zero crossing frequency method? Are you thinking of sharing the source soon?
Very interesting library !
When do you think you will have a usable library ? Will it cost something ?
I’m very interesting to use this for different stuff that i would do.
I recently made one
http://vimeo.com/13637625
Trackbacks/Pingbacks (22)
[...] 翻译自原文:http://www.bytearray.org/?p=1151 [...]
[...] vidéo de Didier Brun chez Bytearray qui laisse sans voix (ahAHAH ! Pardon…) sur des essais de reconnaissance vocale. Basé sur [...]
[...] impressive voice recognition demo here, as discovered via cisnky’s Twitter [...]
[...] Didier Brun est développeur AS3, vous pouvez voir ses travaux ici : http://www.didierbrun.com et vous pouvez avoir plus d’explications sur son expérimentation ici http://www.bytearray.org/?p=1151 [...]
[...] Kaikkea sitä Flashillä voikin tehdä > http://www.bytearray.org/?p=1151 [...]
[...] “Voice Gesture” in Flash Dezember 17th, 2009 | Category: Allgemeines via bytearray.org [...]
[...] a good way to stay on top of any new exciting libraries that people are talking about. For example, Voice Gesture an article recently posted on ByteArray.org has been getting a lot attention on Twitter. It [...]
[...] on his blog a link to… ByteArray! about the coming project of voice recognition call ‘Voice Gesture‘ and developed by Didier Brun. This application take advantage of the new Microphone access [...]
[...] on his blog a link to… ByteArray! about the coming project of voice recognition call ‘Voice Gesture‘ and developed by Didier Brun. This application take advantage of the new Microphone access [...]
[...] 20 12 2009 Recently I saw a interesting tweet from Seth Sandler regarding a AS3 speech recognition lib. The demo was very impressive. So I wondered why not for python ?. So I started looking around [...]
[...] Voice Gesture [by Didier Brun] [...]
[...] Drool… more info [...]
[...] lunch came the obligatory Cool Shit hour. Here we got to see Didier Brun show off some voice recognition in Flash Player 10.1, Chris Allen of Infrared5 show off Brass Monkey, a great looking framework which will allow [...]
[...] reconocimiento de voz con AS3 http://www.bytearray.org/?p=1151 (via @yacaFx) [...]
[...] ve Voice gesture olayları ile ilgili aynı şahıstan (Didier Brun-ByteArray) iki farklı deneme: http://www.bytearray.org/?p=1151 http://www.bytearray.org/?p=91 >> [...]
[...] 10 Minuten geile Scheiße. Am meisten beeindruckt hat mich dabei Didier Bruns, der seine Spracherkennung in Flash anhand eines SingStar Klones zeigte. Wirklich erstaunlich was mit dem neuen FlashPlayer 10.1 [...]
[...] http://www.bytearray.org/?p=1151 [...]
[...] [...]
[...] Augmented Reality et ma boule de feu: AIR app and the source code Voice gesture recognition: http://www.bytearray.org/?p=1151 Intel8080 CPU emulation: http://www.bytearray.org/?p=622 Street Fighter CPU emulation: [...]
[...] Reconocimiento de Voz. [...]
[...] shown above. Some of my favourites include an excellent demonstration of face recognition, an AS3 voice recognition library and a fantastic World Construction Kit that utilises the C++ Box2D physics library, running [...]
[...] ist, der kann auf Bibliotheken zu Spracherkennung zurückgreifen. Didier Brun hat unter http://www.bytearray.org/?p=1151 eine solche Bibliothek bereits für den Flash Player 10.1 angekündigt. Oder man sendet die [...]