Baidu Research’s Silicon Valley AI Lab (SVAIL) is developing a next-generation speech recognition system named Deep Speech, which converts spoken words directly to transcribed text. A Q&A with Awni Hannun and Bryan Catanzaro.
Baidu Chief Scientist Andrew Ng delivered a keynote address at the GPU Technology Conference on March 19 in San Jose, CA. During his hour-long talk to a packed house of more than 2,500, Dr. Ng provided a look at the driving forces behind recent advances in deep learning.
Speech recognition is an established technology, but it tends to fail when we need it most, such as in noisy or crowded environments, or when the speaker is far from the microphone. At Baidu, we are working to enable truly ubiquitous, natural speech interfaces.