Baidu Tech Blog


Ryan and tony_meitu_1

Read the full post here.

Ryan Prenger and Tony Han, research scientists at Baidu’s Silicon Valley AI Lab, have posted a Tech Note about how SVAIL enabled the Baidu Deep Speech system to recognize Mandarin in about sixty days, an ambitious project. They write:

…In just a few months, we produced a Mandarin speech recognition system with a recognition rate better than native Mandarin speakers, in some cases. The biggest change we had to make for our Deep Speech system (originally developed in English) to work in Mandarin was increasing the size of our output layer to accommodate the larger number of Chinese characters.

 In the Tech Note, they provide details on what the team did to adapt the system to Mandarin and how the Baidu end-to-end learning approach made the project achievable.


See also our SVAIL GitHub Blog. 

SVAIL Tech Notes are written by engineers for engineers on topics related to AI technologies, techniques, tips and trends. Previous issues:

Deploying Deep Neural Networks Efficiently, by Chris Fougner

Optimizing RNN Performance, by Erich Elsen

* With credit to Jules Verne!



2018-01-17T16:20:53+00:00 February 9th, 2016|


  1. negative feedback removal amazon March 19, 2017 at 11:47 am

    I’m not that much of a online reader to be honest but your sites really nice, keep it up!
    I’ll go ahead and bookmark your site to come back later.

  2. marketing May 21, 2017 at 4:59 am

    Excellent post but I was wondering if you could write a litte more on this topic?
    I’d be very thankful if you could elaborate a little bit more.

Comments are closed.