Baidu Tech Blog

­

Introducing SwiftScribe: A Breakthrough in AI-Powered Transcription Software

Today we are proud to announce the beta launch of Baidu’s first AI-powered transcription software, SwiftScribe. We set out to develop SwiftScribe to fix a pain point – the time-consuming process of manually transcribing word-by-word. Now, [...]

March 13th, 2017|

PaddlePaddle’s New API Simplifies Deep Learning Programs

In September, we open sourced PaddlePaddle, the deep learning framework that has been used to power a range of Baidu products since its inception four years ago. To make the platform easier to use for [...]

March 9th, 2017|

Gram CTC: Speech Recognition with Word Piece Targets

Deep Speech presented an end-to-end neural architecture using the CTC loss for speech recognition in multiple languages. Today, we present Gram CTC which extends the CTC loss function to automatically discover and predict word pieces [...]

March 2nd, 2017|

Deep Voice: Real-Time Neural Text-to-Speech for Production

Baidu Research presents Deep Voice, a production-quality text-to-speech system constructed entirely from deep neural networks. The biggest obstacle to building such a system thus far has been the speed of audio synthesis – previous approaches [...]

February 28th, 2017|

Bringing HPC Techniques to Deep Learning

Summary: Neural networks have grown in scale over the past several years, and training can require a massive amount of data and computational resources. To provide the required amount of compute power, we scale models [...]

February 21st, 2017|

PaddlePaddle and Kubernetes Join Forces, Helping Developers Efficiently Train Deep Learning Models

Kubernetes community announced today that PaddlePaddle, the open source deep learning framework originally developed by Baidu, is now compatible with Kubernetes, the cluster management system, making PaddlePaddle the only deep learning framework that officially supports [...]

February 7th, 2017|

Baidu’s Melody: AI-Powered Conversational Bot for Doctors and Patients

Baidu has launched Melody, an AI-powered conversational bot designed to provide relevant information to doctors to assist with recommendations and treatment options. Melody incorporates advanced deep learning and natural language processing (NLP) technologies developed by Baidu. [...]

October 18th, 2016|

Baidu Research Announces New Open Source Deep Learning Benchmark

Today at the O’Reilly AI Conference in New York, Baidu Research announced DeepBench, the first open source benchmarking tool for evaluating deep learning performance across different hardware platforms. The announcement was made during a presentation [...]

September 26th, 2016|

Baidu’s Duer Personal Assistant Has New Talent: Sports Commentary

Badu’s Duer, which launched in 2015, has added a new skill to its repertoire. In addition to booking flights and ordering movie tickets, it can now hold its own as a sports announcer. In conjunction [...]

August 26th, 2016|

Baidu’s Silicon Valley AI Lab is Hiring!

Baidu’s Silicon Valley Artificial Intelligence Lab (SVAIL) has an ambitious mission: focus on cutting-edge AI research in areas such as speech recognition and translate this research into products that impact millions of users. We call [...]

August 19th, 2016|

Baidu Launches New Augmented Reality Platform for Smartphones

Today Baidu announced DuSee, a new augmented reality (AR) platform developed for smartphones. DuSee will be integrated into Baidu’s flagship platform apps, such as the Mobile Baidu search app, which has a user base of [...]

August 3rd, 2016|

SVAIL Tech Notes: Optimizing RNNs with Differentiable Graphs

This week we posted a new Tech Note in which Jesse Engel discusses a new technique for speeding up the training of deep recurrent neural networks. This is Part II of a multi-part series detailing some of the [...]

June 15th, 2016|

Adam Coates Speaks to TechEmergence about Future of Speech Recognition

Adam Coates sat down recently with Daniel Faggella from TechEmergence at our Sunnyvale office for an interview about AI, Speech Recognition and Natural Language Processing. During the interview, Coates, Director of Baidu Silicon Valley AI [...]

May 6th, 2016|

Baidu Researchers to Present at GPU Tech Conference

Baidu Research will participate in next week's GPU Tech Conference in San Jose, California. Here's a rundown of some of our activities there: - Exhibit Hall: Baidu will have a table in the "AI Playground" [...]

March 31st, 2016|

Big Data vs. Big Crowds – New Research from Baidu’s Big Data Lab

Baidu’s Big Data Lab has released a paper detailing how to use big data analytics to predict large-scale crowd formation and warn people of potentially deadly stampede events, like the tragic one that claimed the lives [...]

March 29th, 2016|