SVAIL Tech Notes: Deploying Deep Neural Networks Efficiently

Deep neural networks are increasingly important for powering AI-based applications, but deploying them at scale is a challenge. In this video, Christopher Fougner, Research Scientist in Baidu Research’s Silicon Valley AI Lab, talks about how we use GPUs to solve this problem.

Some key points made by Chris:

  • Deep neural networks are increasingly important for powering AI-based applications like speech recognition.
  • Baidu’s research shows that adding GPUs to the data center makes deploying big deep neural networks practical at scale.
  • Deep-learning-based technologies benefit from batching user requests in the data center, which requires a different software architecture than traditional web applications use.
  • Find more technical details about batching user requests here (Section 7.1) 
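
To make the batching idea concrete, here is a minimal sketch in Python. It is not the system Chris describes in the video. It only illustrates the general pattern of holding incoming requests briefly so that several can be served by one pass through the network. The names collect_batch, run_model_on_batch, max_batch_size and max_wait_s are hypothetical, chosen for this example.

```python
import queue
import time


def collect_batch(requests, max_batch_size, max_wait_s):
    """Gather requests into one batch.

    Blocks until at least one request arrives, then keeps collecting until
    the batch is full or max_wait_s seconds have passed, whichever is first.
    """
    batch = [requests.get()]  # wait for the first request
    deadline = time.monotonic() + max_wait_s
    while len(batch) < max_batch_size:
        remaining = deadline - time.monotonic()
        if remaining <= 0:
            break  # waited long enough; serve what we have
        try:
            batch.append(requests.get(timeout=remaining))
        except queue.Empty:
            break  # no more requests arrived before the deadline
    return batch


def run_model_on_batch(batch):
    # Hypothetical stand-in for a single batched forward pass on a GPU.
    return [f"result for {item}" for item in batch]


if __name__ == "__main__":
    pending = queue.Queue()
    for request_id in range(5):
        pending.put(request_id)

    while not pending.empty():
        batch = collect_batch(pending, max_batch_size=3, max_wait_s=0.05)
        print(run_model_on_batch(batch))
```

The trade-off sits in the two parameters: a larger batch or a longer wait gives the GPU more work per pass, while a shorter wait keeps each user's latency down. A traditional web server handles each request independently as it arrives, which is why batching calls for a different architecture.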

November 25th, 2015