Awni Hannun


Publications and Preprints

  • Ambulatory Atrial Fibrillation Monitoring Using Wearable Photoplethysmography with Deep Learning, Yichen Shen, Maxime Voisin, Alireza Aliamiri, Anand Avati, Awni Hannun and Andrew Ng. KDD 2019. (paper, web page)
  • Sequence-to-Sequence Speech Recognition with Time-Depth Separable Convolutions, Awni Hannun, Ann Lee, Qiantong Xu, Ronan Collobert. Interspeech 2019. (paper, code)
  • A Fully Differentiable Beam Search Decoder, Ronan Collobert, Awni Hannun, Gabriel Synnaeve. ICML 2019. (paper, blog)
  • Wav2Letter++: A Fast Open-source Speech Recognition System,Vineel Pratap, Awni Hannun, Qiantong Xu, Jeff Cai, Jacob Kahn, Gabriel Synnaeve, Vitaliy Liptchinsky, Ronan Collobert. ICASSP 2019. (paper, code, blog)
  • Cardiologist-level Arrhythmia Detection and Classification in Ambulatory Electrocardiograms Using a Deep Neural Network. Awni Y. Hannun*, Pranav Rajpurkar*, Masoumeh Haghpanahi*, Geoffrey H. Tison*, Codie Bourn, Mintu P. Turakhia and Andrew Y. Ng. Nature Medicine, 2019. (paper, code, web page)
  • Transcribing Real-valued Sequences with Deep Neural Networks, Awni Y. Hannun. PhD Thesis, Stanford University, 2018. (pdf, LaTex)
  • Sequence Modeling With CTC, Awni Y. Hannun. Distill, 2017. (html, code)
  • Cardiologist-Level Arrhythmia Detection with Convolutional Neural Networks,. Pranav Rajpurkar*, Awni Y. Hannun*, Masoumeh Haghpanahi, Codie Bourn and Andrew Y. Ng. arXiv:1707.01836, 2017. (paper, web page)
    Mentions: MIT Tech Review, Stanford News
  • Building DNN Acoustic Models for Large Vocabulary Speech Recognition,. Andrew L. Maas, Peng Qi, Ziang Xie, Awni Y. Hannun, Christopher T. Lengerich, Daniel Jurafsky and Andrew Y. Ng. (2017). Computer Speech & Language, Volume 41, Pages 195-213. (link)
  • An End-to-End Architecture for Keyword Spotting and Voice Activity Detection, Chris Lengerich* and Awni Hannun*. NeurIPS 2016 Workshop on End-to-End Learning for Speech and Audio Processing. (paper, code)
  • Persistent RNNs: Stashing Recurrent Weights On-Chip, Gregory Diamos, Shubho Sengupta, Bryan Catanzaro, Mike Chrzanowski, Adam Coates, Erich Elsen, Jesse Engel, Awni Hannun, Sanjeev Satheesh. ICML 2016. (pdf)
  • Deep Speech 2: End-to-End Speech Recognition in English and Mandarin, SVAIL. ICML 2016. (pdf, long)
    Mentions: MIT Tech Review, MIT Tech Review
  • Lookahead Convolution Layer for Unidirectional Recurrent Neural Networks, Chong Wang*, Dani Yogatama*, Adam Coates, Tony Han, Awni Hannun, and Bo Xiao. ICLR Workshop, 2016. (pdf)
  • Learning Multiscale Features Directly From Waveforms, Zhenyao Zhu, Jesse H. Engel, Awni Hannun. Interspeech, 2016. (paper)
  • Deep Speech: Scaling up end-to-end speech recognition, Awni Y. Hannun, Carl Case, Jared Casper, Bryan Catanzaro, Greg Diamos, Erich Elsen, Ryan Prenger, Sanjeev Satheesh, Shubho Sengupta, Adam Coates, Andrew Y. Ng. arXiv:1412.5567, 2014. (pdf)
    Mentions: Forbes
  • First-Pass Large Vocabulary Continuous Speech Recognition using Bi-Directional Recurrent DNNs, Awni Y. Hannun, Andrew L. Maas, Daniel Jurafsky and Andrew Y. Ng. arXiv:1408.2873, 2014. (pdf, example decoder)
  • Increasing Deep Neural Network Acoustic Model Size for Large Vocabulary Continuous Speech Recognition, Andrew L. Maas, Awni Y. Hannun, Christopher T. Lengerich, Peng Qi, Daniel Jurafsky and Andrew Y. Ng. arXiv:1406.7806, 2014. (pdf)
  • Rectifier Nonlinearities Improve Neural Network Acoustic Models, Andrew L. Maas, Awni Y. Hannun, and Andrew Y. Ng. ICML Workshop on Deep Learning for Audio, Speech, and Language Processing (WDLASL 2013). (pdf)
  • Recurrent Neural Network Feature Enhancement: The 2nd Chime Challenge, Andrew L. Maas, Tyler M. O'Neil, Awni Y. Hannun, Andrew Y. Ng. The 2nd International Workshop on Machine Listening in Multisource Environments (CHiME 2013). (pdf)