Publications

Please also check my research profiles for more up-to-date information:

2025

  1. CONFERENCE INTERSPEECH Unified Model
    Joint Target-Speaker ASR and Activity Detection
    Chikara Maeda, Muhammad Shakeel, and Yui Sudo
    In Proceedings of the Annual Conference of the International Speech Communication Association (INTERSPEECH) (accepted), Aug 2025
  2. CONFERENCE INTERSPEECH Contextualized ASR
    DYNAC: Dynamic Vocabulary based Non-Autoregressive Contextualization for Speech Recognition
    Yui Sudo, Yosuke Fukumoto, Muhammad Shakeel, Yifan Peng, Chyi-Jiunn Lin, and 1 more author
    In Proceedings of the Annual Conference of the International Speech Communication Association (INTERSPEECH) (accepted), Aug 2025
  3. CONFERENCE INTERSPEECH Foundation Model
    OWSM v4: Improving Open Whisper-Style Speech Models via Data Scaling and Cleaning
    Yifan Peng, Muhammad Shakeel, Yui Sudo, William Chen, Jinchuan Tian, and 2 more authors
    In Proceedings of the Annual Conference of the International Speech Communication Association (INTERSPEECH) (accepted), Aug 2025
  4. JOURNAL TASLP Unified Model
    Joint Beam Search Integrating CTC, Attention, and Transducer Decoders
    Yui Sudo, Muhammad Shakeel, Yosuke Fukumoto, Brian Yan, Jiatong Shi, and 2 more authors
    IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), Jan 2025

2024

  1. CONFERENCE AIチャレンジ研究会 Contextualized ASR
    動的な語彙拡張を用いたEnd-to-end音声認識の文脈適応
    唯 周藤, Muhammad Shakeel, Peng Yifan, and 唯 周藤
    人工知能学会第二種研究会資料, Jan 2024
  2. CONFERENCE AIチャレンジ研究会 Speech Separation
    Speech Separation with Auxiliary Signal-to-Artifact Ratio Loss for Improving Multi-Talker ASR
    Ngai Matthew, Maeda Chikara, Muhammad Shakeel, and Sudo Yui
    人工知能学会第二種研究会資料, Jan 2024
  3. CONFERENCE SLT Contextualized ASR
    dv_slt2024.png
    Contextualized Automatic Speech Recognition with Dynamic Vocabulary
    Yui Sudo, Yosuke Fukumoto, Muhammad Shakeel, Yifan Peng, and Shinji Watanabe
    In Proceedings of the IEEE Spoken Language Technology Workshop (SLT) (Best Paper Award) , Dec 2024
  4. CONFERENCE INTERSPEECH Contextualized ASR
    ib_interspeech2024.jpg
    Contextualized End-to-end Automatic Speech Recognition with Intermediate Biasing Loss
    Muhammad Shakeel, Yui Sudo, Yifan Peng, and Shinji Watanabe
    In Proceedings of the Annual Conference of the International Speech Communication Association (INTERSPEECH), Sep 2024
  5. CONFERENCE ICASSP Contextualized ASR
    Contextualized Automatic Speech Recognition With Attention-Based Bias Phrase Boosted Beam Search
    Yui Sudo, Muhammad Shakeel, Yosuke Fukumoto, Yifan Peng, and Shinji Watanabe
    In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Apr 2024
  6. CONFERENCE ACL Foundation Model
    owsm_ctc.jpg
    OWSM-CTC: An Open Encoder-Only Speech Foundation Model for Speech Recognition, Translation, and Language Identification
    Yifan Peng, Yui Sudo, Muhammad Shakeel, and Shinji Watanabe
    In Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL), Aug 2024
  7. CONFERENCE INTERSPEECH Foundation Model
    OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer
    Yifan Peng, Jinchuan Tian, William Chen, Siddhant Arora, Brian Yan, and 7 more authors
    In Proceedings of the Annual Conference of the International Speech Communication Association (INTERSPEECH), Sep 2024
  8. WORKSHOP ICASSPW Unified Model
    joint_icassp2024.jpg
    Joint Optimization of Streaming and Non-Streaming Automatic Speech Recognition with Multi-Decoder and Knowledge Distillation
    Muhammad Shakeel, Yui Sudo, Yifan Peng, and Shinji Watanabe
    In IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW), Apr 2024

2023

  1. CONFERENCE ASRU Foundation Model
    Reproducing Whisper-Style Training Using An Open-Source Toolkit And Publicly Available Data
    Yifan Peng, Jinchuan Tian, Brian Yan, Dan Berrebbi, Xuankai Chang, and 11 more authors
    In Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Dec 2023
  2. CONFERENCE INTERSPEECH Unified Model
    Time-synchronous one-pass Beam Search for Parallel Online and Offline Transducers with Dynamic Block Training
    Yui Sudo, Muhammad Shakeel, Yifan Peng, and Shinji Watanabe
    In Proceedings of the Annual Conference of the International Speech Communication Association (INTERSPEECH), Aug 2023
  3. CONFERENCE INTERSPEECH Unified Model
    4D ASR: Joint modeling of CTC, Attention, Transducer, and Mask-Predict decoders
    Yui Sudo, Shakeel Muhammad, Brian Yan, Jiatong Shi, and Shinji Watanabe
    In Proceedings of the Annual Conference of the International Speech Communication Association (INTERSPEECH), Aug 2023
  4. CONFERENCE Unified Model
    End-to-end integration of online and offline encoders using auxiliary losses for automatic speech recognition
    Muhammad Shakeel, Yui Sudo, Yifan Peng, and Shinji Watanabe
    In 人工知能学会第二種研究会資料, Nov 2023
  5. CONFERENCE INTERSPEECH Efficient Model
    DPHuBERT: Joint Distillation and Pruning of Self-Supervised Speech Models
    Yifan Peng, Yui Sudo, Muhammad Shakeel, and Shinji Watanabe
    In Proceedings of the Annual Conference of the International Speech Communication Association (INTERSPEECH), Aug 2023
  6. CONFERENCE IEEE/SICE SII Anomaly Detection
    Metric-Based Multimodal Meta-Learning for Human Movement Identification Via Footstep Recognition
    Muhammad Shakeel, Katsutoshi Itoyama, Kenji Nishida, and Kazuhiro Nakadai
    In 2023 IEEE/SICE International Symposium on System Integration (SII), Aug 2023

2022

  1. JOURNAL Anomaly Detection
    3D Convolution Recurrent Neural Networks for Multi-Label Earthquake Magnitude Classification
    Muhammad Shakeel, Kenji Nishida, Katsutoshi Itoyama, and Kazuhiro Nakadai
    Applied Sciences, Aug 2022

2021

  1. JOURNAL Anomaly Detection
    earthquake_applied2021.png
    Detecting earthquakes: a novel deep learning-based approach for effective disaster response
    Muhammad Shakeel, Katsutoshi Itoyama, Kenji Nishida, and Kazuhiro Nakadai
    Applied Intelligence, Nov 2021
  2. CONFERENCE IEEE/SICE SII Anomaly Detection
    EMC: Earthquake Magnitudes Classification on Seismic Signals via Convolutional Recurrent Networks
    Muhammad Shakeel, Katsutoshi Itoyama, Kenji Nishida, and Kazuhiro Nakadai
    In 2021 IEEE/SICE International Symposium on System Integration (SII), Nov 2021
  3. CONFERENCE IEEE/SICE SII Others
    Assessment of a Beamforming Implementation Developed for Surface Sound Source Separation
    Zhi Zhong, Muhammad Shakeel, Katsutoshi Itoyama, Kenji Nishida, and Kazuhiro Nakadai
    In 2021 IEEE/SICE International Symposium on System Integration (SII), Nov 2021

2015

  1. CONFERENCE IEEE/SSSR Others
    Environmental sensing using millimeter wave sensor for extreme conditions
    Shakeel Muhammad, Daniele Nardi, Kazunori Ohno, and Satoshi Tadokoro
    In 2015 IEEE International Symposium on Safety, Security, and Rescue Robotics (SSRR), Nov 2015