Sheng Li: List of Research Achievements (24 of 141 items)
Papers
-
Kai Wang,
Lili Yin,
Sheng Li,
Madina Mansurova,
Hao Huang.
Neural TTS-based Dynamic Data Augmentation for Improved Speech Separation,
IEEE Trans. Audio, Speech & Language Process. (TASLP),
Dec. 2025.
-
Chin Yuen Kwok,
He Xin Liu,
Jia Qi Yip,
Sheng Li,
Eng Siong Chng.
A Two-Stage LoRA Strategy for Expanding Language Capabilities in Multilingual ASR Models,
IEEE Trans. Audio, Speech & Language Process. (TASLP),
Dec. 2025.
-
Longfei Yang,
Jiyi Li,
Sheng Li,
Takahiro Shinozaki.
Multi-Domain Dialogue State Tracking with Large Language Model Rationale and Disentangled Domain-Slot Attention,
IEEE Trans. Audio, Speech & Language Process. (TASLP),
Dec. 2025.
-
Zhengdong Yang,
Qianying Liu,
Sheng Li,
Fei Cheng,
Chenhui Chu.
Cross-lingual Embedding Clustering for Hierarchical Softmax in Low-Resource Multilingual Speech Recognition,
IEEE Trans. Audio, Speech & Language Process. (TASLP),
Dec. 2025.
International Conference Presentations (peer-reviewed)
-
Chin Yuen Kwok,
Sheng Li,
Jia Qi Yip,
Chenhui Chu,
Tatsuya Kawahara,
Eng Siong Chng.
Extending Whisper for Emotion Prediction Using Word-level Pseudo Labels,
IEEE-ICASSP,
Dec. 2025.
-
Jun-You Wang,
Sheng Li,
Li-An Lu,
Sydney Chia-Chun Kao,
Jyh-Shing Roger Jang.
Similarity-based accent recognition with continuous and discrete self-supervised speech representations,
IEEE-ICASSP,
Dec. 2025.
-
Jiliang Hu,
Zuchao Li,
Mengjia Shen,
Haojun Ai,
Sheng Li,
Jun Zhang.
Joint Automatic Speech Recognition And Structure Learning For Better Speech Understanding,
IEEE-ICASSP,
Dec. 2025.
-
Zhao Ren,
Rathi Adarshi Rammohan,
Kevin Scheck,
Sheng Li,
Tanja Schultz.
End-to-end Acoustic-linguistic Emotion and Intent Recognition Enhanced by Semi-supervised Learning,
International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC),
Dec. 2025.
-
Haowei Lou,
Hye young Paik,
Pari Delir Haghighi,
Sheng Li,
Wen Hu,
Lina Yao.
LatentSpeech: Latent Diffusion for Text-To-Speech Generation,
Proc. RO-MAN,
Dec. 2025.
-
Jing Li,
Felix Schijve,
Sheng Li,
Yuye Yang,
Jun Hu,
Emilia Barakova.
Towards Emotion Co-regulation with LLM-powered Socially Assistive Robots: Integrating LLM Prompts and Robotic Behaviors to Support Parent-Neurodivergent Child Dyads,
Proc. IROS,
Dec. 2025.
-
Jing Li,
Sheng Li,
Emilia I. Barakova,
Felix Schijve,
Jun Hu.
Designing an LLM-powered Social Robot for Supporting Emotion Regulation In Parent-Child Dyads,
Proc. RO-MAN (late breaking),
Dec. 2025.
-
Zhengdong Yang,
Zhen Wan,
Sheng Li,
Chao-Han Huck Yang,
Chenhui Chu.
CoVoGER: A Multilingual Multitask Benchmark for Speech-to-text Generative Error Correction with Large Language Models,
EMNLP (main long),
Dec. 2025.
-
Jianing Yang,
Sheng Li,
Takahiro Shinozaki,
Yuki Saito,
Hiroshi Saruwatari.
Emotional Text-To-Speech Based on Mutual-Information-Guided Emotion-Timbre Disentanglement,
APSIPA ASC,
Dec. 2025.
-
Chengxi Lei,
Sheng Li,
Satwinder Singh,
Feng Hou,
Huia Jahnke,
Ruili Wang.
Empowering Māori Automatic Speech Recognition through EMD-Based Augmentation,
Pacific Rim International Conference on Artificial Intelligence (PRICAI),
Nov. 2025.
-
Hongli Yang,
Yizhou Peng,
Hao Huang,
Sheng Li.
Adapting Whisper for Parameter-efficient Code-Switching Speech Recognition via Soft Prompt Tuning,
INTERSPEECH,
Aug. 2025.
-
Hongli Yang,
Sheng Li,
Hao Huang,
Ayiduosi Tuohan,
Yizhou Peng.
Language-Aware Prompt Tuning for Parameter-Efficient Seamless Language Expansion in Multilingual ASR,
INTERSPEECH,
Aug. 2025.
-
Wangjin Zhou,
Tianjiao Du,
Chenglin Xu,
Sheng Li,
Yi Zhao,
Tatsuya Kawahara.
Simple and Effective Content Encoder for Singing Voice Conversion via Dimension Reduction,
INTERSPEECH,
Aug. 2025.
-
Pengcheng Wang,
Sheng Li,
Takahiro Shinozaki.
RAG-Boost: Retrieval-Augmented Generation Enhanced LLM-based Speech Recognition,
Interspeech2025 MLC-SLM Challenge workshop,
Aug. 2025.
-
Zhengdong Yang,
Sheng Li,
Chenhui Chu.
Generative Error Correction for Emotion-aware Speech-to-text Translation,
Proc. ACL (findings),
July 2025.
-
Zhen Wan,
Chao-Han Huck Yang,
Yahan Yu,
Jinchuan Tian,
Sheng Li,
Ke Hu,
Zhehuai Chen,
Shinji Watanabe,
Fei Cheng,
Chenhui Chu,
Sadao Kurohashi.
SIQ: Exterminating Speech Intelligence Quotient Cross Cognitive Levels in Voice Understanding Large Language Models,
Proc. ACL main-long,
July 2025.
-
Yu Xu,
Xiaokai Qin,
Tianyu Fan,
Eng Siong Chng,
Sheng Li,
Nobuaki Minematsu,
Daisuke Saito.
Bandwidth Extension System for Throat Microphone Speech Reconstruction,
Proc. IEEE-ICME,
July 2025.
Domestic Conference Presentations (not peer-reviewed or review status unknown)
-
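Jianing Yang,
Sheng Li,
Takahiro Shinozaki,
Yuki Saito,
Hiroshi Saruwatari.
Emotional Speech Synthesis Based on Emotion-Timbre Disentanglement via Mutual Information Minimization,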
ASJ2025 autumn,
Dec. 2025.
-
Pengcheng Wang,
Sheng Li,
Takahiro Shinozaki.
RAG-Boost: Retrieval-Augmented Generation Enhanced Speech Recognition in LLM-based Spoken Dialogue Systems,
Proceedings of the Acoustical Society of Japan Meeting, Autumn,
Sept. 2025.
-
Takahiro Shinozaki,
佐藤秋太朗,
Sheng Li.
CEFR-J Level Estimation of English Learners' Utterances Using Large Language Models,
Mar. 2025.