Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition
Authors:
Ye Bai,
Jingping Chen,
Jitong Chen,
Wei Chen,
Zhuo Chen,
Chuang Ding,
Linhao Dong,
Qianqian Dong,
Yujiao Du,
Kepan Gao,
Lu Gao,
Yi Guo,
Minglun Han,
Ting Han,
Wenchao Hu,
Xinying Hu,
Yuxiang Hu,
Deyu Hua,
Lu Huang,
Mingkun Huang,
Youjia Huang,
Jishuo Jin,
Fanliu Kong,
Zongwei Lan,
Tianyu Li
, et al. (30 additional authors not shown)
Abstract:
Modern automatic speech recognition (ASR) model is required to accurately transcribe diverse speech signals (from different domains, languages, accents, etc) given the specific contextual information in various application scenarios. Classic end-to-end models fused with extra language models perform well, but mainly in data matching scenarios and are gradually approaching a bottleneck. In this wor…
▽ More
Modern automatic speech recognition (ASR) model is required to accurately transcribe diverse speech signals (from different domains, languages, accents, etc) given the specific contextual information in various application scenarios. Classic end-to-end models fused with extra language models perform well, but mainly in data matching scenarios and are gradually approaching a bottleneck. In this work, we introduce Seed-ASR, a large language model (LLM) based speech recognition model. Seed-ASR is developed based on the framework of audio conditioned LLM (AcLLM), leveraging the capabilities of LLMs by inputting continuous speech representations together with contextual information into the LLM. Through stage-wise large-scale training and the elicitation of context-aware capabilities in LLM, Seed-ASR demonstrates significant improvement over end-to-end models on comprehensive evaluation sets, including multiple domains, accents/dialects and languages. Additionally, Seed-ASR can be further deployed to support specific needs in various scenarios without requiring extra language models. Compared to recently released large ASR models, Seed-ASR achieves 10%-40% reduction in word (or character, for Chinese) error rates on Chinese and English public test sets, further demonstrating its powerful performance.
△ Less
Submitted 10 July, 2024; v1 submitted 5 July, 2024;
originally announced July 2024.
Cell-Free Massive MIMO SWIPT with Beyond Diagonal Reconfigurable Intelligent Surfaces
Authors:
Thien Duc Hua,
Mohammadali Mohammadi,
Hien Quoc Ngo,
Michail Matthaiou
Abstract:
This paper investigates the integration of beyond-diagonal reconfigurable intelligent surfaces (BD-RISs) into cell-free massive multiple-input multiple-output (CF-mMIMO) systems, focusing on applications involving simultaneous wireless information and power transfer (SWIPT). The system supports concurrently two user groups: information users (IUs) and energy users (EUs). A BD-RIS is employed to en…
▽ More
This paper investigates the integration of beyond-diagonal reconfigurable intelligent surfaces (BD-RISs) into cell-free massive multiple-input multiple-output (CF-mMIMO) systems, focusing on applications involving simultaneous wireless information and power transfer (SWIPT). The system supports concurrently two user groups: information users (IUs) and energy users (EUs). A BD-RIS is employed to enhance the wireless power transfer (WPT) directed towards the EUs. To comprehensively evaluate the system's performance, we present an analytical framework for the spectral efficiency (SE) of IUs and the average harvested energy (HE) of EUs in the presence of spatial correlation among the BD-RIS elements and for a non-linear energy harvesting circuit. Our findings offer important insights into the transformative potential of BD-RIS, setting the stage for the development of more efficient and effective SWIPT networks. Finally, incorporating a heuristic scattering matrix design at the BD-RIS results in a substantial improvement compared to the scenario with random scattering matrix design.
△ Less
Submitted 1 February, 2024;
originally announced February 2024.
Multiscale Stochastic Simulation of the US Pacific Northwest Using Distributed Computing and Databases with Integrated Inflow and Variable Renewable Energy
Authors:
Joaquim Dias Garcia,
Guilherme Machado,
André Dias,
Gerson Couto,
John Ollis,
John Fazio,
Daniel Hua
Abstract:
Modelling challenges of the United States Pacific Northwest system have grown in the last decade. Besides classical modelling difficulties such as a complex hydro cascade with many operational constraints, we have seen higher penetration of variable renewable energy inside and outside the system leading to internal issues and completely different power exchanges with the West Coast System. The ana…
▽ More
Modelling challenges of the United States Pacific Northwest system have grown in the last decade. Besides classical modelling difficulties such as a complex hydro cascade with many operational constraints, we have seen higher penetration of variable renewable energy inside and outside the system leading to internal issues and completely different power exchanges with the West Coast System. The analysis of adequacy and reliability of this system motivated the design and implementation of a five-step simulator including four planning phases and an operation step. The five levels were modeled as mathematical programs that are linked from top to bottom by fixed decisions and improvements in forecasts, on the other hand they are also linked from bottom to top by system updated states. The solution of the millions of resulting mathematical programs was made possible by applying state of the art optimization techniques and high performance data bases in a massively parallel environment.
△ Less
Submitted 5 August, 2020;
originally announced August 2020.