Search Results for author: Qiquan Zhang

Found 9 papers, 1 paper with code

Mamba in Speech: Towards an Alternative to Self-Attention

no code implementations • 21 May 2024 • Xiangyu Zhang, Qiquan Zhang, Hexin Liu, Tianyi Xiao, Xinyuan Qian, Beena Ahmed, Eliathamby Ambikairajah, Haizhou Li, Julien Epps

Moreover, experiments demonstrate the effectiveness of BiMamba as an alternative to the self-attention module in Transformer and its derivatives, particularly for the semantic-aware task.

Speech Enhancement speech-recognition +1

When LLMs Meets Acoustic Landmarks: An Efficient Approach to Integrate Speech into Large Language Models for Depression Detection

no code implementations • 17 Feb 2024 • Xiangyu Zhang, Hexin Liu, Kaishuai Xu, Qiquan Zhang, Daijiao Liu, Beena Ahmed, Julien Epps

In addition, this approach is not only valuable for the detection of depression but also represents a new perspective in enhancing the ability of LLMs to comprehend and process speech signals.

Depression Detection

Speaking in Wavelet Domain: A Simple and Efficient Approach to Speed up Speech Diffusion Model

no code implementations • 16 Feb 2024 • Xiangyu Zhang, Daijiao Liu, Hexin Liu, Qiquan Zhang, Hanyu Meng, Leibny Paola Garcia, Eng Siong Chng, Lina Yao

Recently, Denoising Diffusion Probabilistic Models (DDPMs) have attained leading performances across a diverse range of generative tasks.

Denoising Speech Enhancement +1

An Empirical Study on the Impact of Positional Encoding in Transformer-based Monaural Speech Enhancement

no code implementations • 18 Jan 2024 • Qiquan Zhang, Meng Ge, Hongxu Zhu, Eliathamby Ambikairajah, Qi Song, Zhaoheng Ni, Haizhou Li

Transformer architecture has enabled recent progress in speech enhancement.

POS Position +1

EEG-Derived Voice Signature for Attended Speaker Detection

no code implementations • 28 Aug 2023 • Hongxu Zhu, Siqi Cai, Yidi Jiang, Qiquan Zhang, Haizhou Li

Conclusion: We conclude that it is possible to derive the attended speaker's voice signature from the EEG signals so as to detect the attended speaker in a listening brain.

EEG

PoE: a Panel of Experts for Generalized Automatic Dialogue Assessment

no code implementations • 18 Dec 2022 • Chen Zhang, Luis Fernando D'Haro, Qiquan Zhang, Thomas Friedrichs, Haizhou Li

To tackle the multi-domain dialogue evaluation task, we propose a Panel of Experts (PoE), a multitask network that consists of a shared transformer encoder and a collection of lightweight adapters.

Data Augmentation Dialogue Evaluation +4

FineD-Eval: Fine-grained Automatic Dialogue-Level Evaluation

2 code implementations • 25 Oct 2022 • Chen Zhang, Luis Fernando D'Haro, Qiquan Zhang, Thomas Friedrichs, Haizhou Li

Recent model-based reference-free metrics for open-domain dialogue evaluation exhibit promising correlations with human judgment.

Dialogue Evaluation

Monaural Speech Enhancement Using a Multi-Branch Temporal Convolutional Network

no code implementations • 27 Dec 2019 • Qiquan Zhang, Aaron Nicolson, Mingjiang Wang, Kuldip K. Paliwal, Chenxu Wang

Deep learning has achieved substantial improvement on single-channel speech enhancement tasks.

Speech Enhancement

Learning Reinforced Attentional Representation for End-to-End Visual Tracking

no code implementations • 27 Aug 2019 • Peng Gao, Qiquan Zhang, Fei Wang, Liyi Xiao, Hamido Fujita, Yan Zhang

Although numerous recent tracking approaches have made tremendous advances in the last decade, achieving high-performance visual tracking remains a challenge.

Visual Tracking
