Search Results for author: Minpeng Liao

Found 5 papers, 4 papers with code

BLSP-KD: Bootstrapping Language-Speech Pre-training via Knowledge Distillation

no code implementations • 29 May 2024 • Chen Wang, Minpeng Liao, Zhongqiang Huang, Jiajun Zhang

Recent end-to-end approaches have shown promise in extending large language models (LLMs) to speech inputs, but face limitations in directly assessing and optimizing alignment quality and fail to achieve fine-grained alignment due to speech-text length mismatch.

AlphaMath Almost Zero: Process Supervision without Process

1 code implementation • 6 May 2024 • Guoxin Chen, Minpeng Liao, Chengxi Li, Kai Fan

The experimental results on both in-domain and out-of-domain datasets demonstrate that even without GPT-4 or human-annotated process supervision, our AlphaMath framework achieves comparable or superior results to previous state-of-the-art methods.

Mathematical Reasoning

MARIO: MAth Reasoning with code Interpreter Output -- A Reproducible Pipeline

1 code implementation • 16 Jan 2024 • Minpeng Liao, Wei Luo, Chengxi Li, Jing Wu, Kai Fan

Large language models (LLMs) have seen considerable advancements in natural language understanding tasks, yet there remains a gap to bridge before attaining true artificial general intelligence, especially concerning shortcomings in mathematical reasoning capabilities.

GSM8K Math +2

BLSP: Bootstrapping Language-Speech Pre-training via Behavior Alignment of Continuation Writing

1 code implementation • 2 Sep 2023 • Chen Wang, Minpeng Liao, Zhongqiang Huang, Jinliang Lu, Junhong Wu, Yuchen Liu, Chengqing Zong, Jiajun Zhang

One is a cascaded approach where outputs (tokens or states) of a separately trained speech recognition system are used as inputs for LLMs, which limits their potential in modeling alignment between speech and text.

Speech Recognition +1

Adapting Offline Speech Translation Models for Streaming with Future-Aware Distillation and Inference

1 code implementation • 14 Mar 2023 • Biao Fu, Minpeng Liao, Kai Fan, Zhongqiang Huang, Boxing Chen, Yidong Chen, Xiaodong Shi

A popular approach to streaming speech translation is to employ a single offline model with a wait-k policy to support different latency requirements, which is simpler than training multiple online models with different latency constraints.

FAD Translation
