Search Results for author: Jiangsu Du

Found 3 papers, 0 papers with code

Galaxy: A Resource-Efficient Collaborative Edge AI System for In-situ Transformer Inference

no code implementations27 May 2024 Shengyuan Ye, Jiangsu Du, Liekang Zeng, Wenzhong Ou, Xiaowen Chu, Yutong Lu, Xu Chen

Transformer-based models have unlocked a plethora of powerful intelligent applications at the edge, such as voice assistant in smart home.

SAIH: A Scalable Evaluation Methodology for Understanding AI Performance Trend on HPC Systems

no code implementations7 Dec 2022 Jiangsu Du, Dongsheng Li, Yingpeng Wen, Jiazhi Jiang, Dan Huang, Xiangke Liao, Yutong Lu

In this paper, we propose a scalable evaluation methodology (SAIH) for analyzing the AI performance trend of HPC systems with scaling the problem sizes of customized AI applications.

EnergonAI: An Inference System for 10-100 Billion Parameter Transformer Models

no code implementations6 Sep 2022 Jiangsu Du, Ziming Liu, Jiarui Fang, Shenggui Li, Yongbin Li, Yutong Lu, Yang You

Although the AI community has expanded the model scale to the trillion parameter level, the practical deployment of 10-100 billion parameter models is still uncertain due to the latency, throughput, and memory constraints.

Blocking

Cannot find the paper you are looking for? You can Submit a new open access paper.