Search Results for author: Aoyu Li

Found 4 papers, 2 papers with code

PipeFusion: Displaced Patch Pipeline Parallelism for Inference of Diffusion Transformer Models

1 code implementation23 May 2024 Jiannan Wang, Jiarui Fang, Aoyu Li, Pengcheng Yang

This paper introduces PipeFusion, a novel approach that harnesses multi-GPU parallelism to address the high computational and latency challenges of generating high-resolution images with diffusion transformers (DiT) models.

Exploring the Impact of In-Browser Deep Learning Inference on Quality of User Experience and Performance

no code implementations8 Feb 2024 QiPeng Wang, Shiqi Jiang, Zhenpeng Chen, Xu Cao, Yuanchun Li, Aoyu Li, Ying Zhang, Yun Ma, Ting Cao, Xuanzhe Liu

Additionally, we noticed that in-browser inference increases the time it takes for graphical user interface (GUI) components to load in web browsers by a significant 67. 2\%, which severely impacts the overall QoE for users of web applications that depend on this technology.

BiBench: Benchmarking and Analyzing Network Binarization

1 code implementation26 Jan 2023 Haotong Qin, Mingyuan Zhang, Yifu Ding, Aoyu Li, Zhongang Cai, Ziwei Liu, Fisher Yu, Xianglong Liu

Network binarization emerges as one of the most promising compression approaches offering extraordinary computation and memory savings by minimizing the bit-width.

Benchmarking Binarization

Informative Sample-Aware Proxy for Deep Metric Learning

no code implementations18 Nov 2022 Aoyu Li, Ikuro Sato, Kohta Ishikawa, Rei Kawakami, Rio Yokota

Among various supervised deep metric learning methods proxy-based approaches have achieved high retrieval accuracies.

Metric Learning Retrieval

Cannot find the paper you are looking for? You can Submit a new open access paper.