no code implementations • 28 May 2024 • Jiahuan Cao, Yongxin Shi, Dezhi Peng, Yang Liu, Lianwen Jin
To fill this gap, this paper introduces C$^{3}$bench, a Comprehensive Classical Chinese understanding benchmark, which comprises 50, 000 text pairs for five primary CCU tasks, including classification, retrieval, named entity recognition, punctuation, and translation.
1 code implementation • 28 Feb 2024 • Yang Liu, Jiahuan Cao, Chongyu Liu, Kai Ding, Lianwen Jin
Additionally, a comprehensive review of the existing available dataset resources is also provided, including statistics from 444 datasets, covering 8 language categories and spanning 32 domains.