10.18170/DVN/SEYRX5
Yu, Shiwen (Peking University); Duan, Huiming (Peking University); Wu, Yunfang (Peking University)
Corpus of Multi-level Processing for Modern Chinese
Peking University Open Research Data Platform
2018
The Institute of Computational Linguistics at Peking University began research on the multi-level processing of Modern Chinese in 1992, and annotated the 1998 People's Daily corpus between April 1999 and April 2002. The Modern Chinese multi-level processing corpus comprises a 52-million-word basic processing corpus (word segmentation, part-of-speech tagging, named entity annotation, and phonetic transcription), a 28-million-word corpus annotated for same-shaped words (homographs), and a 560,000-word corpus annotated for parallel structures.