####
: nlp_chinese_corpus@163.com
*** update ****
10 & 9
Language Understanding Evaluation benchmark for Chinese(CLUE benchmark): run 10 tasks & 9 baselines with one line of code, performance comparision with details.
Releasing Pre-trained Model of ALBERT_Chinese:
Training with 30G+ Raw Chinese Corpus, xxlarge, small version and more, Target to match State of the Art performance in Chinese with 30% less parameters, 2019-Oct-7, During the National Day of China!
10 & 3(201951)
30 & 10 & 120191231
Update json(webtext2019zh)NLP520(translation2019zh)
#### 1.(wiki2019zh)100
#### 2.(news2016zh)250
#### 3.(baike2018qa)150
#### 4.json(webtext2019zh)410
#### 5.(translation2019zh)520
####
2019
github
1.json(wiki2019zh)
-------------------------------------------------------------------------
#### 104(1,043,224; 1.6G519M2019.2.7)
Google Drive
####
####
{"id":,"url":,"title":,"text":} titletext"\n\n"
####
{"id": "53", "url": "https://zh.wikipedia.org/wiki?curid=53", "title": "", "text": "\n\n\n\n..."}
####
----
21
1776
-1803Dismal science17981844
.....
2.json(news2016zh)
-------------------------------------------------------------------------
#### 250( 9G3.6G2014-2016)
Google Drive:k265
####
2506.3
2437.7
####
####
{'news_id': ,'title':,'content':,'source': ,'time':