Review Article

A Comprehensive Survey of Abstractive Text Summarization Based on Deep Learning

Table 3

The statistics of the standard datasets.

DatasetLang.#Train#Valid.#Test.Ave. source lengthAve. target length

GigawordEng.3,800,000189,000195131.48.3
CNN/Daily MailEng.287,22613,36811,49078056
NYTEng.589,28432,73632,73954940
NewsroomEng.995,041105,760105,760658.626.7
LCSTSChi.2,400,59110,6661,106103.717.8