Research Article

Deep Learning Based Abstractive Text Summarization: Approaches, Datasets, Evaluation Measures, and Challenges

Table 3

Training, optimization, mechanism, and search at the decoder.

| Reference | Authors | Training and optimization | Mechanism | Search at decoder (size) |
| --- | --- | --- | --- | --- |
| [18] | Rush et al. | Stochastic gradient descent to minimise negative log-likelihood | | Beam search |
| [39] | Chopra et al. | Minimizing negative log-likelihood end-to-end using stochastic gradient descent | Encodes the position information of the input words | Beam search |
| [55] | Nallapati et al. | Optimize the conditional likelihood using Adadelta | Pointer mechanism | Beam search (5) |
| [52] | Zhou et al. | Stochastic gradient descent, Adam optimizer, optimizing the negative log-likelihood | Attention mechanism | Beam search (12) |
| [53] | Cao et al. | Adam optimizer, optimizing the negative log-likelihood | Copy mechanism, coverage mechanism, dual-attention decoder | Beam search (6) |
| [54] | Cai et al. | Cross entropy is used as the loss function | Attention mechanism | Beam search (5) |
| [50] | Adelson et al. | Adam | Attention mechanism | |
| [29] | Lopyrev | RMSProp adaptive gradient method | Simple and complex attention mechanism | Beam search |
| [38] | Jobson et al. | Adadelta, minimising the negative log probability of the predicted word | Bilinear attention mechanism, pointer mechanism | |
| [56] | See et al. | Adadelta | Coverage mechanism, attention mechanism, pointer mechanism | Beam search (4) |
| [57] | Paulus et al. | Adam, RL | Intradecoder attention mechanism, pointer mechanism, copy mechanism, RL | Beam search (5) |
| [58] | Liu et al. | Adadelta stochastic gradient descent | Attention mechanism, pointer mechanism, copy mechanism, RL | |
| [30] | Song et al. | | Attention mechanism, copy mechanism | |
| [35] | Al-Sabahi et al. | Adagrad | Pointer mechanism, coverage mechanism, copy mechanism | Bidirectional beam search |
| [59] | Li et al. | Adadelta | Attention mechanism, pointer mechanism, copy mechanism, prediction guide mechanism | Beam search |
| [60] | Kryściński et al. | Asynchronous gradient descent optimizer | Temporal attention and intra-attention pointer mechanism, RL | Beam search |
| [61] | Yao et al. | RL, Adagrad | Attention mechanism, pointer mechanism, copy mechanism, coverage mechanism, RL | Beam search (4) |
| [62] | Wan et al. | Adagrad | Attention mechanism, pointer mechanism | Beam search, backward (2) and forward (4) |
| [65] | Liu et al. | Adam | Self-attention mechanism | Beam search (5) |
| [63] | Wang et al. | Gradient of reinforcement learning, Adam, cross-entropy loss function | Attention mechanism, pointer mechanism, copy mechanism, new coverage mechanism | Beam search |
| [64] | Egonmwan et al. | Adam | Self-attention mechanism | Greedy decoding during training and validation; beam search during testing |
| [49] | Peng et al. | Adam, gradient descent, cross-entropy loss | Coverage mechanism, RL, double attention pointer network (DAPT) | Beam search (5) |
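
The parenthesized numbers in the last column of Table 3 are beam sizes, that is, the number of partial summaries the decoder keeps alive at each step. The following minimal Python sketch illustrates the general technique only; it is not the implementation used by any of the surveyed models. The names `beam_search`, `toy_next_log_probs`, and `TOY`, the `</s>` end token, and the toy probabilities are all hypothetical, and refinements such as length normalization are omitted.

```python
import math
from typing import Callable, Dict, List, Tuple

# A hypothesis is (tokens so far, cumulative log-probability).
Hypothesis = Tuple[List[str], float]


def beam_search(
    next_log_probs: Callable[[List[str]], Dict[str, float]],
    beam_size: int,
    max_len: int,
    eos: str = "</s>",
) -> Hypothesis:
    """Return the highest-scoring hypothesis found with the given beam size."""
    beams: List[Hypothesis] = [([], 0.0)]
    finished: List[Hypothesis] = []
    for _ in range(max_len):
        # Expand every live hypothesis by every possible next token.
        candidates: List[Hypothesis] = []
        for tokens, score in beams:
            for tok, lp in next_log_probs(tokens).items():
                candidates.append((tokens + [tok], score + lp))
        # Keep only the beam_size best candidates (raw log-probability,
        # no length normalization in this sketch).
        candidates.sort(key=lambda h: h[1], reverse=True)
        beams = []
        for tokens, score in candidates[:beam_size]:
            if tokens[-1] == eos:
                finished.append((tokens, score))
            else:
                beams.append((tokens, score))
        if not beams:
            break
    finished.extend(beams)  # hypotheses cut off at max_len
    return max(finished, key=lambda h: h[1])


# Hypothetical toy "decoder": the locally best first token ("a")
# leads to a worse complete sequence than the alternative ("b").
TOY = {
    (): {"a": 0.6, "b": 0.4},
    ("a",): {"x": 0.5, "y": 0.5},
    ("b",): {"z": 0.9, "w": 0.1},
}


def toy_next_log_probs(prefix: List[str]) -> Dict[str, float]:
    # Any prefix not listed in TOY ends the sequence with certainty.
    probs = TOY.get(tuple(prefix), {"</s>": 1.0})
    return {tok: math.log(p) for tok, p in probs.items()}


greedy_tokens, greedy_score = beam_search(toy_next_log_probs, beam_size=1, max_len=5)
beam_tokens, beam_score = beam_search(toy_next_log_probs, beam_size=2, max_len=5)
```

On this toy distribution, a beam size of 1 (equivalent to greedy decoding) returns `a x </s>` with probability 0.30, while a beam size of 2 recovers `b z </s>` with probability 0.36. Larger beams explore more alternatives at a proportionally higher decoding cost.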