Complexity

Research Article

ABioNER: A BERT-Based Model for Arabic Biomedical Named-Entity Recognition

An overview of methods used for biomedical named-entity recognition.


Paper	Language	NE	Method	Results

[8]	English	Gene protein	SVM	Best balanced F1-score = 0.79
[9]	English	Chemical mentions	Hybrid (CRF + dictionary)	F1-score = 68.1
[10]	English	Problem treatment test protein DNA RNA cell type cell line	Unsupervised learning	Overall performance (exact micro-F) Pittsburgh dataset: 53.1 GENIA dataset: 39.5
[11]	English	Disease	Hybrid (stacked ensemble + fuzzy matching)	F1-score = 89.12%
[12]	English	Disease	Multiple label convolutional neural networks	F1-score NCBI corpus: 85.17% CDR corpus: 87.83%
[13]	English	Document-level chemical NER	Hybrid (attention-based BiLSTM-CRF)	F1-score HEMDNER corpus: 91.14% CDR corpus: 92.57%
[14]	English	Genes diseases protein DNA RNA cell type cell line	n-Gram character and word embeddings via convolutional neural network	F1-score: NCBI dataset: 87.26% Biocreative II dataset: 87.26% JNLPBA dataset: 72.57%
[15]	English	Chemicals	Transfer learning	F1-score:
		Diseases		88.21
		Species		82.09
		Gene		87.01
		Protein		83.09
[16]	English	Diseases	Bidirectional encoder representations from transformers	Best F1-score 89.71 75.31
[16]	English	Species	Bidirectional encoder representations from transformers	Best F1-score 89.71 75.31
[17]	Spanish/Swedish	Spanish: disease/drug Swedish: body part/disorder/finding	Bidirectional long short-term memory network	Avg. F1-score: Spanish: 75.25 Swedish: 76.04
[2]	Arabic	Disease diagnosis symptoms treatment methods	Bayesian belief network (BBN)	Avg. F1-score: 71.05%