Table of Contents Author Guidelines Submit a Manuscript
Journal of Healthcare Engineering
Volume 2017, Article ID 4898963, 16 pages
https://doi.org/10.1155/2017/4898963
Research Article

A Novel Approach towards Medical Entity Recognition in Chinese Clinical Text

1Second Affiliated Hospital Zhejiang University School of Medicine, Hangzhou, Zhejiang Province 310000, China
2Sir Run Run Shaw Hospital Zhejiang University School of Medicine, Hangzhou, Zhejiang Province 310000, China
3Zhejiang University International Hospital, Hangzhou, Zhejiang Province 310000, China
4National Center for Advancing Translation Sciences, National Institutes of Health, 9800 Medical Center Drive, Building C, Room 312, Rockville, MD 20850, USA
5Hangzhou Medical College, Hangzhou, Zhejiang Province 310000, China
6Peking University Center for Medical Informatics Center, Beijing 100191, China
7Southwest Medical University, Luzhou, Sichuan Province 646000, China

Correspondence should be addressed to Jianbo Lei; nc.ude.ukp.csh@ielbj

Received 31 March 2017; Revised 5 May 2017; Accepted 16 May 2017; Published 5 July 2017

Academic Editor: Jiang Bian

Copyright © 2017 Jun Liang et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. B. A. Nctpnh, Wenzi de Chansheng He Fazhan (the Origin and Development of Written Language), Peking University Press, Peking, 1987, in Chinese.
  2. H. Tu, Y. Yu, P. Yang et al., “Building clinical data groups for electronic medical record in China,” Journal of Medical Systems, vol. 36, no. 2, pp. 723–736, 2010. View at Google Scholar
  3. D. Liu, X. Wang, F. Pan et al., “Harmonization of health data at national level: a pilot study in China,” International Journal of Medical Informatics, vol. 79, no. 6, pp. 450–458, 2010. View at Publisher · View at Google Scholar · View at Scopus
  4. J. Liang, M. Xu, L. Zhang et al., “Developing interoperable electronic health record service in China,” International Journal Digital Content Technology Applications, vol. 5, no. 4, pp. 280–295, 2011. View at Google Scholar
  5. B. Li, “China health white paper,” in China Health Information Technology Conference & International Forum, Zhenzou, 2013, in Chinese.
  6. S. M. Meystre, G. K. Savova, K. C. Kipper-Schuler, and J. F. Hurdle, “Extracting information from textual documents in the electronic health record: a review of recent research,” Yearbook of Medical Informatics, vol. 2008, no. 1, pp. 128–144, 2008. View at Google Scholar
  7. O. Uzuner, I. Solti, and E. Cadag, “Extracting medication information from clinical text,” Journal of the American Medical Informatics Association, vol. 17, no. 5, pp. 514–518, 2010. View at Publisher · View at Google Scholar · View at Scopus
  8. L. Wieger and L. Davrout, Chinese Characters: Their Origin, Etymology, History, Classification and Signification: A Thorough Study from Chinese Documents, Dover Publications, New York, 1965.
  9. Q. Xie, M. Cui, Z. D. Wu, and H. Zhao, “Traditional Chinese medicine information digitalization discussion,” Journal of Alternative and Complementary Medicine, vol. 16, no. 11, pp. 1207–1209, 2010. View at Publisher · View at Google Scholar · View at Scopus
  10. Y. Bastanlar and M. Ozuysal, “Introduction to machine learning,” Methods in Molecular Biology, vol. 1107, no. 1, pp. 105–128, 2014. View at Google Scholar
  11. B. Shiner, L. W. D'Avolio, T. M. Nguyen, M. H. Zayed, B. V. Watts, and L. Fiore, “Automated classification of psychotherapy note text: implications for quality assessment in PTSD care,” Journal of Evaluation in Clinical Practice, vol. 18, no. 3, pp. 698–701, 2011. View at Google Scholar
  12. M. Liu, A. Shah, M. Jiang et al., A Study of Transportability of an Existing Smoking Status Detection Module across Institutions, AMIA Annu Symp Proc, Chicago, 2012.
  13. O. Uzuner, “Recognizing obesity and comorbidities in sparse data,” Journal of the American Medical Informatics Association, vol. 16, no. 4, pp. 561–570, 2009. View at Publisher · View at Google Scholar · View at Scopus
  14. E. Sirohi and P. Peissig, “Study of effect of drug lexicons on medication extraction from electronic medical records,” in Pacific Symposium on Biocomputing, pp. 308–318, Hawaii, 2005.
  15. O. Uzuner, I. Goldstein, Y. Luo, and I. Kohane, “Identifying patient smoking status from medical discharge records,” Journal of the American Medical Informatics Association, vol. 15, no. 1, pp. 14–24, 2008. View at Publisher · View at Google Scholar · View at Scopus
  16. O. Uzuner, B. R. South, S. Shen, and S. L. DuVall, “2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text,” Journal of the American Medical Informatics Association, vol. 18, no. 5, pp. 552–556, 2010. View at Google Scholar
  17. K. Roberts and S. M. Harabagiu, “A flexible framework for deriving assertions from electronic medical records,” Journal of the American Medical Informatics Association, vol. 18, no. 5, pp. 568–573, 2011. View at Publisher · View at Google Scholar · View at Scopus
  18. S. Doan, M. Conway, T. M. Phuong, and L. Ohno-Machado, “Natural language processing in biomedicine: a unified system architecture overview,” Methods in Molecular Biology, vol. 1168, no. 1, pp. 275–294, 2014. View at Google Scholar
  19. Y. Wang, Z. Yu, L. Chen et al., “Supervised methods for symptom name recognition in free-text clinical records of traditional Chinese medicine: an empirical study,” Journal of Biomedical Informatics, vol. 47, no. 2, pp. 91–104, 2013. View at Google Scholar
  20. B. Gu, F. Popowich, and V. Dahl, “Recognizing biomedical named entities in Chinese research abstracts,” in Advances in Artificial Intelligence: 21st Conference of the Canadian Society for Computational Studies of Intelligence, Canadian AI 2008, S. Bergler, Ed., pp. 114–125, Windsor, Canada, May 2008, Proceedings. Berlin, Heidelberg: Springer Berlin Heidelberg. View at Publisher · View at Google Scholar · View at Scopus
  21. Y. Xu, Y. Wang, T. Liu et al., “Joint segmentation and named entity recognition using dual decomposition in Chinese discharge summaries,” Journal of the American Medical Informatics Association, vol. 21, no. e1, pp. e84–e92, 2013. View at Google Scholar
  22. J. Lei, B. Tang, X. Lu, K. Gao, M. Jiang, and H. Xu, “A comprehensive study of named entity recognition in Chinese clinical text,” Journal of the American Medical Informatics Association, vol. 21, no. 5, pp. 808–814, 2014. View at Google Scholar
  23. J. Liang, X. Zheng, M. Xu, X. He, and Y. Xing, “A combined classification model for Chinese clinical notes,” International Journal of Applied Mathematics and Statistics, vol. 49, no. 19, pp. 201–209, 2013. View at Google Scholar
  24. L. NaiFen and S. ZiLin, Chinese Clinical Medical Orders Manual, Science Press, Peking, 2007, in Chinese.
  25. Y. Mo, “Chinese naming and translation of Western medicine,” Journal of Chengde Medical College, vol. 30, no. 3, pp. 264-265, 2013, in Chinese. View at Google Scholar
  26. H.-P. Zhang, H.-K. Yu, D.-Y. Xiong, and Q. Liu, HHMM-Based Chinese Lexical Analyzer ICTCLAS, Proceedings of the second SIGHAN workshop on Chinese language processing, Sapporo, 2003. View at Publisher · View at Google Scholar · View at Scopus
  27. Y. Yang and J. Wilbur, “Using corpus statistics to remove redundant words in text categorization,” Journal of the American Society for Information Science, vol. 47, no. 5, pp. 357–369, 1996. View at Publisher · View at Google Scholar
  28. P. Zweigenbaum, T. Lavergne, N. Grabar, T. Hamon, S. Rosset, and C. Grouin, “Combining an expert-based medical entity recognizer to a machine-learning system: methods and a case study,” Biomedical Informatics Insights, vol. 6, Supplement 1, pp. 51–62, 2013. View at Publisher · View at Google Scholar
  29. S. K. Saha, P. Mitra, and S. Sarkar, “A comparative study on feature reduction approaches in Hindi and Bengali named entity recognition,” Knowledge-Based Systems, vol. 27, no. 1, pp. 322–332, 2012. View at Google Scholar
  30. T. Joachims, “Text categorization with support vector machines: learning with many relevant features,” in 10th European Conference on Machine Learning, pp. 137–142, Chemnitz, 1998.
  31. Y. Wang, Z. Yu, Y. Jiang, K. Xu, and X. Chen, “Automatic symptom name normalization in clinical records of traditional Chinese medicine,” BMC Bioinformatics, vol. 11, pp. 40–50, 2010. View at Publisher · View at Google Scholar · View at Scopus
  32. J. Yu, Y. Wang, and H. Chen, “An improved text feature extraction algorithm based on N-gram,” Library and Information Service, vol. 48, no. 8, pp. 48–50, 2005, in Chinese. View at Google Scholar
  33. M. Hall, E. Frank, G. Holmes, B. Pfahringer, P. Reutemann, and I. H. Witten, “The WEKA data mining software: an update,” SIGKDD Exploration Newsletter, vol. 11, no. 1, pp. 10–18, 2009. View at Publisher · View at Google Scholar
  34. B. Tang, H. Cao, Y. Wu, M. Jiang, and H. Xu, “Recognizing clinical entities in hospital discharge summaries using structural support vector machines with word representation features,” BMC Medical Informatics and Decision Making, vol. 13, Supplement 1, pp. 13–23, 2013. View at Google Scholar
  35. C. Sutton and A. McCallum, “An introduction to conditional random fields,” Machine Learning, vol. 4, no. 4, pp. 267–373, 2011. View at Google Scholar
  36. M. Sun and J. Zou, “A critical appraisal of the research on Chinese word segmention,” Contemporary Linguistics, vol. 39, no. 1, pp. 22–32, 2001, in Chinese. View at Google Scholar
  37. S. K. Saha, P. Mitra, and S. Sarkar, “Word clustering and word selection based feature reduction for MaxEnt based Hindi NER,” in Proceedings of ACL-08: HLT, pp. 488–495, OH, USA, 2008.
  38. J. Lei, P. Sockolow, P. Guan, Q. Meng, and J. Zhang, “A comparison of electronic health records at two major Peking University Hospitals in China to United States meaningful use objectives,” BMC Medical Informatics and Decision Making, vol. 13, no. 1, pp. 96–105, 2013. View at Google Scholar
  39. Y. Wang, Z. Yu, Y. Jiang, Y. Liu, and L. Chen, “A framework and its empirical study of automatic diagnosis of traditional Chinese medicine utilizing raw free-text clinical records,” Journal of Biomedical Informatics, vol. 45, no. 2, pp. 210–223, 2011. View at Google Scholar
  40. J. Zhao and F. Liu, “Product named entity recognition in Chinese text,” Language Resources and Evaluation, vol. 42, no. 2, pp. 197–217, 2008, in Chinese. View at Publisher · View at Google Scholar · View at Scopus
  41. O. Uzuner, I. Solti, F. Xia, and E. Cadag, “Community annotation experiment for ground truth generation for the i2b2 medication challenge,” Journal of the American Medical Informatics Association, vol. 17, no. 5, pp. 519–523, 2010. View at Publisher · View at Google Scholar · View at Scopus
  42. J. J. Cimino and X. Zhu, “The practical impact of ontologies on biomedical informatics,” Yearbook of Medical Informatics, vol. 1, pp. 124–135, 2006. View at Google Scholar
  43. Y. Xu, Y. Wang, J.-T. Sun, J. Zhang, J. Tsujii, and E. Chang, “Building large collections of Chinese and English medical terms from semi-structured and encyclopedia websites,” PloS One, vol. 8, no. 7, article e67526, 2013. View at Publisher · View at Google Scholar · View at Scopus
  44. R. H. Dolin, L. Alschuler, S. Boyer, C. Beebe, F. M. Behlen, and P. V. Biron, “HL7 clinical document architecture, release 2,” Journal of the American Medical Informatics Association, vol. 13, no. 1, pp. 30–39, 2005. View at Google Scholar
  45. T. Kang, S. Zhang, N. Xu, D. Wen, X. Zhang, and J. Lei, “Detecting negation and scope in Chinese clinical notes using character and word embedding,” Computer Methods and Programs in Biomedicine, vol. 140, no. 3, pp. 53–59, 2017. View at Google Scholar