Extracting eligibility criteria from the narrative text of scientific research articles

Ching Yun Lin, Der Ming Liou, Mei Lien Pan

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Eligibility criteria among hundreds of National Health Insurance Research Database (NHIRD) research papers have similar constituent elements, such as demographic characteristics or diagnostic codes. The study results of the same disease could vary among different research due to the variation of the criteria statements, therefore the narrative patterns analysis tool would be helpful for summarizing the knowledge implicitly contained in the eligibility criteria. In this study, we developed a series of R-based text processing methods to extract the narrative eligibility criteria in NHIRD papers by simplifying the article titles and content paragraphs, identifying medical concepts and abbreviations, then detecting basic demographic characteristics and ICD-9-CM diagnosis codes. Although there is still room for improvement on study type identifying, the high performance in classifying the study type, detecting age restrictions and extracting ICD-9-CM codes still shows the system usefulness for the analysis of eligibility criteria.

Original languageEnglish
Title of host publicationMEDINFO 2017
Subtitle of host publicationPrecision Healthcare through Informatics - Proceedings of the 16th World Congress on Medical and Health Informatics
EditorsZhao Dongsheng, Adi V. Gundlapalli, Jaulent Marie-Christine
PublisherIOS Press
Pages481-485
Number of pages5
ISBN (Electronic)9781614998297
DOIs
Publication statusPublished - Jan 1 2017
Externally publishedYes
Event16th World Congress of Medical and Health Informatics: Precision Healthcare through Informatics, MedInfo 2017 - Hangzhou, China
Duration: Aug 21 2017Aug 25 2017

Publication series

NameStudies in Health Technology and Informatics
Volume245
ISSN (Print)0926-9630
ISSN (Electronic)1879-8365

Other

Other16th World Congress of Medical and Health Informatics: Precision Healthcare through Informatics, MedInfo 2017
CountryChina
CityHangzhou
Period8/21/178/25/17

Fingerprint

Health insurance
National Health Programs
International Classification of Diseases
Research
Demography
Databases
Text processing

Keywords

  • Database
  • Natural language processing

ASJC Scopus subject areas

  • Biomedical Engineering
  • Health Informatics
  • Health Information Management

Cite this

Lin, C. Y., Liou, D. M., & Pan, M. L. (2017). Extracting eligibility criteria from the narrative text of scientific research articles. In Z. Dongsheng, A. V. Gundlapalli, & J. Marie-Christine (Eds.), MEDINFO 2017: Precision Healthcare through Informatics - Proceedings of the 16th World Congress on Medical and Health Informatics (pp. 481-485). (Studies in Health Technology and Informatics; Vol. 245). IOS Press. https://doi.org/10.3233/978-1-61499-830-3-481

Extracting eligibility criteria from the narrative text of scientific research articles. / Lin, Ching Yun; Liou, Der Ming; Pan, Mei Lien.

MEDINFO 2017: Precision Healthcare through Informatics - Proceedings of the 16th World Congress on Medical and Health Informatics. ed. / Zhao Dongsheng; Adi V. Gundlapalli; Jaulent Marie-Christine. IOS Press, 2017. p. 481-485 (Studies in Health Technology and Informatics; Vol. 245).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Lin, CY, Liou, DM & Pan, ML 2017, Extracting eligibility criteria from the narrative text of scientific research articles. in Z Dongsheng, AV Gundlapalli & J Marie-Christine (eds), MEDINFO 2017: Precision Healthcare through Informatics - Proceedings of the 16th World Congress on Medical and Health Informatics. Studies in Health Technology and Informatics, vol. 245, IOS Press, pp. 481-485, 16th World Congress of Medical and Health Informatics: Precision Healthcare through Informatics, MedInfo 2017, Hangzhou, China, 8/21/17. https://doi.org/10.3233/978-1-61499-830-3-481
Lin CY, Liou DM, Pan ML. Extracting eligibility criteria from the narrative text of scientific research articles. In Dongsheng Z, Gundlapalli AV, Marie-Christine J, editors, MEDINFO 2017: Precision Healthcare through Informatics - Proceedings of the 16th World Congress on Medical and Health Informatics. IOS Press. 2017. p. 481-485. (Studies in Health Technology and Informatics). https://doi.org/10.3233/978-1-61499-830-3-481
Lin, Ching Yun ; Liou, Der Ming ; Pan, Mei Lien. / Extracting eligibility criteria from the narrative text of scientific research articles. MEDINFO 2017: Precision Healthcare through Informatics - Proceedings of the 16th World Congress on Medical and Health Informatics. editor / Zhao Dongsheng ; Adi V. Gundlapalli ; Jaulent Marie-Christine. IOS Press, 2017. pp. 481-485 (Studies in Health Technology and Informatics).
@inproceedings{9495e24bd15d430e8c2e9cc816045924,
title = "Extracting eligibility criteria from the narrative text of scientific research articles",
abstract = "Eligibility criteria among hundreds of National Health Insurance Research Database (NHIRD) research papers have similar constituent elements, such as demographic characteristics or diagnostic codes. The study results of the same disease could vary among different research due to the variation of the criteria statements, therefore the narrative patterns analysis tool would be helpful for summarizing the knowledge implicitly contained in the eligibility criteria. In this study, we developed a series of R-based text processing methods to extract the narrative eligibility criteria in NHIRD papers by simplifying the article titles and content paragraphs, identifying medical concepts and abbreviations, then detecting basic demographic characteristics and ICD-9-CM diagnosis codes. Although there is still room for improvement on study type identifying, the high performance in classifying the study type, detecting age restrictions and extracting ICD-9-CM codes still shows the system usefulness for the analysis of eligibility criteria.",
keywords = "Database, Natural language processing",
author = "Lin, {Ching Yun} and Liou, {Der Ming} and Pan, {Mei Lien}",
year = "2017",
month = "1",
day = "1",
doi = "10.3233/978-1-61499-830-3-481",
language = "English",
series = "Studies in Health Technology and Informatics",
publisher = "IOS Press",
pages = "481--485",
editor = "Zhao Dongsheng and Gundlapalli, {Adi V.} and Jaulent Marie-Christine",
booktitle = "MEDINFO 2017",
address = "Netherlands",

}

TY - GEN

T1 - Extracting eligibility criteria from the narrative text of scientific research articles

AU - Lin, Ching Yun

AU - Liou, Der Ming

AU - Pan, Mei Lien

PY - 2017/1/1

Y1 - 2017/1/1

N2 - Eligibility criteria among hundreds of National Health Insurance Research Database (NHIRD) research papers have similar constituent elements, such as demographic characteristics or diagnostic codes. The study results of the same disease could vary among different research due to the variation of the criteria statements, therefore the narrative patterns analysis tool would be helpful for summarizing the knowledge implicitly contained in the eligibility criteria. In this study, we developed a series of R-based text processing methods to extract the narrative eligibility criteria in NHIRD papers by simplifying the article titles and content paragraphs, identifying medical concepts and abbreviations, then detecting basic demographic characteristics and ICD-9-CM diagnosis codes. Although there is still room for improvement on study type identifying, the high performance in classifying the study type, detecting age restrictions and extracting ICD-9-CM codes still shows the system usefulness for the analysis of eligibility criteria.

AB - Eligibility criteria among hundreds of National Health Insurance Research Database (NHIRD) research papers have similar constituent elements, such as demographic characteristics or diagnostic codes. The study results of the same disease could vary among different research due to the variation of the criteria statements, therefore the narrative patterns analysis tool would be helpful for summarizing the knowledge implicitly contained in the eligibility criteria. In this study, we developed a series of R-based text processing methods to extract the narrative eligibility criteria in NHIRD papers by simplifying the article titles and content paragraphs, identifying medical concepts and abbreviations, then detecting basic demographic characteristics and ICD-9-CM diagnosis codes. Although there is still room for improvement on study type identifying, the high performance in classifying the study type, detecting age restrictions and extracting ICD-9-CM codes still shows the system usefulness for the analysis of eligibility criteria.

KW - Database

KW - Natural language processing

UR - http://www.scopus.com/inward/record.url?scp=85040510156&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85040510156&partnerID=8YFLogxK

U2 - 10.3233/978-1-61499-830-3-481

DO - 10.3233/978-1-61499-830-3-481

M3 - Conference contribution

C2 - 29295141

AN - SCOPUS:85040510156

T3 - Studies in Health Technology and Informatics

SP - 481

EP - 485

BT - MEDINFO 2017

A2 - Dongsheng, Zhao

A2 - Gundlapalli, Adi V.

A2 - Marie-Christine, Jaulent

PB - IOS Press

ER -