Text Mining of Journal Articles for Sleep Disorder Terminologies

Calvin Lam, Fu-Chih Lai, Chia Hui Wang, MH Lai, N Hsu, Min-Huey Chung

研究成果: 雜誌貢獻文章

3 引文 (Scopus)

摘要

OBJECTIVE: Research on publication trends in journal articles on sleep disorders (SDs) and the associated methodologies by using text mining has been limited. The present study involved text mining for terms to determine the publication trends in sleep-related journal articles published during 2000-2013 and to identify associations between SD and methodology terms as well as conducting statistical analyses of the text mining findings.

METHODS: SD and methodology terms were extracted from 3,720 sleep-related journal articles in the PubMed database by using MetaMap. The extracted data set was analyzed using hierarchical cluster analyses and adjusted logistic regression models to investigate publication trends and associations between SD and methodology terms.

RESULTS: MetaMap had a text mining precision, recall, and false positive rate of 0.70, 0.77, and 11.51%, respectively. The most common SD term was breathing-related sleep disorder, whereas narcolepsy was the least common. Cluster analyses showed similar methodology clusters for each SD term, except narcolepsy. The logistic regression models showed an increasing prevalence of insomnia, parasomnia, and other sleep disorders but a decreasing prevalence of breathing-related sleep disorder during 2000-2013. Different SD terms were positively associated with different methodology terms regarding research design terms, measure terms, and analysis terms.

CONCLUSION: Insomnia-, parasomnia-, and other sleep disorder-related articles showed an increasing publication trend, whereas those related to breathing-related sleep disorder showed a decreasing trend. Furthermore, experimental studies more commonly focused on hypersomnia and other SDs and less commonly on insomnia, breathing-related sleep disorder, narcolepsy, and parasomnia. Thus, text mining may facilitate the exploration of the publication trends in SDs and the associated methodologies.
原文英語
頁(從 - 到)e0156031
期刊PLoS One
11
發行號5
DOIs
出版狀態已發佈 - 2016

指紋

Data Mining
terminology
Terminology
Parasomnias
Publications
Narcolepsy
Respiration
Sleep Initiation and Maintenance Disorders
Logistic Models
breathing
Sleep Wake Disorders
sleep disorders
Sleep
Cluster Analysis
sleep
Logistics
methodology
Disorders of Excessive Somnolence
PubMed

引用此文

Text Mining of Journal Articles for Sleep Disorder Terminologies. / Lam, Calvin; Lai, Fu-Chih; Wang, Chia Hui; Lai, MH; Hsu, N; Chung, Min-Huey.

於: PLoS One, 卷 11, 編號 5, 2016, p. e0156031.

研究成果: 雜誌貢獻文章

@article{3e1ef59432894965883f508a6ba0c902,
title = "Text Mining of Journal Articles for Sleep Disorder Terminologies",
abstract = "OBJECTIVE: Research on publication trends in journal articles on sleep disorders (SDs) and the associated methodologies by using text mining has been limited. The present study involved text mining for terms to determine the publication trends in sleep-related journal articles published during 2000-2013 and to identify associations between SD and methodology terms as well as conducting statistical analyses of the text mining findings.METHODS: SD and methodology terms were extracted from 3,720 sleep-related journal articles in the PubMed database by using MetaMap. The extracted data set was analyzed using hierarchical cluster analyses and adjusted logistic regression models to investigate publication trends and associations between SD and methodology terms.RESULTS: MetaMap had a text mining precision, recall, and false positive rate of 0.70, 0.77, and 11.51{\%}, respectively. The most common SD term was breathing-related sleep disorder, whereas narcolepsy was the least common. Cluster analyses showed similar methodology clusters for each SD term, except narcolepsy. The logistic regression models showed an increasing prevalence of insomnia, parasomnia, and other sleep disorders but a decreasing prevalence of breathing-related sleep disorder during 2000-2013. Different SD terms were positively associated with different methodology terms regarding research design terms, measure terms, and analysis terms.CONCLUSION: Insomnia-, parasomnia-, and other sleep disorder-related articles showed an increasing publication trend, whereas those related to breathing-related sleep disorder showed a decreasing trend. Furthermore, experimental studies more commonly focused on hypersomnia and other SDs and less commonly on insomnia, breathing-related sleep disorder, narcolepsy, and parasomnia. Thus, text mining may facilitate the exploration of the publication trends in SDs and the associated methodologies.",
keywords = "Journal Article",
author = "Calvin Lam and Fu-Chih Lai and Wang, {Chia Hui} and MH Lai and N Hsu and Min-Huey Chung",
year = "2016",
doi = "10.1371/journal.pone.0156031",
language = "English",
volume = "11",
pages = "e0156031",
journal = "PLoS One",
issn = "1932-6203",
publisher = "Public Library of Science",
number = "5",

}

TY - JOUR

T1 - Text Mining of Journal Articles for Sleep Disorder Terminologies

AU - Lam, Calvin

AU - Lai, Fu-Chih

AU - Wang, Chia Hui

AU - Lai, MH

AU - Hsu, N

AU - Chung, Min-Huey

PY - 2016

Y1 - 2016

N2 - OBJECTIVE: Research on publication trends in journal articles on sleep disorders (SDs) and the associated methodologies by using text mining has been limited. The present study involved text mining for terms to determine the publication trends in sleep-related journal articles published during 2000-2013 and to identify associations between SD and methodology terms as well as conducting statistical analyses of the text mining findings.METHODS: SD and methodology terms were extracted from 3,720 sleep-related journal articles in the PubMed database by using MetaMap. The extracted data set was analyzed using hierarchical cluster analyses and adjusted logistic regression models to investigate publication trends and associations between SD and methodology terms.RESULTS: MetaMap had a text mining precision, recall, and false positive rate of 0.70, 0.77, and 11.51%, respectively. The most common SD term was breathing-related sleep disorder, whereas narcolepsy was the least common. Cluster analyses showed similar methodology clusters for each SD term, except narcolepsy. The logistic regression models showed an increasing prevalence of insomnia, parasomnia, and other sleep disorders but a decreasing prevalence of breathing-related sleep disorder during 2000-2013. Different SD terms were positively associated with different methodology terms regarding research design terms, measure terms, and analysis terms.CONCLUSION: Insomnia-, parasomnia-, and other sleep disorder-related articles showed an increasing publication trend, whereas those related to breathing-related sleep disorder showed a decreasing trend. Furthermore, experimental studies more commonly focused on hypersomnia and other SDs and less commonly on insomnia, breathing-related sleep disorder, narcolepsy, and parasomnia. Thus, text mining may facilitate the exploration of the publication trends in SDs and the associated methodologies.

AB - OBJECTIVE: Research on publication trends in journal articles on sleep disorders (SDs) and the associated methodologies by using text mining has been limited. The present study involved text mining for terms to determine the publication trends in sleep-related journal articles published during 2000-2013 and to identify associations between SD and methodology terms as well as conducting statistical analyses of the text mining findings.METHODS: SD and methodology terms were extracted from 3,720 sleep-related journal articles in the PubMed database by using MetaMap. The extracted data set was analyzed using hierarchical cluster analyses and adjusted logistic regression models to investigate publication trends and associations between SD and methodology terms.RESULTS: MetaMap had a text mining precision, recall, and false positive rate of 0.70, 0.77, and 11.51%, respectively. The most common SD term was breathing-related sleep disorder, whereas narcolepsy was the least common. Cluster analyses showed similar methodology clusters for each SD term, except narcolepsy. The logistic regression models showed an increasing prevalence of insomnia, parasomnia, and other sleep disorders but a decreasing prevalence of breathing-related sleep disorder during 2000-2013. Different SD terms were positively associated with different methodology terms regarding research design terms, measure terms, and analysis terms.CONCLUSION: Insomnia-, parasomnia-, and other sleep disorder-related articles showed an increasing publication trend, whereas those related to breathing-related sleep disorder showed a decreasing trend. Furthermore, experimental studies more commonly focused on hypersomnia and other SDs and less commonly on insomnia, breathing-related sleep disorder, narcolepsy, and parasomnia. Thus, text mining may facilitate the exploration of the publication trends in SDs and the associated methodologies.

KW - Journal Article

UR - http://www.ncbi.nlm.nih.gov/pubmed/27203858

UR - https://www.scopus.com/record/display.uri?eid=2-s2.0-85000352013&origin=resultslist&sort=plf-f&src=s&nlo=&nlr=&nls=&sid=d6c4b65b901e8e4d113257428ae20451&sot=a&sdt=a&sl=36&s=AU-ID%28%22Chung%2c+Min+Huey%22+22953113600%29&relpos=11&citeCnt=3&searchTerm=

UR - https://www.scopus.com/results/citedbyresults.uri?sort=plf-f&cite=2-s2.0-85000352013&src=s&imp=t&sid=81a051afb79247c432df9270342beb17&sot=cite&sdt=a&sl=0&origin=recordpage&editSaveSearch=&txGid=088a0982ad9c67e42e15877c988aa90d

U2 - 10.1371/journal.pone.0156031

DO - 10.1371/journal.pone.0156031

M3 - Article

C2 - 27203858

VL - 11

SP - e0156031

JO - PLoS One

JF - PLoS One

SN - 1932-6203

IS - 5

ER -