Feature engineering for recognizing adverse drug reactions from Twitter posts

Hong Jie Dai, Musa Touray, Jitendra Jonnagaddala, Shabbir Syed-Abdul

Research output: Contribution to journal › Article

9 Citations (Scopus)

Abstract

Social media platforms are emerging digital communication channels that provide an easy way for common people to share their health and medication experiences online. With more people discussing their health information online publicly, social media platforms present a rich source of information for exploring adverse drug reactions (ADRs). ADRs are major public health problems that result in deaths and hospitalizations of millions of people. Unfortunately, not all ADRs are identified before a drug is made available in the market. In this study, an ADR event monitoring system is developed which can recognize ADR mentions from a tweet and classify its assertion. We explored several entity recognition features, feature conjunctions, and feature selection and analyzed their characteristics and impacts on the recognition of ADRs, which have never been studied previously. The results demonstrate that the entity recognition performance for ADR can achieve an F-score of 0.562 on the PSB Social Media Mining shared task dataset, which outperforms the partial-matching-based method by 0.122. After feature selection, the F-score can be further improved by 0.026. This novel technique of text mining utilizing shared online social media data will open an array of opportunities for researchers to explore various health related issues.
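
As a reading aid for the figures quoted in the abstract, the following minimal sketch shows how an entity-level F-score can be computed and what the reported differences imply. It is not the authors' evaluation code: the abstract does not state the matching criterion, so exact span matching is assumed here, and the `Span` type, the `f_score` helper, and the toy mentions are all hypothetical.

```python
from typing import Set, Tuple

# Hypothetical representation of one ADR mention: (tweet id, start offset, end offset).
Span = Tuple[str, int, int]


def f_score(gold: Set[Span], predicted: Set[Span]) -> float:
    """Balanced F-score (F1), assuming a prediction counts only on an exact span match."""
    true_positives = len(gold & predicted)
    if true_positives == 0:
        return 0.0
    precision = true_positives / len(predicted)
    recall = true_positives / len(gold)
    return 2 * precision * recall / (precision + recall)


# Toy data, invented purely for illustration.
gold = {("t1", 10, 18), ("t1", 25, 33), ("t2", 0, 9)}
predicted = {("t1", 10, 18), ("t2", 0, 9), ("t2", 15, 20), ("t3", 4, 11)}
score = f_score(gold, predicted)  # precision 2/4, recall 2/3

# Figures implied by the abstract's reported numbers.
reported = 0.562
baseline = round(reported - 0.122, 3)         # partial-matching-based method
after_selection = round(reported + 0.026, 3)  # after feature selection
```

Under these assumptions, the abstract's numbers imply an F-score of about 0.440 for the partial-matching-based method and about 0.588 after feature selection.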

Original language: English
Article number: 27
Journal: Information (Switzerland)
Volume: 7
Issue number: 2
DOIs: 10.3390/info7020027
Publication status: Published - May 25 2016

Fingerprint

Health
Feature extraction

Keywords

  • Adverse drug reactions
  • Named entity recognition
  • Natural language processing
  • Social media
  • Word embedding

ASJC Scopus subject areas

  • Information Systems

Cite this

Feature engineering for recognizing adverse drug reactions from Twitter posts. / Dai, Hong Jie; Touray, Musa; Jonnagaddala, Jitendra; Syed-Abdul, Shabbir.

In: Information (Switzerland), Vol. 7, No. 2, 27, 25.05.2016.

Dai, Hong Jie ; Touray, Musa ; Jonnagaddala, Jitendra ; Syed-Abdul, Shabbir. / Feature engineering for recognizing adverse drug reactions from Twitter posts. In: Information (Switzerland). 2016 ; Vol. 7, No. 2.
@article{cc0a5b04a433418aa48b2f4241c63de6,
title = "Feature engineering for recognizing adverse drug reactions from Twitter posts",
abstract = "Social media platforms are emerging digital communication channels that provide an easy way for common people to share their health and medication experiences online. With more people discussing their health information online publicly, social media platforms present a rich source of information for exploring adverse drug reactions (ADRs). ADRs are major public health problems that result in deaths and hospitalizations of millions of people. Unfortunately, not all ADRs are identified before a drug is made available in the market. In this study, an ADR event monitoring system is developed which can recognize ADR mentions from a tweet and classify its assertion. We explored several entity recognition features, feature conjunctions, and feature selection and analyzed their characteristics and impacts on the recognition of ADRs, which have never been studied previously. The results demonstrate that the entity recognition performance for ADR can achieve an F-score of 0.562 on the PSB Social Media Mining shared task dataset, which outperforms the partial-matching-based method by 0.122. After feature selection, the F-score can be further improved by 0.026. This novel technique of text mining utilizing shared online social media data will open an array of opportunities for researchers to explore various health related issues.",
keywords = "Adverse drug reactions, Named entity recognition, Natural language processing, Social media, Word embedding",
author = "Dai, {Hong Jie} and Musa Touray and Jitendra Jonnagaddala and Shabbir Syed-Abdul",
year = "2016",
month = "5",
day = "25",
doi = "10.3390/info7020027",
language = "English",
volume = "7",
journal = "Information (Switzerland)",
issn = "2078-2489",
publisher = "Multidisciplinary Digital Publishing Institute (MDPI)",
number = "2",

}

TY - JOUR

T1 - Feature engineering for recognizing adverse drug reactions from Twitter posts

AU - Dai, Hong Jie

AU - Touray, Musa

AU - Jonnagaddala, Jitendra

AU - Syed-Abdul, Shabbir

PY - 2016/5/25

Y1 - 2016/5/25

AB - Social media platforms are emerging digital communication channels that provide an easy way for common people to share their health and medication experiences online. With more people discussing their health information online publicly, social media platforms present a rich source of information for exploring adverse drug reactions (ADRs). ADRs are major public health problems that result in deaths and hospitalizations of millions of people. Unfortunately, not all ADRs are identified before a drug is made available in the market. In this study, an ADR event monitoring system is developed which can recognize ADR mentions from a tweet and classify its assertion. We explored several entity recognition features, feature conjunctions, and feature selection and analyzed their characteristics and impacts on the recognition of ADRs, which have never been studied previously. The results demonstrate that the entity recognition performance for ADR can achieve an F-score of 0.562 on the PSB Social Media Mining shared task dataset, which outperforms the partial-matching-based method by 0.122. After feature selection, the F-score can be further improved by 0.026. This novel technique of text mining utilizing shared online social media data will open an array of opportunities for researchers to explore various health related issues.

KW - Adverse drug reactions

KW - Named entity recognition

KW - Natural language processing

KW - Social media

KW - Word embedding

UR - http://www.scopus.com/inward/record.url?scp=84976524192&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84976524192&partnerID=8YFLogxK

U2 - 10.3390/info7020027

DO - 10.3390/info7020027

M3 - Article

VL - 7

JO - Information (Switzerland)

JF - Information (Switzerland)

SN - 2078-2489

IS - 2

M1 - 27

ER -