Feature engineering for recognizing adverse drug reactions from twitter posts

Hong Jie Dai, Musa Touray, Jitendra Jonnagaddala, Shabbir Syed-Abdul

Research output: Contribution to journalArticlepeer-review

14 Citations (Scopus)

Abstract

Social media platforms are emerging digital communication channels that provide aneasy way for common people to share their health and medication experiences online. With morepeople discussing their health information online publicly, social media platforms present a richsource of information for exploring adverse drug reactions (ADRs). ADRs are major public healthproblems that result in deaths and hospitalizations of millions of people. Unfortunately, not allADRs are identified before a drug is made available in the market. In this study, an ADR eventmonitoring system is developed which can recognize ADR mentions from a tweet and classify itsassertion. We explored several entity recognition features, feature conjunctions, and feature selectionand analyzed their characteristics and impacts on the recognition of ADRs, which have never beenstudied previously. The results demonstrate that the entity recognition performance for ADR canachieve an F-score of 0.562 on the PSB Social Media Mining shared task dataset, which outperformsthe partial-matching-based method by 0.122. After feature selection, the F-score can be furtherimproved by 0.026. This novel technique of text mining utilizing shared online social media data willopen an array of opportunities for researchers to explore various health related issues.

Original languageEnglish
Article number27
JournalInformation (Switzerland)
Volume7
Issue number2
DOIs
Publication statusPublished - May 25 2016

Keywords

  • Adverse drug reactions
  • Named entity recognition
  • Natural language processing
  • Social media
  • Word embedding

ASJC Scopus subject areas

  • Information Systems

Fingerprint Dive into the research topics of 'Feature engineering for recognizing adverse drug reactions from twitter posts'. Together they form a unique fingerprint.

Cite this