An Ensemble Neural Network Model for Benefiting Pregnancy Health Stats from Mining Social Media

Neha Warikoo, Yung Chun Chang, Hong Jie Dai, Wen Lian Hsu

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Extensive use of social media for communication has made it a desired resource in human behavior intensive tasks like product popularity, public polls and more recently for public health surveillance tasks such as lifestyle associated diseases and mental health. In this paper, we exploited Twitter data for detecting pregnancy cases and used tweets about pregnancy to study trigger terms associated with maternal physical and mental health. Such systems can enable clinicians to offer a more comprehensive health care in real time. Using a Twitter-based corpus, we have developed an ensemble Long-short Term Memory (LSTM) – Recurrent Neural Networks (RNN) and Convolution Neural Networks (CNN) network representation model to learn legitimate pregnancy cases discussed online. These ensemble representations were learned by a SVM classifier, which can achieve F1-score of 95% in predicting pregnancy accounts discussed in tweets. We also further investigate the words most commonly associated with physical disease symptoms ‘Distress’ and negative emotions ‘Annoyed’ sentiment. Results from our sentiment analysis study are quite encouraging, identifying more accurate triggers for pregnancy sentiment classes.

Original languageEnglish
Title of host publicationInformation Retrieval Technology - 14th Asia Information Retrieval Societies Conference, AIRS 2018, Proceedings
EditorsLun-Wei Ku, Jui-Feng Yeh, Liang-Chih Yu, Yuen-Hsien Tseng, Zhi-Hong Chen, Tetsuya Sakai, Jing Jiang, Lung-Hao Lee, Dae Hoon Park
PublisherSpringer Verlag
Pages3-15
Number of pages13
ISBN (Print)9783030035198
DOIs
Publication statusPublished - Jan 1 2018
Event14th Asia Information Retrieval Societies conference, AIRS 2018 - Taipei, Taiwan
Duration: Nov 28 2018Nov 30 2018

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume11292 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference14th Asia Information Retrieval Societies conference, AIRS 2018
CountryTaiwan
CityTaipei
Period11/28/1811/30/18

Fingerprint

Social Media
Pregnancy
Neural Network Model
Mining
Health
Ensemble
Neural networks
Recurrent neural networks
Public health
Convolution
Health care
Classifiers
Trigger
Communication
Sentiment Analysis
Memory Term
Human Behavior
Public Health
Recurrent Neural Networks
Surveillance

Keywords

  • Ensemble deep learning
  • Health surveillance
  • Pregnancy health stats
  • Sentiment analysis
  • Text mining of Twitter data

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)

Cite this

Warikoo, N., Chang, Y. C., Dai, H. J., & Hsu, W. L. (2018). An Ensemble Neural Network Model for Benefiting Pregnancy Health Stats from Mining Social Media. In L-W. Ku, J-F. Yeh, L-C. Yu, Y-H. Tseng, Z-H. Chen, T. Sakai, J. Jiang, L-H. Lee, ... D. H. Park (Eds.), Information Retrieval Technology - 14th Asia Information Retrieval Societies Conference, AIRS 2018, Proceedings (pp. 3-15). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 11292 LNCS). Springer Verlag. https://doi.org/10.1007/978-3-030-03520-4_1

An Ensemble Neural Network Model for Benefiting Pregnancy Health Stats from Mining Social Media. / Warikoo, Neha; Chang, Yung Chun; Dai, Hong Jie; Hsu, Wen Lian.

Information Retrieval Technology - 14th Asia Information Retrieval Societies Conference, AIRS 2018, Proceedings. ed. / Lun-Wei Ku; Jui-Feng Yeh; Liang-Chih Yu; Yuen-Hsien Tseng; Zhi-Hong Chen; Tetsuya Sakai; Jing Jiang; Lung-Hao Lee; Dae Hoon Park. Springer Verlag, 2018. p. 3-15 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 11292 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Warikoo, N, Chang, YC, Dai, HJ & Hsu, WL 2018, An Ensemble Neural Network Model for Benefiting Pregnancy Health Stats from Mining Social Media. in L-W Ku, J-F Yeh, L-C Yu, Y-H Tseng, Z-H Chen, T Sakai, J Jiang, L-H Lee & DH Park (eds), Information Retrieval Technology - 14th Asia Information Retrieval Societies Conference, AIRS 2018, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 11292 LNCS, Springer Verlag, pp. 3-15, 14th Asia Information Retrieval Societies conference, AIRS 2018, Taipei, Taiwan, 11/28/18. https://doi.org/10.1007/978-3-030-03520-4_1
Warikoo N, Chang YC, Dai HJ, Hsu WL. An Ensemble Neural Network Model for Benefiting Pregnancy Health Stats from Mining Social Media. In Ku L-W, Yeh J-F, Yu L-C, Tseng Y-H, Chen Z-H, Sakai T, Jiang J, Lee L-H, Park DH, editors, Information Retrieval Technology - 14th Asia Information Retrieval Societies Conference, AIRS 2018, Proceedings. Springer Verlag. 2018. p. 3-15. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). https://doi.org/10.1007/978-3-030-03520-4_1
Warikoo, Neha ; Chang, Yung Chun ; Dai, Hong Jie ; Hsu, Wen Lian. / An Ensemble Neural Network Model for Benefiting Pregnancy Health Stats from Mining Social Media. Information Retrieval Technology - 14th Asia Information Retrieval Societies Conference, AIRS 2018, Proceedings. editor / Lun-Wei Ku ; Jui-Feng Yeh ; Liang-Chih Yu ; Yuen-Hsien Tseng ; Zhi-Hong Chen ; Tetsuya Sakai ; Jing Jiang ; Lung-Hao Lee ; Dae Hoon Park. Springer Verlag, 2018. pp. 3-15 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
@inproceedings{b479627b08dc48cabb1ed9e6eed51831,
title = "An Ensemble Neural Network Model for Benefiting Pregnancy Health Stats from Mining Social Media",
abstract = "Extensive use of social media for communication has made it a desired resource in human behavior intensive tasks like product popularity, public polls and more recently for public health surveillance tasks such as lifestyle associated diseases and mental health. In this paper, we exploited Twitter data for detecting pregnancy cases and used tweets about pregnancy to study trigger terms associated with maternal physical and mental health. Such systems can enable clinicians to offer a more comprehensive health care in real time. Using a Twitter-based corpus, we have developed an ensemble Long-short Term Memory (LSTM) – Recurrent Neural Networks (RNN) and Convolution Neural Networks (CNN) network representation model to learn legitimate pregnancy cases discussed online. These ensemble representations were learned by a SVM classifier, which can achieve F1-score of 95{\%} in predicting pregnancy accounts discussed in tweets. We also further investigate the words most commonly associated with physical disease symptoms ‘Distress’ and negative emotions ‘Annoyed’ sentiment. Results from our sentiment analysis study are quite encouraging, identifying more accurate triggers for pregnancy sentiment classes.",
keywords = "Ensemble deep learning, Health surveillance, Pregnancy health stats, Sentiment analysis, Text mining of Twitter data",
author = "Neha Warikoo and Chang, {Yung Chun} and Dai, {Hong Jie} and Hsu, {Wen Lian}",
year = "2018",
month = "1",
day = "1",
doi = "10.1007/978-3-030-03520-4_1",
language = "English",
isbn = "9783030035198",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
publisher = "Springer Verlag",
pages = "3--15",
editor = "Lun-Wei Ku and Jui-Feng Yeh and Liang-Chih Yu and Yuen-Hsien Tseng and Zhi-Hong Chen and Tetsuya Sakai and Jing Jiang and Lung-Hao Lee and Park, {Dae Hoon}",
booktitle = "Information Retrieval Technology - 14th Asia Information Retrieval Societies Conference, AIRS 2018, Proceedings",
address = "Germany",

}

TY - GEN

T1 - An Ensemble Neural Network Model for Benefiting Pregnancy Health Stats from Mining Social Media

AU - Warikoo, Neha

AU - Chang, Yung Chun

AU - Dai, Hong Jie

AU - Hsu, Wen Lian

PY - 2018/1/1

Y1 - 2018/1/1

N2 - Extensive use of social media for communication has made it a desired resource in human behavior intensive tasks like product popularity, public polls and more recently for public health surveillance tasks such as lifestyle associated diseases and mental health. In this paper, we exploited Twitter data for detecting pregnancy cases and used tweets about pregnancy to study trigger terms associated with maternal physical and mental health. Such systems can enable clinicians to offer a more comprehensive health care in real time. Using a Twitter-based corpus, we have developed an ensemble Long-short Term Memory (LSTM) – Recurrent Neural Networks (RNN) and Convolution Neural Networks (CNN) network representation model to learn legitimate pregnancy cases discussed online. These ensemble representations were learned by a SVM classifier, which can achieve F1-score of 95% in predicting pregnancy accounts discussed in tweets. We also further investigate the words most commonly associated with physical disease symptoms ‘Distress’ and negative emotions ‘Annoyed’ sentiment. Results from our sentiment analysis study are quite encouraging, identifying more accurate triggers for pregnancy sentiment classes.

AB - Extensive use of social media for communication has made it a desired resource in human behavior intensive tasks like product popularity, public polls and more recently for public health surveillance tasks such as lifestyle associated diseases and mental health. In this paper, we exploited Twitter data for detecting pregnancy cases and used tweets about pregnancy to study trigger terms associated with maternal physical and mental health. Such systems can enable clinicians to offer a more comprehensive health care in real time. Using a Twitter-based corpus, we have developed an ensemble Long-short Term Memory (LSTM) – Recurrent Neural Networks (RNN) and Convolution Neural Networks (CNN) network representation model to learn legitimate pregnancy cases discussed online. These ensemble representations were learned by a SVM classifier, which can achieve F1-score of 95% in predicting pregnancy accounts discussed in tweets. We also further investigate the words most commonly associated with physical disease symptoms ‘Distress’ and negative emotions ‘Annoyed’ sentiment. Results from our sentiment analysis study are quite encouraging, identifying more accurate triggers for pregnancy sentiment classes.

KW - Ensemble deep learning

KW - Health surveillance

KW - Pregnancy health stats

KW - Sentiment analysis

KW - Text mining of Twitter data

UR - http://www.scopus.com/inward/record.url?scp=85057269768&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85057269768&partnerID=8YFLogxK

U2 - 10.1007/978-3-030-03520-4_1

DO - 10.1007/978-3-030-03520-4_1

M3 - Conference contribution

SN - 9783030035198

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 3

EP - 15

BT - Information Retrieval Technology - 14th Asia Information Retrieval Societies Conference, AIRS 2018, Proceedings

A2 - Ku, Lun-Wei

A2 - Yeh, Jui-Feng

A2 - Yu, Liang-Chih

A2 - Tseng, Yuen-Hsien

A2 - Chen, Zhi-Hong

A2 - Sakai, Tetsuya

A2 - Jiang, Jing

A2 - Lee, Lung-Hao

A2 - Park, Dae Hoon

PB - Springer Verlag

ER -