MONPA: Multi-objective Named-entity and Part-of-speech Annotator for Chinese using Recurrent Neural Network

Yu Lun Hsieh, Yung-Chun Chang, Yi Jie Huang, Shu Hao Yeh, Chun-Hung Chen, Wen Lian Hsu

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Abstract

Part-of-speech (POS) tagging and named entity recognition (NER) are crucial steps in natural language processing. In addition, the difficulty of word segmentation places an extra burden on those who deal with languages such as Chinese, and pipelined systems often suffer from error propagation. This work proposes an end-to-end model using a character-based recurrent neural network (RNN) to jointly accomplish segmentation, POS tagging, and NER of a Chinese sentence. Experiments on previous word segmentation and NER competition datasets show that a single joint model using the proposed architecture is comparable to those trained specifically for each task, and outperforms freely available software. Moreover, we provide a web-based interface for the public to easily access this resource.
Original language: English
Title of host publication: Proceedings of the Eighth International Joint Conference on Natural Language Processing
Publisher: Asian Federation of Natural Language Processing
Pages: 80-85
Publication status: Published - 2017

Cite this

Hsieh, Y. L., Chang, Y-C., Huang, Y. J., Yeh, S. H., Chen, C-H., & Hsu, W. L. (2017). MONPA: Multi-objective Named-entity and Part-of-speech Annotator for Chinese using Recurrent Neural Network. In Proceedings of the Eighth International Joint Conference on Natural Language Processing (pp. 80-85). Asian Federation of Natural Language Processing.

@inproceedings{70493e24df084808b750893f9ae6dd81,
title = "MONPA: Multi-objective Named-entity and Part-of-speech Annotator for Chinese using Recurrent Neural Network",
abstract = "Part-of-speech (POS) tagging and named entity recognition (NER) are crucial steps in natural language processing. In addition, the difficulty of word segmentation places an extra burden on those who deal with languages such as Chinese, and pipelined systems often suffer from error propagation. This work proposes an end-to-end model using a character-based recurrent neural network (RNN) to jointly accomplish segmentation, POS tagging, and NER of a Chinese sentence. Experiments on previous word segmentation and NER competition datasets show that a single joint model using the proposed architecture is comparable to those trained specifically for each task, and outperforms freely available software. Moreover, we provide a web-based interface for the public to easily access this resource.",
author = "Hsieh, {Yu Lun} and Yung-Chun Chang and Huang, {Yi Jie} and Yeh, {Shu Hao} and Chun-Hung Chen and Hsu, {Wen Lian}",
year = "2017",
language = "English",
pages = "80--85",
booktitle = "Proceedings of the Eighth International Joint Conference on Natural Language Processing",
publisher = "Asian Federation of Natural Language Processing",

}

TY - GEN

T1 - MONPA: Multi-objective Named-entity and Part-of-speech Annotator for Chinese using Recurrent Neural Network

AU - Hsieh, Yu Lun

AU - Chang, Yung-Chun

AU - Huang, Yi Jie

AU - Yeh, Shu Hao

AU - Chen, Chun-Hung

AU - Hsu, Wen Lian

PY - 2017

Y1 - 2017

N2 - Part-of-speech (POS) tagging and named entity recognition (NER) are crucial steps in natural language processing. In addition, the difficulty of word segmentation places an extra burden on those who deal with languages such as Chinese, and pipelined systems often suffer from error propagation. This work proposes an end-to-end model using a character-based recurrent neural network (RNN) to jointly accomplish segmentation, POS tagging, and NER of a Chinese sentence. Experiments on previous word segmentation and NER competition datasets show that a single joint model using the proposed architecture is comparable to those trained specifically for each task, and outperforms freely available software. Moreover, we provide a web-based interface for the public to easily access this resource.

UR - https://aclanthology.coli.uni-saarland.de/papers/I17-2014/i17-2014

UR - http://ijcnlp2017.org/site/page.aspx?pid=172&sid=1133&lang=en

M3 - Conference contribution

SP - 80

EP - 85

BT - Proceedings of the Eighth International Joint Conference on Natural Language Processing

PB - Asian Federation of Natural Language Processing

ER -