A frame-based approach for reference metadata extraction

Yu Lun Hsieh, Shih Hung Liu, Ting Hao Yang, Yu Hsuan Chen, Yung Chun Chang, Gladys Hsieh, Cheng Wei Shih, Chun Hung Lu, Wen Lian Hsu

研究成果: 雜誌貢獻文章同行評審

1 引文 斯高帕斯(Scopus)

摘要

In this paper, we propose a novel frame-based approach (FBA) and use reference metadata extraction as a case study to demonstrate its advantages. The main contributions of this research are three-fold. First, the new frame matching algorithm, based on sequence alignment, can compensate for the shortcomings of traditional rule-based approach, in which rule matching lacks flexibility and generality. Second, an approximate matching is adopted for capturing reasonable abbreviations or errors in the input reference string to further increase the coverage of the frames. Third, experiments conducted on extensive datasets show that the same knowledge framework performed equally well on various untrained domains. Comparing to a widely-used machine learning method, Conditional Random Fields (CRFs), the FBA can drastically reduce the average field error rate across all four independent test sets by 70%\ (2.24% vs. 7.54%).
原文英語
頁(從 - 到)154-163
頁數10
期刊Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
8916
出版狀態已發佈 - 2014
對外發佈

ASJC Scopus subject areas

  • 電腦科學(全部)
  • 理論電腦科學

指紋

深入研究「A frame-based approach for reference metadata extraction」主題。共同形成了獨特的指紋。

引用此