Imputing manufacturing material in data mining

Ruey Ling Yeh, Ching Liu, Ben Chang Shia, Yu Ting Cheng, Ya Fang Huwang

Research output: Contribution to journalArticle

4 Citations (Scopus)

Abstract

Data plays a vital role as a source of information to organizations, especially in times of information and technology. One encounters a not-so-perfect database from which data is missing, and the results obtained from such a database may provide biased or misleading solutions. Therefore, imputing missing data to a database has been regarded as one of the major steps in data mining. The present research used different methods of data mining to construct imputative models in accordance with different types of missing data. When the missing data is continuous, regression models and Neural Networks are used to build imputative models. For the categorical missing data, the logistic regression model, neural network, C5.0 and CART are employed to construct imputative models. The results showed that the regression model was found to provide the best estimate of continuous missing data; but for categorical missing data, the C5.0 model proved the best method.

Original languageEnglish
Pages (from-to)109-118
Number of pages10
JournalJournal of Intelligent Manufacturing
Volume19
Issue number1
DOIs
Publication statusPublished - Feb 2008
Externally publishedYes

Fingerprint

Data mining
Neural networks
Logistics

Keywords

  • BPNN
  • C5.0
  • Data mining
  • Imputation
  • Missing data
  • Regression

ASJC Scopus subject areas

  • Industrial and Manufacturing Engineering
  • Artificial Intelligence

Cite this

Imputing manufacturing material in data mining. / Yeh, Ruey Ling; Liu, Ching; Shia, Ben Chang; Cheng, Yu Ting; Huwang, Ya Fang.

In: Journal of Intelligent Manufacturing, Vol. 19, No. 1, 02.2008, p. 109-118.

Research output: Contribution to journalArticle

Yeh, Ruey Ling ; Liu, Ching ; Shia, Ben Chang ; Cheng, Yu Ting ; Huwang, Ya Fang. / Imputing manufacturing material in data mining. In: Journal of Intelligent Manufacturing. 2008 ; Vol. 19, No. 1. pp. 109-118.
@article{835e94f298604438ae70bac2659f8ae7,
title = "Imputing manufacturing material in data mining",
abstract = "Data plays a vital role as a source of information to organizations, especially in times of information and technology. One encounters a not-so-perfect database from which data is missing, and the results obtained from such a database may provide biased or misleading solutions. Therefore, imputing missing data to a database has been regarded as one of the major steps in data mining. The present research used different methods of data mining to construct imputative models in accordance with different types of missing data. When the missing data is continuous, regression models and Neural Networks are used to build imputative models. For the categorical missing data, the logistic regression model, neural network, C5.0 and CART are employed to construct imputative models. The results showed that the regression model was found to provide the best estimate of continuous missing data; but for categorical missing data, the C5.0 model proved the best method.",
keywords = "BPNN, C5.0, Data mining, Imputation, Missing data, Regression",
author = "Yeh, {Ruey Ling} and Ching Liu and Shia, {Ben Chang} and Cheng, {Yu Ting} and Huwang, {Ya Fang}",
year = "2008",
month = "2",
doi = "10.1007/s10845-007-0067-z",
language = "English",
volume = "19",
pages = "109--118",
journal = "Journal of Intelligent Manufacturing",
issn = "0956-5515",
publisher = "Springer Netherlands",
number = "1",

}

TY - JOUR

T1 - Imputing manufacturing material in data mining

AU - Yeh, Ruey Ling

AU - Liu, Ching

AU - Shia, Ben Chang

AU - Cheng, Yu Ting

AU - Huwang, Ya Fang

PY - 2008/2

Y1 - 2008/2

N2 - Data plays a vital role as a source of information to organizations, especially in times of information and technology. One encounters a not-so-perfect database from which data is missing, and the results obtained from such a database may provide biased or misleading solutions. Therefore, imputing missing data to a database has been regarded as one of the major steps in data mining. The present research used different methods of data mining to construct imputative models in accordance with different types of missing data. When the missing data is continuous, regression models and Neural Networks are used to build imputative models. For the categorical missing data, the logistic regression model, neural network, C5.0 and CART are employed to construct imputative models. The results showed that the regression model was found to provide the best estimate of continuous missing data; but for categorical missing data, the C5.0 model proved the best method.

AB - Data plays a vital role as a source of information to organizations, especially in times of information and technology. One encounters a not-so-perfect database from which data is missing, and the results obtained from such a database may provide biased or misleading solutions. Therefore, imputing missing data to a database has been regarded as one of the major steps in data mining. The present research used different methods of data mining to construct imputative models in accordance with different types of missing data. When the missing data is continuous, regression models and Neural Networks are used to build imputative models. For the categorical missing data, the logistic regression model, neural network, C5.0 and CART are employed to construct imputative models. The results showed that the regression model was found to provide the best estimate of continuous missing data; but for categorical missing data, the C5.0 model proved the best method.

KW - BPNN

KW - C5.0

KW - Data mining

KW - Imputation

KW - Missing data

KW - Regression

UR - http://www.scopus.com/inward/record.url?scp=39549085594&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=39549085594&partnerID=8YFLogxK

U2 - 10.1007/s10845-007-0067-z

DO - 10.1007/s10845-007-0067-z

M3 - Article

AN - SCOPUS:39549085594

VL - 19

SP - 109

EP - 118

JO - Journal of Intelligent Manufacturing

JF - Journal of Intelligent Manufacturing

SN - 0956-5515

IS - 1

ER -