Estimating the Probability of Rare Events Occurring Using a Local Model Averaging

Jin-Hua Chen, Chun-Shu Chen, Meng-Fan Huang, Hung-Chih Lin

研究成果: 雜誌貢獻文章

2 引文 (Scopus)

摘要

n statistical applications, logistic regression is a popular method for analyzing binary data accompanied by explanatory variables. But when one of the two outcomes is rare, the estimation of model parameters has been shown to be severely biased and hence estimating the probability of rare events occurring based on a logistic regression model would be inaccurate. In this article, we focus on estimating the probability of rare events occurring based on logistic regression models. Instead of selecting a best model, we propose a local model averaging procedure based on a data perturbation technique applied to different information criteria to obtain different probability estimates of rare events occurring. Then an approximately unbiased estimator of Kullback-Leibler loss is used to choose the best one among them. We design complete simulations to show the effectiveness of our approach. For illustration, a necrotizing enterocolitis (NEC) data set is analyzed.
原文英語
頁(從 - 到)n/a-n/a
期刊Risk Analysis
DOIs
出版狀態已發佈 - 2016

指紋

Logistic Models
Logistics
Necrotizing Enterocolitis
Perturbation techniques

Keywords

  • Kullback-Leibler loss
  • logistic regression
  • maximum likelihood estimate
  • uncertainty

引用此文

Estimating the Probability of Rare Events Occurring Using a Local Model Averaging. / Chen, Jin-Hua; Chen, Chun-Shu; Huang, Meng-Fan; Lin, Hung-Chih.

於: Risk Analysis, 2016, p. n/a-n/a.

研究成果: 雜誌貢獻文章

Chen, Jin-Hua ; Chen, Chun-Shu ; Huang, Meng-Fan ; Lin, Hung-Chih. / Estimating the Probability of Rare Events Occurring Using a Local Model Averaging. 於: Risk Analysis. 2016 ; 頁 n/a-n/a.
@article{8a55cf319b264cbb83309021c16cc3a1,
title = "Estimating the Probability of Rare Events Occurring Using a Local Model Averaging",
abstract = "In statistical applications, logistic regression is a popular method for analyzing binary data accompanied by explanatory variables. But when one of the two outcomes is rare, the estimation of model parameters has been shown to be severely biased and hence estimating the probability of rare events occurring based on a logistic regression model would be inaccurate. In this article, we focus on estimating the probability of rare events occurring based on logistic regression models. Instead of selecting a best model, we propose a local model averaging procedure based on a data perturbation technique applied to different information criteria to obtain different probability estimates of rare events occurring. Then an approximately unbiased estimator of Kullback-Leibler loss is used to choose the best one among them. We design complete simulations to show the effectiveness of our approach. For illustration, a necrotizing enterocolitis (NEC) data set is analyzed.",
keywords = "Kullback-Leibler loss, logistic regression, maximum likelihood estimate, uncertainty, Kullback-Leibler loss, logistic regression, maximum likelihood estimate, uncertainty",
author = "Jin-Hua Chen and Chun-Shu Chen and Meng-Fan Huang and Hung-Chih Lin",
year = "2016",
doi = "10.1111/risa.12558",
language = "English",
pages = "n/a--n/a",
journal = "Risk Analysis",
issn = "0272-4332",
publisher = "Wiley-Blackwell",

}

TY - JOUR

T1 - Estimating the Probability of Rare Events Occurring Using a Local Model Averaging

AU - Chen, Jin-Hua

AU - Chen, Chun-Shu

AU - Huang, Meng-Fan

AU - Lin, Hung-Chih

PY - 2016

Y1 - 2016

N2 - In statistical applications, logistic regression is a popular method for analyzing binary data accompanied by explanatory variables. But when one of the two outcomes is rare, the estimation of model parameters has been shown to be severely biased and hence estimating the probability of rare events occurring based on a logistic regression model would be inaccurate. In this article, we focus on estimating the probability of rare events occurring based on logistic regression models. Instead of selecting a best model, we propose a local model averaging procedure based on a data perturbation technique applied to different information criteria to obtain different probability estimates of rare events occurring. Then an approximately unbiased estimator of Kullback-Leibler loss is used to choose the best one among them. We design complete simulations to show the effectiveness of our approach. For illustration, a necrotizing enterocolitis (NEC) data set is analyzed.

AB - In statistical applications, logistic regression is a popular method for analyzing binary data accompanied by explanatory variables. But when one of the two outcomes is rare, the estimation of model parameters has been shown to be severely biased and hence estimating the probability of rare events occurring based on a logistic regression model would be inaccurate. In this article, we focus on estimating the probability of rare events occurring based on logistic regression models. Instead of selecting a best model, we propose a local model averaging procedure based on a data perturbation technique applied to different information criteria to obtain different probability estimates of rare events occurring. Then an approximately unbiased estimator of Kullback-Leibler loss is used to choose the best one among them. We design complete simulations to show the effectiveness of our approach. For illustration, a necrotizing enterocolitis (NEC) data set is analyzed.

KW - Kullback-Leibler loss, logistic regression, maximum likelihood estimate, uncertainty

KW - Kullback-Leibler loss

KW - logistic regression

KW - maximum likelihood estimate

KW - uncertainty

U2 - 10.1111/risa.12558

DO - 10.1111/risa.12558

M3 - Article

SP - n/a-n/a

JO - Risk Analysis

JF - Risk Analysis

SN - 0272-4332

ER -