JISE


  [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] [12] [13] [14] [15]


Journal of Information Science and Engineering, Vol. 30 No. 2, pp. 371-385


Selecting One Dependency Estimators in Bayesian Network Using Different MDL Scores and Overfitting Criterion


MENG HAN1,2, ZHIHAI WANG1 AND YASHU LIU1
1School of Computer and Information Technology
Beijing Jiaotong University
Beijing, 100044 P.R. China
2School of Computer Science and Engineering
Beifang University of Nationalities
Yinchuan, 750021 P.R. China

 


    The Averaged One Dependency Estimator (AODE) is integrated all possible Super- Parent-One-Dependency Estimators (SPODEs) and estimates class conditional probabilities by averaging them. In an AODE network some redundant SPODEs maybe result in some bias of classifiers, as a consequence, it could reduce the classification accuracy substantially. In this paper, a kind of MDL metrics is used to select SPODEs in a whole or partially, therefore there are three different classifiers presented. The performance comparisons between them and AODE have been shown not only the theoretical analyses are reasonable, but also efficient and effective. And Mean Square Error (MSE) is used to test overfitting. Experiential results have indicated that the classifier using MDL score metrics had better performance than original AODE, and at the same time, has less overfitting. At the end of the paper, further discussions and verifications of some properties of overfitting have also shown in the experiments.


Keywords: averaged one dependence estimators, overfitting, minimum description length, mean square error, Bayesian network, data mining

  Retrieve PDF document (JISE_201402_06.pdf)