Abstract
The naïve Bayes approach is a simple but often satisfactory method for supervised classification. In this paper, we focus on the naïve Bayes model and propose the application of regularization techniques to learn a naïve Bayes classifier. The main contribution of the paper is a stagewise version of the selective naïve Bayes, which can be considered a regularized version of the naïve Bayes model. We call it forward stagewise naïve Bayes. For comparison's sake, we also introduce an explicitly regularized formulation of the naïve Bayes model, where conditional independence (absence of arcs) is promoted via an L1/L2 group penalty on the parameters that define the conditional probability distributions. Although already published in the literature, this idea has only been applied to continuous predictors. We extend this formulation to discrete predictors and propose a modification that yields an adaptive penalization. We show that, whereas the L1/L2 group penalty formulation only discards irrelevant predictors, the forward stagewise naïve Bayes can discard both irrelevant and redundant predictors, which are known to be harmful to the naïve Bayes classifier. Both approaches, however, usually improve the classical naïve Bayes model's accuracy.
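To illustrate the selective idea the abstract builds on, here is a minimal sketch of a selective naïve Bayes for discrete predictors, built by greedy forward selection (a simplification of the paper's stagewise procedure, not the authors' exact algorithm): at each step, the predictor that most improves training accuracy is added, so irrelevant or redundant predictors that do not help are simply never selected. The function names and the Laplace-smoothing choice are illustrative assumptions.

```python
import numpy as np

def fit_predict_nb(X_train, y_train, X_test, feats, alpha=1.0):
    """Categorical naive Bayes restricted to the predictor subset `feats`,
    with Laplace smoothing (alpha). Returns predicted class labels."""
    classes = np.unique(y_train)
    log_prior = np.log(np.array([np.mean(y_train == c) for c in classes]))
    scores = np.tile(log_prior, (len(X_test), 1))  # log-posterior per class
    for j in feats:
        values = np.unique(X_train[:, j])
        for ci, c in enumerate(classes):
            xc = X_train[y_train == c, j]
            # Smoothed conditional probabilities P(x_j = v | c)
            probs = {v: (np.sum(xc == v) + alpha) / (len(xc) + alpha * len(values))
                     for v in values}
            default = alpha / (len(xc) + alpha * len(values))  # unseen value
            scores[:, ci] += np.log([probs.get(v, default) for v in X_test[:, j]])
    return classes[np.argmax(scores, axis=1)]

def forward_selective_nb(X, y, alpha=1.0):
    """Greedy forward selection: repeatedly add the predictor that most
    improves training accuracy; stop when no addition helps."""
    selected, remaining = [], list(range(X.shape[1]))
    best = np.mean(fit_predict_nb(X, y, X, selected, alpha) == y)
    improved = True
    while improved and remaining:
        improved = False
        gains = [(np.mean(fit_predict_nb(X, y, X, selected + [j], alpha) == y), j)
                 for j in remaining]
        acc, j = max(gains)
        if acc > best:
            best, improved = acc, True
            selected.append(j)
            remaining.remove(j)
    return selected, best
```

On a toy dataset where predictor 0 determines the class and predictor 1 is noise, the loop selects predictor 0 and stops, mimicking the discarding behavior the abstract attributes to selective approaches. A production version would score candidates on held-out data rather than training accuracy to avoid overfitting.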
Vidaurre, D., Bielza, C. & Larrañaga, P. Forward stagewise naïve Bayes. Prog Artif Intell 1, 57–69 (2012). https://doi.org/10.1007/s13748-011-0001-7