An aggregation method for sparse logistic regression Online publication date: Wed, 03-May-2017
by Zhe Liu
International Journal of Data Mining and Bioinformatics (IJDMB), Vol. 17, No. 1, 2017
Abstract: L1 regularised logistic regression has now become a workhorse of data mining and bioinformatics: it is widely used for many classification problems, particularly ones with many features. However, L1 regularisation typically selects too many features and that so-called false positives are unavoidable. In this paper, we demonstrate and analyse an aggregation method for sparse logistic regression in high dimensions. This approach linearly combines the estimators from a suitable set of logistic models with different underlying sparsity patterns and can balance the predictive ability and model interpretability. Numerical performance of our proposed aggregation method is then investigated using simulation studies. We also analyse a published genome-wide case-control dataset to further evaluate the usefulness of the aggregation method in multi-locus association mapping.
Existing subscribers:
Go to Inderscience Online Journals to access the Full Text of this article.
If you are not a subscriber and you just want to read the full contents of this article, buy online access here.Complimentary Subscribers, Editors or Members of the Editorial Board of the International Journal of Data Mining and Bioinformatics (IJDMB):
Login with your Inderscience username and password:
Want to subscribe?
A subscription gives you complete access to all articles in the current issue, as well as to all articles in the previous three years (where applicable). See our Orders page to subscribe.
If you still need assistance, please email subs@inderscience.com