Lookup NU author(s): Dr Mark Willis,
Dr Moritz von Stosch
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (CC BY-NC-ND).
In this work, sparse regression using a penalized least absolute deviations objective function is considered. Regression model sparsity is promoted using a L0 - pseudo norm penalty (the cardinality of the model parameter vector). Implemented using mixed integer linear programming (MILP) it is demonstrated that the use of the L0 - norm (without approximation) enables efficient and accurate solutions to sparse regression problems of practical size. For model development with a large number of potential model parameters (or features) methods to relax the MILP are also developed; using nonlinear function approximations to the L0- norm, penalty terms are linearized and solved using sequential linear programming. Experimental results (using both simulated and real data) demonstrate that these algorithms are also computationally efficient producing accurate and parsimonious model structures. Applications considered are the development of a calibration model for prediction with Near Infrared (NIR) data and the development of a model for the prediction of chemical toxicity - a quantitative structure activity relationship (QSAR).
Author(s): Willis MJ, von-Stosch M
Publication type: Article
Publication status: Published
Journal: Chemometrics and intelligent laboratory systems
Print publication date: 15/06/2017
Online publication date: 12/04/2017
Acceptance date: 02/12/2016
Date deposited: 25/04/2017
ISSN (print): 0169-7439
ISSN (electronic): 1873-3239
Altmetrics provided by Altmetric