Open Access Open Access  Restricted Access Subscription Access

doi:10.3808/jei.201600341
Copyright © 2017 ISEIS. All rights reserved

Variable Selection Based on Statistical Learning Approaches to Improve PM10 Concentration Forecasting

A. Ben Ishak*

    Université de Tunis, ISGT, LR99ES04 BESTMOD, Le Bardo 2000, Tunisia

*Corresponding author. Tel: +216-97-549940 Fax: +216-71-588487 Email: anis_isg@yahoo.fr

Abstract


In this work, the problem of variable selection for regression is investigated in order to improve the forecasting accuracy. To that effect, the Support Vector Regression (SVR) and the Random Forests (RF) are used to assess the variable importance. Then, a stepwise algorithm is built to select the best subset of predictors. An intensive comparative study is conducted on simulated and real datasets. The real datasets expose the problem of particulate matter concentration forecasting in two monitoring stations from Tunisia. We have proposed a combined approach using SVR and RF for variable importance assessment and for variable selection. We have achieved a significant improvement in forecasts accuracy for the two stations when using only a reduced number of selected predictors.

Keywords: support vector regression, random forests, variable selection, stepwise algorithm, selection bias, particulate matter forecasting


Full Text:

PDF

Supplementary Files:

Refbacks

  • There are currently no refbacks.