Land-use regression is a popular method for predicting ambient pollutant concentrations at points of interest where no measurements are taken. However, the model-building process is complicated, and systematically understanding when and how the process works is difficult. To overcome these limitations, we reformulate the existing land use regression method as a sign-constrained regression problem with an explicit objective function to be minimized. This novel formulation always leads to estimated regression coefficients that satisfy the predefined direction based on subject matter knowledge while simultaneously substantially improving the prediction performance of the existing land-use regression method. The advantages of the proposed sign-constrained regression method are confirmed through a numerical study and real data analysis.
Woojoo Lee was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (no. 2021R1A2C1014409 ). Soon-Sun Kwon was supported by the Basic Science Research Program of the National Research Foundation of Korea (NRF) funded by the Ministry of Science and ICT ( 2017R1E1A1A030 70345 , 2021R1A6A1A10044950 ).