Assessment of Statistical Models for Rainfall Forecasting Using Machine Learning Technique

Document Type : Regular Article

Authors

1 School of Computing, SASTRA Deemed to be University, Thanjavur, India

2 School of Civil Engineering, SASTRA Deemed to be University, Thanjavur, India

Abstract

As heavy rainfall can lead to several catastrophes; the prediction of rainfall is vital. The forecast encourages individuals to take appropriate steps and should be reasonable in the forecast. Agriculture is the most important factor in ensuring a person's survival. The most crucial aspect of agriculture is rainfall. Predicting rain has been a big issue in recent years. Rainfall forecasting raises people's awareness and allows them to plan ahead of time to preserve their crops from the elements. To predict rainfall, many methods have been developed. Instant comparisons between past weather forecasts and observations can be processed using machine learning. Weather models can better account for prediction flaws, such as overestimated rainfall, with the help of machine learning, and create more accurate predictions. Thanjavur Station rainfall data for the period of 17 years from 2000 to 2016 is used to study the accuracy of rainfall forecasting. To get the most accurate prediction model, three prediction models ARIMA (Auto-Regression Integrated with Moving Average Model), ETS (Error Trend Seasonality Model) and Holt-Winters (HW) were compared using R package. The findings show that the model of HW and ETS performs well compared to models of ARIMA. Performance criteria such as Akaike Information Criteria (AIC) and Root Mean Square Error (RMSE) have been used to identify the best forecasting model for Thanjavur station.

Keywords

Main Subjects


[1]     Hipni A, El-shafie A, Najah A, Karim OA, Hussain A, Mukhlisin M. Daily Forecasting of Dam Water Levels: Comparing a Support Vector Machine (SVM) Model With Adaptive Neuro Fuzzy Inference System (ANFIS). Water Resour Manag 2013;27:3803–23. https://doi.org/10.1007/s11269-013-0382-4.
[2]     Najah A, El-Shafie A, Karim OA, Jaafar O. Integrated versus isolated scenario for prediction dissolved oxygen at progression of water quality monitoring stations. Hydrol Earth Syst Sci 2011;15:2693–708. https://doi.org/10.5194/hess-15-2693-2011.
[3]     Mahsin M, Akhter Y, Begum M. Modeling Rainfall in Dhaka Division of Bangladesh Using Time Series. J Math Model Appl 2012;1:67–73.
[4]     Tektaş M. Weather Forecasting Using ANFIS and ARIMA MODELS. A Case Study for Istanbul. Environ Res Eng Manag 2010;1:5–10. https://doi.org/10.5755/j01.erem.51.1.58.
[5]     Sciences E. Time Series Analysis Model for Rainfall Data in Jordan : Case Study for Using Time Series Analysis P . E . Naill M . Momani King Abdul Aziz University , Jeddah , Kingdom of Saudi Arabia. Am J Environ Sci 2009;5:599–604.
[6]     Shamsnia SA, Shahidi N, Liaghat A, Sarraf A, Vahdat SF. Modeling of weather parameters using stochastic methods (ARIMA model)(case study: Abadeh Region, Iran). Int Conf Environ Ind Innov IPCBEE 2011;12:282–5.
[7]     Suhartono, Faulina R, Lusia DA, Otok BW, Sutikno, Kuswanto H. Ensemble method based on ANFIS-ARIMA for rainfall prediction. ICSSBE 2012 - Proceedings, 2012 Int Conf Stat Sci Bus Eng "Empowering Decis Mak with Stat Sci 2012:240–3. https://doi.org/10.1109/ICSSBE.2012.6396564.
[8]     Li G, Chang W, Yang H. A Novel Combined Prediction Model for Monthly Mean Precipitation with Error Correction Strategy. IEEE Access 2020;8:141432–45. https://doi.org/10.1109/ACCESS.2020.3013354.
[9]     Vienna A. R Core Team R: A language and environment for statistical computing 2017.
[10]   Hyndman [10] R. J. Forecasting functions for time series and linear models_. R package version 8.2. 2017.
[11]   Mila FA, Parvin MT. Forecasting Area, Production and Yield of Onion in Bangladesh by Using ARIMA Model. Asian J Agric Extension, Econ Sociol 2019:1–12. https://doi.org/10.9734/ajaees/2019/v37i430274.
[12]   Punia M, Joshi PK, Porwal MC. Decision tree classification of land use land cover for Delhi, India using IRS-P6 AWiFS data. Expert Syst Appl 2011;38:5577–83. https://doi.org/10.1016/j.eswa.2010.10.078.
[13]   Burlando, P.; Rosso, R.; Cadavid, L.G.; Salas J. Forecasting of short-term rainfall using ARMA models. J Hydrol 1993:144: 193–211.
[14]   Salas, J.D.; Obeysekera JT. ARMA model identification of hydrologic time series. Water Resour Manag 1982;18:1011–1021.
[15]   Ridwan WM, Sapitang M, Aziz A, Kushiar KF, Ahmed AN, El-Shafie A. Rainfall forecasting model using machine learning methods: Case study Terengganu, Malaysia. Ain Shams Eng J 2021;12:1651–63. https://doi.org/10.1016/j.asej.2020.09.011.
[16]   Valipour M. Ability of Box-Jenkins Models to Estimate of Reference Potential Evapotranspiration (A Case Study: Mehrabad Synoptic Station, Tehran, Iran). IOSR J Agric Vet Sci 2012;1:01–11. https://doi.org/10.9790/2380-0150111.
[17]   Papalaskaris T, Panagiotidis T, Pantrakis A. Stochastic Monthly Rainfall Time Series Analysis, Modeling and Forecasting in Kavala City, Greece, North-Eastern Mediterranean Basin. Procedia Eng 2016;162:254–63. https://doi.org/10.1016/j.proeng.2016.11.054.
[18]   Thakkar AK, Desai VR, Patel A, Potdar MB. Post-classification corrections in improving the classification of Land Use/Land Cover of arid region using RS and GIS: The case of Arjuni watershed, Gujarat, India. Egypt J Remote Sens Sp Sci 2017;20:79–89. https://doi.org/10.1016/j.ejrs.2016.11.006.