Feature Engineering for Tropical Rainfall Forecasting Using Random Forest and Support Vector Regression
Abstract
The complex dynamics of weather variability in Indonesia, influenced by multiple climatic drivers, make rainfall forecasting in tropical regions a significant scientific challenge. This study proposes an automated feature engineering pipeline to enhance the performance of Random Forest Regression (RFR) and Support Vector Regression (SVR) models for tropical rainfall prediction. Monthly rainfall data spanning 388 months (1993–2025) from a BMKG station were used as the basis for model development. The pipeline systematically generates temporal, seasonal, statistical, and anomaly-based features to provide domain-informed representations for non-sequential learning algorithms. Model performance was evaluated under four temporal data partitioning scenarios using R², RMSE, and probabilistic confidence intervals derived from bootstrap residual simulations. Results indicate that RFR achieved the highest predictive accuracy (R² = 0.93; RMSE = 31.01 mm) and demonstrated superior temporal–seasonal stability (Rolling CV: R² = 0.81 ± 0.07; RMSE = 55.44 ± 16.18), with comparable performance between wet and dry seasons. Conversely, SVR showed greater sensitivity to seasonal variability, with R² dropping to 0.55 during the wet season, indicating higher uncertainty under extreme rainfall conditions. Robustness and drift analyses further revealed that RFR adapts better to temporal and seasonal shifts, while SVR remains relevant as an adaptive model for extreme risk analysis. Overall, this study contributes to the development of automated feature engineering, reproducible climatological forecasting pipelines, and probabilistic modeling frameworks for rainfall prediction under uncertainty in tropical regions.
Article Metrics
Abstract: 196 Viewers PDF: 39 ViewersKeywords
Feature Engineering; Probabilistic Modeling; Rainfall Forecasting; Random Forest Regression; Support Vector Regression; Tropical Climate
Full Text:
PDF
DOI:
https://doi.org/10.47738/jads.v7i1.1111
Citation Analysis:
Refbacks
- There are currently no refbacks.
Journal of Applied Data Sciences
| ISSN | : | 2723-6471 (Online) |
| Collaborated with | : | Computer Science and Systems Information Technology, King Abdulaziz University, Kingdom of Saudi Arabia. |
| Publisher | : | Bright Publisher |
| Website | : | http://bright-journal.org/JADS |
| : | taqwa@amikompurwokerto.ac.id (principal contact) | |
| support@bright-journal.org (technical issues) |
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0




.png)