OPTIMIZING PREDICTIVE ACCURACY WITH GRADIENT BOOSTED TREES IN FINANCIAL FORECASTING

Main Article Content

Sathishkumar Chintala
Madan Mohan Tito Ayyalasomayajula

Abstract

Financial forecasting is indispensable for various financial sectors' strategic decision-making, risk management, and investment planning. Traditional statistical methods, though valuable, frequently struggle to capture the intricate, non-linear patterns embedded within financial data. In response to these limitations, Gradient Boosted Trees (GBTs) have emerged as a formidable ensemble learning technique renowned for enhancing predictive accuracy. This comprehensive review paper delves into applying GBTs in financial forecasting, scrutinizing their methodological underpinnings, distinct advantages, and comparative performance against other machine learning approaches. GBTs, with their real-world applications in financial forecasting, operate by sequentially combining multiple weak learners, typically decision trees, to iteratively refine predictions by focusing on residual errors. This iterative approach enables GBTs to effectively model complex relationships and interactions in financial datasets, outperforming traditional models like ARIMA and linear regression in many scenarios. Moreover, the paper addresses critical implementation challenges associated with GBTs, such as hyperparameter tuning and computational complexity, which are pivotal for achieving optimal performance. The review identifies promising avenues for future research, including integrating GBTs with deep learning techniques and advancements in real-time forecasting capabilities. By elucidating these aspects, this paper aims to provide insights that enhance the application and efficacy of GBTs in financial forecasting contexts.

Downloads

Download data is not yet available.

Metrics

Metrics Loading ...

Article Details

How to Cite
Chintala, S. ., & Ayyalasomayajula, M. M. T. . (2019). OPTIMIZING PREDICTIVE ACCURACY WITH GRADIENT BOOSTED TREES IN FINANCIAL FORECASTING. Turkish Journal of Computer and Mathematics Education (TURCOMAT), 10(3), 1710–1721. https://doi.org/10.61841/turcomat.v10i3.14707
Section
Research Articles

References

. Agapitos, A., Brabazon, A., & O'Neill, M. (2017). Regularised gradient boosting for financial timeseries modelling. Computational Management Science, 14(3), 367-391.https://doi.org/10.1007/s10287-017-0280-y

. Anghel et al., A. (2019). Benchmarking and Optimization of Gradient Boosting Decision Tree Algorithms. Cornell University. https://doi.org/10.48550/arXiv.1809.04559

. Bakar, N. A., & Rosbi, S. (2017). Data clustering using autoregressive integrated moving average (ARIMA) model for Islamic country currency: An econometrics method for Islamic financial engineering. The International Journal of Engineering and Science, 06(06), 22-31. https://doi.org/10.9790/1813-0606022231

. Bhalaji, N., Kumar, K. S., & Selvaraj, C. (2018). Empirical study of feature selection methods over classification algorithms. International Journal of Intelligent Systems Technologies and Applications, 17(1/2), 98. https://doi.org/10.1504/ijista.2018.091590

. Chang, Y., Chang, K., & Wu, G. (2018). Application of extreme gradient boosting trees in the construction of credit risk assessment models for financial institutions. Applied Soft Computing, 73, 914-920. https://doi.org/10.1016/j.asoc.2018.09.029

. Deng, S., Wang, C., Wang, M., & Sun, Z. (2019). A gradient boosting decision tree approach for insider trading identification: An empirical model evaluation of China stock market. Applied Soft Computing, 83, 105652. https://doi.org/10.1016/j.asoc.2019.105652

. Döpke, J., Fritsche, U., & Pierdzioch, C. (2017). Predicting recessions with boosted regression trees. International Journal of Forecasting, 33(4), 745-759. https://doi.org/10.1016/j.ijforecast.2017.02.003

. Grover, P. (2017, December 13). Gradient boosting from scratch. Medium. https://blog.mlreview.com/gradient-boosting-from-scratch-1e317ae4587d

. Guo, T., Bifet, A., & Antulov-Fantulin, N. (2018). Bitcoin volatility forecasting with a glimpse into buy and sell orders. 2018 IEEE International Conference on Data Mining (ICDM). https://doi.org/10.1109/icdm.2018.00123

. H. R Sanabila; Wisnu Jatmiko. (2018). Ensemble Learning on Large Scale Financial Imbalanced Data. 2018 International Workshop on Big Data and Information Security (IWBIS). https://doi.org/10.1109/IWBIS.2018.8471702

. Haohua Wan. (2017). LightGBM: A Highly Efficient Gradient Boosting Decision Tree. courses.grainger.illinois.edu.

. Ilbeigi, M., Castro-Lacouture, D., & Joukar, A. (2017). Generalized autoregressive conditional Heteroscedasticity model to quantify and forecast uncertainty in the price of asphalt cement. Journal of Management in Engineering, 33(5). https://doi.org/10.1061/(asce)me.1943-5479.0000537

. Jiao, Y., & Jakubowicz, J. (2017). Predicting stock movement direction with machine learning: An extensive study on S&P 500 stocks. 2017 IEEE International Conference on Big Data (Big Data). https://doi.org/10.1109/bigdata.2017.8258518

. Kumar, H. P., & Patil, B. S. (2018). Forecasting volatility trend of INR USD currency pair with deep learning LSTM techniques. 2018 3rd International Conference on Computational Systems and Information Technology for Sustainable Solutions (CSITSS). https://doi.org/10.1109/csitss.2018.8768767

. Körner et al., P. (2018). Advantages of GBTs in Financial Forecasting. Technische Universität Dresden.

. Li, P., & Zhang, J. (2018). A new hybrid method for China's energy supply security forecasting based on ARIMA and XGBoost. Energies, 11(7), 1687. https://doi.org/10.3390/en11071687

. Li, Z. (2018). GBDT-SVM credit risk assessment model and empirical analysis of peer-to-peer borrowers under consideration of audit information. Open Journal of Business and Management, 06(02), 362-372. https://doi.org/10.4236/ojbm.2018.62026

. Mishra, A., & Ghorpade, C. (2018). Credit card fraud detection on the skewed data using various classification and ensemble techniques. 2018 IEEE International Students' Conference on Electrical, Electronics and Computer Science (SCEECS). https://doi.org/10.1109/sceecs.2018.8546939

. Oland, Anders, et al. (2017). Be Careful What You Backpropagate: A Case For Linear Output Activations & Gradient Boosting. arXiv preprint arXiv:1707.04199.

. Sakata, R., Ohama, I., & Taniguchi, T. (2018). An extension of gradient boosted decision tree incorporating statistical tests. 2018 IEEE International Conference on Data Mining Workshops (ICDMW). https://doi.org/10.1109/icdmw.2018.00139

. Touzani, S., Granderson, J., & Fernandes, S. (2018). Gradient boosting machine for modeling the energy consumption of commercial buildings. Energy and Buildings, 158, 1533-1543. https://doi.org/10.1016/j.enbuild.2017.11.039

. Wyrobek, J. (2018). Predicting bankruptcy at Polish companies: A comparison of selected machine learning and deep learning algorithms. Zeszyty Naukowe Uniwersytetu Ekonomicznego w Krakowie, (6(978)), 41-60. https://doi.org/10.15678/znuek.2018.0978.0603

. Xia, Y., Liu, C., Li, Y., & Liu, N. (2017). A boosted decision tree approach using Bayesian hyperparameter optimization for credit scoring. Expert Systems with Applications, 78, 225-241. https://doi.org/10.1016/j.eswa.2017.02.017

. Zhang, T., & Meng, S. (2018). Internet financial credit evaluation based on the fusion of GBDT and LR. Proceedings of the 2018 International Conference on Management, Economics, Education and Social Sciences (MEESS 2018). https://doi.org/10.2991/meess-18.2018.17