Evaluating Regression and Ensemble Models for Financial Forecasting: The Case of Apple Stock

Authors

  • Runhai He Belarusian State University
  • Zhenxing Zhang Belarusian State University
  • Shoujing Zhang Belarusian State University
  • Qingqing Song Belarusian State University

DOI:

https://doi.org/10.5281/zenodo.13821480

ARK:

https://n2t.net/ark:/40704/JIEAS.v2n5a01

References:

18

Keywords:

Stock Prediction, Machine Learning, SVM, Random Forest, Ensemble Model

Abstract

This paper investigates the performance of a variety of machine learning models for Apple stock price prediction, covering linear regression, ridge regression, Lasso regression, SVM, weighted average, stacked model, and random forest methods. The dataset contains daily closing prices of Apple stock for the period from 2010 to 2024, using data from the first 13 years for model training and data from the last year for testing. The study results show that after many times of parameters tuning and testing, all models except Random Forest exhibit good prediction results, with simple models such as linear regression and ridge regression performing particularly well with fewer features, while the Random Forest model exhibits severe overfitting/underfitting problems. This study provides an empirical reference for the application of machine learning in financial time series forecasting, which can help to improve financial forecasting ability and investment decision-making in the future.

Downloads

Download data is not yet available.

Metrics

Metrics Loading ...

Author Biographies

Runhai He, Belarusian State University

Research fields: Data analysis and Software Engineering.

Zhenxing Zhang, Belarusian State University

Postgraduate student at Belarusian State University, Minsk, Belarus.

Shoujing Zhang, Belarusian State University

Postgraduate student at Belarusian State University, Minsk, Belarus.

Qingqing Song, Belarusian State University

Faculty of Applied Mathematics and Computer Science; Belarusian State University; 4 Nezavisimosti Avenue, Minsk 220030, Belarus; e-mails: fpm.sunC@bsu.by

References

Fama, E. F. (1970). Efficient capital markets: A review of theory and empirical work. Journal of Finance, 25(2), 383-417.

Patel, J., Shah, S., Thakkar, P., & Kotecha, K. (2015). Predicting stock market index using fusion of machine learning techniques. Expert Systems with Applications, 42(4), 2162-2172.

Zhang, G. P. (2003). Time series forecasting using a hybrid ARIMA and neural network model. Neurocomputing, 50, 159-175.

Dietterich, T. G. (2000). Ensemble methods in machine learning. In International Workshop on Multiple Classifier Systems (pp. 1-15). Springer.

Xinjie, (2014). Stock Trend Prediction With Technical Indicators using SVM. Stanford University.

Khaidem, L., Saha, S., & Dey, S. R. (2016). Predicting the direction of stock market prices using random forest. arXiv preprint arXiv:1605.00003.

Madge, S., & Bhatt, S. (2015). Predicting stock price direction using support vector machines. Independent Work Report Spring, 45.

Draper, N. R., & Smith, H. (1998). Applied Regression Analysis. John Wiley & Sons.

Hoerl, A. E., & Kennard, R. W. (1970). Ridge Regression: Biased Estimation for Nonorthogonal Problems. Technometrics, 12(1), 55–67.

Hastie, T., Tibshirani, R., & Friedman, J. (2009). The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer.

Zou, H., & Hastie, T. (2005). Regularization and Variable Selection via the Elastic Net. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 67(2), 301–320.

Friedman, J., Hastie, T., & Tibshirani, R. (2010). Regularization Paths for Generalized Linear Models via Coordinate Descent. Journal of Statistical Software, 33(1), 1-22.

Vapnik, V. N. (1995). The Nature of Statistical Learning Theory. Springer.

Smola, A. J., & Schölkopf, B. (2004). A Tutorial on Support Vector Regression. Statistics and Computing, 14(3), 199–222.

Breiman, L. (2001). Random Forests. Machine Learning, 45(1), 5–32.

Song, Q., & Xia, S. (2024). Research on the Effectiveness of Different Outlier Detection Methods in Common Data Distribution Types. Journal of Computer Technology and Applied Mathematics, 1(1), 13-25.

Neloy, M. A. I., Nahar, N., Hossain, M. S., et al. (2022). A weighted average ensemble technique to predict heart disease. In Proceedings of the Third International Conference on Trends in Computational and Cognitive Engineering: TCCE 2021 (pp. 17-29). Springer Nature Singapore.

Wolpert, D. H. (1992). Stacked Generalization. Neural Networks, 5(2), 241–259.

Downloads

Published

2024-10-01

How to Cite

[1]
R. He, Z. Zhang, S. Zhang, and Q. Song, “Evaluating Regression and Ensemble Models for Financial Forecasting: The Case of Apple Stock”, Journal of Industrial Engineering & Applied Science, vol. 2, no. 5, pp. 1–9, Oct. 2024.

Issue

Section

Articles

ARK