Laikrindu modelēšana un prognozēšana ar statistiskiem un dziļās mašīnmācīšanās modeļiem

Zavjalova, Ksenija

View/Open

302-87816-Zavjalova_Ksenija_kz11004.pdf (1.740Mb)

Author

Zavjalova, Ksenija

Co-author

Latvijas Universitāte. Datorikas fakultāte

Advisor

Bārzdiņš, Guntis

Date

2022

Metadata

Show full item record

Abstract

Šajā maģistra darbā tiek aplūkota datu zinātnes problemātika – laikrindu prognozēšana ar statistiskiem un jaunākiem dziļās mašīnmācīšanās modeļiem. Mērķis ir salīdzināt trīs prognozēšanas modeļu veiktspēju un precizitāti, izmantojot veiktspējas mērījumus. Darbs teorētiskajā daļā 3., 4., un 5. nodaļās ir aprakstīta vispārējo laikrindu datu analīze, datu sagatavošana un galvenie statistisko modeļu ARIMA komponenti. Darba 6. nodaļa iepazīstina ar galvenajiem dziļās mašīnmācīšanās komponentiem un jaunākajām arhitektūrām – DeepAR un Temporal Future Transformer jeb TFT. Darbā praktiskajā daļā 7. un 8. nodaļās tiek izvērtētas statistisko un dziļās mašīnmācīšanās modeļu priekšrocības un trūkumi. Modeļu veiktspējas novērtēšanai tiek izmantotas trīs dažāda apjoma datu kopas. Darba rezultātā tiek parādīts, ka statistiskais modelis ARIMA pārspēj dziļās mašīnmācīšanās modeļus, ja tiek izmantotas mazākas datu kopas. Dziļās mašīnmācīšanās modelis TFT spēj parādīt salīdzinoši labus rezultātus. Ar TFT modeli iespējams efektīvi izmantot eksogēnos datus, lai uzlabotu prognozēšanas precizitāti.

In this thesis a data science topic is discussed – time series forecasting with statistical and novel deep learning models. Thesis presents the general analysis of time series data, data preparation and the main components of ARIMA statistical model. The second part of the thesis discusses the main components of deep learning and novel architectures - DeepAR and Temporal Future Transformer, or TFT. The main goal of this work is to compare the performance and the prediction accuracy of three models using performance measurements. The paper evaluates the advantages and disadvantages of statistical and deep learning models. Within the scope of this work three different data sets were used to evaluate the performance of the models. As a result, it has been shown that the statistical model ARIMA outperforms deep learning models when using smaller data sets, however, deep learning model TFT is able to show comparably good results. TFT model is able to effectively use exogenous data in order to improve forecasting accuracy.

URI

https://dspace.lu.lv/dspace/handle/7/59998

Collections

Bakalaura un maģistra darbi (EZTF) / Bachelor's and Master's theses [5688]