Paper 2

TSPredIT: Integrated Tuning of Data Preprocessing and Time Series Prediction Models

Authors: Rebecca Salles, Esther Pacitti, Eduardo Bezerra, Celso Marques, Carla Pacheco, Carla Oliveira et al.

Volume 54 (2023)

Abstract

Prediction is one of the most important activities when working with time series. There are many alternative ways to model a time series, and finding the right one is challenging. Most data-centric models (either statistical or machine learning) have hyperparameters to tune, and setting them correctly is essential for good predictions. The task is even more complex because time series prediction also demands choosing data preprocessing that is compatible with the chosen model. Many time series frameworks, such as scikit-learn, have features to build models and tune their hyperparameters. However, few works address tuning data preprocessing hyperparameters together with model building. TSPredIT addresses this gap by providing a framework that seamlessly integrates the tuning of data preprocessing activities with the tuning of model hyperparameters. TSPredIT is made available as an R package, which provides functions for defining and conducting time series prediction, including data pre(post)processing, decomposition, hyperparameter optimization, modeling, prediction, and accuracy assessment. TSPredIT is also extensible, which significantly expands the framework’s applicability, especially in combination with other languages such as Python.
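
To make the idea of integrated tuning concrete, the following is a minimal sketch in plain Python, not TSPredIT's actual API. It assumes a toy setting: the preprocessing choice is either no transformation or first-order differencing, and the "model" is a moving-average forecaster whose only hyperparameter is its window size. All function names (`difference`, `forecast_next`, `validation_mse`, `joint_grid_search`) are hypothetical and introduced only for illustration. The point is that the preprocessing option and the model hyperparameter are searched jointly, over a single grid, rather than one after the other.

```python
from itertools import product


def difference(series):
    """First-order differencing: consecutive changes of the series."""
    return [b - a for a, b in zip(series, series[1:])]


def forecast_next(history, preprocess, window):
    """One-step-ahead forecast using the mean of the last `window` values.

    With preprocess == "diff", the mean is taken over differenced values
    and the result is added back to the last observation (postprocessing).
    """
    if preprocess == "diff":
        recent = difference(history)[-window:]
        return history[-1] + sum(recent) / len(recent)
    recent = history[-window:]
    return sum(recent) / len(recent)


def validation_mse(series, n_val, preprocess, window):
    """Mean squared error of rolling one-step forecasts on the last n_val points."""
    errors = []
    for t in range(len(series) - n_val, len(series)):
        pred = forecast_next(series[:t], preprocess, window)
        errors.append((series[t] - pred) ** 2)
    return sum(errors) / len(errors)


def joint_grid_search(series, n_val, preprocess_options, window_options):
    """Search preprocessing and model hyperparameters over one combined grid."""
    best = None
    for preprocess, window in product(preprocess_options, window_options):
        score = validation_mse(series, n_val, preprocess, window)
        if best is None or score < best[0]:
            best = (score, preprocess, window)
    return best


# Toy series with a linear trend (illustrative data, not from the paper).
series = [2.0 * t + 1.0 for t in range(20)]
best_score, best_preprocess, best_window = joint_grid_search(
    series, n_val=5, preprocess_options=["none", "diff"], window_options=[1, 2, 3]
)
print(best_score, best_preprocess, best_window)
```

On this trending toy series, differencing removes the trend, so the combined search selects the differenced variant. Tuning only the window on the raw series would never reach zero error here, which illustrates why the abstract argues for tuning preprocessing and model hyperparameters together.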

Keywords

time series, prediction, data preprocessing, machine learning, hyperparameter optimization