The finnts package, commonly referred to as “Finn”, is a standardized times series forecast framework developed by Microsoft Finance. It’s a result of years of effort trying to perfect a centralized forecasting practice that everyone in finance could leverage. Even though it was built for finance like forecasts, it can easily be extended to any type of time series forecast.
Finn takes years of hard work and thousands of lines of code, and simplifies the forecasting process down to one line of code. A single function, “forecast_time_series”, takes in historical data and applies dozens of models to produce a state of the art forecast. While simplifying the forecasting process down to a single function call might seem limiting, Finn actually allows for a lot of flexibility under the hood. In order to leverage the best components of Finn, please check out all of the other vignettes within the package.
library(finnts)
browseVignettes("finnts")
Getting started with Finn is as simple as 1..2..3
Data used in Finn needs to follow a few requirements, called out below.
A good example to use when producing your first Finn forecast is to leverage existing data examples from the timetk package. Let’s take a monthly example and trim it down to speed up the run time of your first Finn forecast.
library(finnts)
<- timetk::m4_monthly %>%
hist_data ::filter(date >= "2010-01-01") %>%
dplyr::rename(Date = date) %>%
dplyr::mutate(id = as.character(id))
dplyr
print(hist_data)
#> # A tibble: 264 x 3
#> id Date value
#> <chr> <date> <dbl>
#> 1 M1 2010-01-01 9480
#> 2 M1 2010-02-01 8800
#> 3 M1 2010-03-01 8810
#> 4 M1 2010-04-01 7690
#> 5 M1 2010-05-01 7710
#> 6 M1 2010-06-01 9660
#> 7 M1 2010-07-01 11870
#> 8 M1 2010-08-01 12920
#> 9 M1 2010-09-01 10890
#> 10 M1 2010-10-01 9500
#> # ... with 254 more rows
print(unique(hist_data$id))
#> [1] "M1" "M2" "M750" "M1000"
The above data set contains 4 individual time series, identified using the “id” column.
Calling for “forecast_time_series” function is the easiest part. In this example we will be running just two models.
<- forecast_time_series(
finn_output input_data = hist_data,
combo_variables = c("id"),
target_variable = "value",
date_type = "month",
forecast_horizon = 3,
back_test_scenarios = 6,
models_to_run = c("arima", "ets"),
run_global_models = FALSE,
run_model_parallel = FALSE
)
$final_fcst %>%
finn_outputhead() %>%
print()
$final_fcst %>%
finn_outputtail() %>%
print()
print(finn_output$back_test_data)
print(finn_output$back_test_best_MAPE)
Note: the best model for the “M1” combination is a simple average of “arima” and “ets” models.