Datasets#
The datasets module contains:
dataset objects, which are in-memory representations of time series datasets. These are the programmatic way to represent and access datasets in sktime.
loaders which fetch datasets from data repositories on the internet, and retrieve them as in-memory datasets in sktime compatible formats
loaders which fetch an individual dataset, usually for illustration purposes
toy data generators for didactic and illustrative purposes
utilities to write to, and load from, time series specific file formats
Forecasting datasets#
Dataset repositories#
Interfaces to dataset repositories. Each instance of these classes
represents a different dataset. Datasets are downloaded from the sktime huggingface space
and cached on first use.
|
Monash Forecasting Repository dataset loader. |
Individual datasets - onboard#
Individual datasets distributed with sktime, available without internet access.
Load the airline univariate time series dataset [1]. |
|
|
Monash Forecasting Repository dataset loader. |
Return hierarchical sales toy data to demonstrate hierarchical forecasting. |
|
|
Load the Longley dataset for forecasting with exogenous variables. |
Load the Lynx dataset for univariate time series forecasting. |
|
|
Fetch M5 dataset from https://zenodo.org/records/12636070 . |
Load the US Macroeconomic dataset for multivariate time series forecasting. |
|
|
Load the Pharmaceutical Benefit Scheme (PBS) univariate time series dataset [1]. |
Load the Shampoo Sales dataset for univariate time series forecasting. |
|
|
Load USChange dataset for forecasting growth rates of consumption and income. |
Individual datasets - downloaded#
Individual datasets downloadable from the sktime huggingface space,
cached on first use.
|
Fetch M5 dataset from https://zenodo.org/records/12636070 . |
|
Load the GB National Solar Estimates dataset for time series forecasting. |
Classification datasets#
Dataset repositories#
Interfaces to dataset repositories. Each instance of these classes
represents a different dataset. Datasets are downloaded from the sktime huggingface space
and cached on first use.
|
UCR-UEA time series classification dataset loader. |
Individual datasets - onboard#
Individual datasets distributed with sktime, available without internet access.
|
ACSF1 time series classification problem. |
|
ArrowHead time series classification problem. |
|
BasicMotions time series classification problem. |
|
GunPoint time series classification problem. |
ItalyPowerDemand time series classification problem. |
|
|
JapaneseVowels time series classification problem. |
|
OSULeaf time series classification problem. |
|
PLAID time series classification problem. |
Regression datasets#
Load the Tecator time series regression problem. |
Dataset loader functions#
Loaders are raw functions which return datasets in sktime compatible formats.
For programmatic access to datasets, the dataset objects above should be preferred.
Loaders from dataset repositories#
These loaders access dataset repositories on the internet and fetch one or more datasets from there. Individual datasets are specified by name, as strings.
These loaders can be used to access reference datasets for benchmarking.
Functions to load and write datasets.
|
Fetch forecasting datasets from Monash Time Series Forecasting Archive. |
|
Load a dataset from the fpp3 package. |
|
Fetch M5 dataset from https://zenodo.org/records/12636070 . |
|
Load dataset from UCR UEA time series archive. |
Individual datasets#
These loaders fetch a commonly used individual dataset, usually for illustration purposes.
Single time series#
Load the airline univariate time series dataset [1]. |
|
|
Load the Longley dataset for forecasting with exogenous variables. |
Load the lynx univariate time series dataset for forecasting. |
|
Load the US Macroeconomic Data [1]. |
|
Load the shampoo sales univariate time series dataset for forecasting. |
|
|
Get national solar estimates for GB from Sheffield Solar PV_Live API. |
|
Load MTS dataset for forecasting growth rates of personal consumption and income. |
Panels of time series#
|
Load dataset on power consumption of typical appliances. |
|
Load the ArrowHead time series classification problem and return X and y. |
|
Load the BasicMotions time series classification problem and return X and y. |
|
Load the GunPoint time series classification problem and return X and y. |
|
Load ItalyPowerDemand time series classification problem. |
|
Load the JapaneseVowels time series classification problem. |
Load the US Macroeconomic Data [1]. |
|
|
Load the OSULeaf time series classification problem and return X and y. |
|
Load the Tecator time series regression problem and return X and y. |
Toy data generators#
Hierarchical time series data#
Return hierarchical sales toy data to demonstrate hierarchical forecasting. |
File format loaders and writers#
These utilities load and write from time series specific data file formats.
Note: for loading/writing from formats not specific to time series,
use common utilities such as pandas.read_csv.
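For a generic format such as CSV, a minimal sketch with `pandas.read_csv` could look like the following. The column names `date` and `value` and the in-memory CSV text are made up for illustration; a real file path would be passed in place of the `StringIO` object.

```python
import io

import pandas as pd

# Hypothetical CSV content standing in for a file on disk
csv_text = "date,value\n2020-01-01,1.0\n2020-01-02,2.5\n2020-01-03,4.0\n"

# Parse the date column and use it as the index, giving a time-indexed frame
df = pd.read_csv(io.StringIO(csv_text), parse_dates=["date"], index_col="date")

# A single column of the frame is a univariate time series as a pandas Series
y = df["value"]
print(y)
```

A Series or DataFrame with a datetime index like this is a common starting point for converting data into sktime compatible formats.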
|
Load data from a .arff file into a Pandas DataFrame. |
|
Load time series .ts file into X and (optionally) y. |
|
Load data from a .ts file into a Pandas DataFrame. |
|
Load data from a .tsv file into a Pandas DataFrame. |
|
Load data from a long format file into a Pandas DataFrame. |
|
Convert the contents in a .tsf file into a dataframe. |
|
Write an sktime multi-instance dataset to text file in .ts format. |
|
Output a dataset in dataframe format to .ts file. |
|
Output a dataset in ndarray format to .ts file. |
|
Transform dataset using a tabular transformer and write the result to arff file. |
|
Write the predictions for an experiment in the standard format used by sktime. |