Skip to content

Commit

Permalink
#4 Update: paper proofread and updated
Browse files Browse the repository at this point in the history
  • Loading branch information
PetoLau committed Mar 7, 2018
1 parent f7899d5 commit 984e902
Show file tree
Hide file tree
Showing 4 changed files with 33 additions and 12 deletions.
Binary file added paper/modelBased.pdf
Binary file not shown.
21 changes: 20 additions & 1 deletion paper/paper.bib
Original file line number Diff line number Diff line change
Expand Up @@ -20,13 +20,21 @@ @Manual{jmotif
note = {R package version 1.0.2.900},
url = {https://CRAN.R-project.org/package=jmotif},
}
@inproceedings{Lau,
@inproceedings{Lau1,
title={Comparison of Representations of Time Series for Clustering Smart Meter Data},
author={Laurinec, Peter and Luck{\'a}, M{\'a}ria},
booktitle={Lecture Notes in Engineering and Computer Science: Proceedings of The World Congress on Engineering and Computer Science 2016},
pages={458--463},
year={2016}
}
@inproceedings{Lau2,
title={Adaptive Time Series Forecasting of Energy Consumption Using Optimized Cluster Analysis},
author={Laurinec, Peter and L{\'o}derer, Marek and Vrablecov{\'a}, Petra and Luck{\'a}, M{\'a}ria and Rozinajov{\'a}, Viera and Ezzeddine, Anna Bou},
booktitle={Data Mining Workshops (ICDMW), 2016 IEEE 16th International Conference on},
pages={398--405},
year={2016},
organization={IEEE}
}
@article{rcpp,
author = {Dirk Eddelbuettel and Romain Francois},
title = {Rcpp: Seamless R and C++ Integration},
Expand All @@ -39,3 +47,14 @@ @article{rcpp
doi = {10.18637/jss.v040.i08},
url = {https://www.jstatsoft.org/v040/i08}
}
@article{Esling,
author = {Esling, Philippe and Agon, Carlos},
journal = {ACM Computing Surveys},
keywords = {Distance measures,data indexing,data mining,query by content,sequence matching,similarity measures,stream analysis,temporal analysis,time series},
number = {1},
pages = {1--34},
publisher = {ACM},
title = {{Time-series data mining}},
volume = {45},
year = {2012}
}
24 changes: 13 additions & 11 deletions paper/paper.md
Original file line number Diff line number Diff line change
Expand Up @@ -11,27 +11,29 @@ affiliations:
- name: Faculty of Informatics and Information Technologies,
Slovak University of Technology in Bratislava
index: 1
date: 27 January 2018
date: 7 March 2018
bibliography: paper.bib
output: pdf_document
---

# Summary

TSrepr [@tsrepr] is R package for time series representations computing. Time series representations are, in other words, methods for dimensionality reduction, feature extraction or preprocessing of time series. They are used for:
`TSrepr` [@tsrepr] is an R package for time series representations computing. Time series representations are, in other words, methods for dimensionality reduction, feature extraction or for the preprocessing of time series. They are used for [@Esling]:

* significant reduction of the time series dimensionality
* emphasis on fundamental (essential) shape characteristics
* implicit noise handling
* reducing the dimension will reduce memory requirements and computational complexity of consequent machine learning methods (classification or clustering).
* Significant reduction of the time series dimensionality,
* Emphasis of fundamental (essential) shape characteristics,
* Implicit noise handling,
* Dimension reduction will reduce the memory requirements and computational complexity of consequent machine learning methods (e.g., classification or clustering).

The `TSrepr` package contains various methods and types of time series representations including the Piecewise Aggregate Approximation (PAA), the Discrete Fourier Transform (DFT), the Perceptually Important Points (PIP), the Symbolic Aggregate approXimation (SAX), the Piecewise Linear Approximation (PLA) and Clipping. Except for these well-known methods, additional methods suitable for seasonal time series are implemented. These methods are based on the model, for example multiple linear regression, robust regression, generalised additive model or triple exponential smoothing [@Lau1, @Lau2]. Own developed feature extraction methods from the Clipping representation are also implemented - FeaClip and FeaTrend.
In Figure 1, the comparison of all eight available model-based representations in the `TSrepr` on electricity consumption time series from the randomly picked residential consumer is shown.

The TSrepr package contains various methods and types of time series representations including Piecewise Aggregate Approximation (PAA), Discrete Fourier Transform (DFT), Perceptually Important Points (PIP), Symbolic Aggregate approXimation (SAX), Piecewise Linear Approximation (PLA) or Clipping. Except for these well-known methods, additional methods suitable for seasonal time series are implemented. These methods are based on the model as for example multiple linear regression, robust regression, generalized additive model or triple exponential smoothing [@Lau]. Own feature extraction methods from Clipping representation are also implemented - FeaClip and FeaTrend.

Additional useful functions and methods related to time series representations were also implemented. Package includes functions for normalisations and denormalisations of time series - z-score and min-max methods. It supports simple computation of windowing method, a matrix of representations and forecasting accuracy measures (MAE, RMSE, MAPE, sMAPE etc.).
![](modelBased.pdf)
Figure 1. The comparison of model-based time series representations on electricity consumption time series. The length of representations is $48$, the same as frequency of the daily season of the used time series.

Additional useful functions and methods related to time series representations were also implemented. The `TSrepr` package includes functions for normalisations and denormalisations of time series - z-score and min-max methods. It supports the simple computation of the windowing method, a matrix of representations and forecasting accuracy measures (MAE, RMSE, MAPE, sMAPE etc.).
Methods (functions) were implemented in base R and also in C++ for fast computations. In R, C++ programmes can be written thanks to the package Rcpp [@rcpp].

The general package for time series representations computations missed until now.
The CRAN's time series task view proves previous statement. Packages TSMining [@tsmining] and jmotif [@jmotif] both includes implementations of PAA and SAX time series representations methods. However, these packages are mainly focused on motif discovery in time series.
The general package for time series representations computations had missed so far. The CRAN's time series task view proves the previous statement. Packages `TSMining` [@tsmining] and `jmotif` [@jmotif] both includes implementations of PAA and SAX time series representations methods. However, these packages are mainly focused on motif discovery in time series.

# References
Binary file modified paper/paper.pdf
Binary file not shown.

0 comments on commit 984e902

Please sign in to comment.