GitHub - JingweiZuo/TSStreamMining: Time Series Stream Mining

Incremental and Adaptive Feature Exploration over Time Series Stream

DAVID Lab
Unversity of Versaille Saint-Quentin (UVSQ)
Université Paris-Saclay

A demonstration video can be found here -> Demo Video [1]

You can find the paper here -> IEEE Big Data 2019

By adopting the concepts Shapelet [2] and Matrix Profile [3], we conduct the first attempt to extract the incremental and adaptive features from Time Series Stream:

For data source with stable concept, learning model will be updated incrementally;
For data source with Concept Drift, we extract the adaptive Shapelets under the most recent concept;
An elastic caching mechanism allows to tackle the infinite TS instances in streaming context.

Configurations (demo)

Input File: the name should be end with "Train.csv"
dataset_folder: in each file, change the location of the datasets in the background. The selected input file will be saved/uploaded into this folder.
Data Augmentation: refer to preprocessing/TS_stream_preprocess.py. As Shapelet-based methods (e.g., SMAP) are noise resistant, we put randomly the noise of random durations into the original TS data to augment the data volume.

Web application (demo)

ISETS_webapp.py: main program, a web application based on Flask and Bokeh
ISETS_webbackend.py: the program for adaptive shapelet extraction and Concept Drift detection
draw_adaptive_shapelets.py: show the adaptive shapelets in the web interface
draw_TS_Stream.py: show in real time the input TS instances in the stream

Core algorithms

utils/: the repository which contains the basic file operations and similairty measure functions
memory_block.py: the caching mechanism including the computation of Matrix Profile for cached instance
SMAP_block.py: Shapelet extraction on MAtrix Profile
evaluation_block.py: the loss computation and the Concept Drift detection on TS Stream
adaptive_features/adaptive_features.py: Concept Drift detection and adaptive feature extraction
ISMAP/ISAMP.py: incremental Shapelet extraction on MAtrix Profile

Main contributions:

A novel strategy to evaluate Shapelet, which shows the first attempt of transferring the techniques in Time Series community to Data Stream community

Figure 1. Shapelet Evaluation by a loss-smoothed approach.
Test-then-Train Stategy: The novel strategy, not only accelerates the incremental Shapelet extraction in stable-concept context, but also helps with detecting Concept Drift in streaming context.
Elastic Caching Mechanism in Streaming context
Scalability & Explainability

Based on SMAP (Shapelet extraction on MAtrix Profile) [4], our system is capable of being distributed in Spark environment. In addition, apart from the interpretability provided by Shapelet features, the system shows a strong explainability for Shapelet Extraction and Shapelet updating process.
Traceability of extracted features

Citation

If you find this repository useful in your research, please consider citing the following paper:

@inproceedings{zuo2019incremental,
  title={Incremental and Adaptive Feature Exploration over Time Series Stream},
  author={Zuo, Jingwei and Zeitouni, Karine and Taher, Yehia},
  booktitle={2019 IEEE International Conference on Big Data (Big Data)},
  pages={593--602},
  year={2019},
  organization={IEEE}
}

[1] J. Zuo, K. Zeitouni, and Y. Taher, “ISETS: Incremental Shapelet Extraction from Time Series Stream“，(demo paper) ECML-PKDD’19

[2] L. Ye and E. Keogh, “Time series shapelets: A New Primitive for Data Mining,” in Proc. KDD 2009

[3] C. Yeh, Y. Zhu, L. Ulanova, and N. Begum, “Matrix Profile I: All Pairs Similarity Joins for Time Series: A Unifying View that Includes Motifs, Discords and Shapelets,” IEEE ICDM 2016

[4] J. Zuo, K. Zeitouni, and Y. Taher, “Exploring interpretable features for large time series with se4tec,” in Proc. EDBT 2019

Name		Name	Last commit message	Last commit date
Latest commit History 86 Commits
.idea		.idea
.vscode		.vscode
SourceCode		SourceCode
Spark_Cluster_Docker		Spark_Cluster_Docker
TestDataset		TestDataset
figures		figures
venv/ISETS/__pycache__		venv/ISETS/__pycache__
DemoPKDD19.mp4		DemoPKDD19.mp4
README.md		README.md
pycharm-debug-py3k.egg		pycharm-debug-py3k.egg
skeleton.py.lprof		skeleton.py.lprof

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Incremental and Adaptive Feature Exploration over Time Series Stream

A demonstration video can be found here -> Demo Video [1]

You can find the paper here -> IEEE Big Data 2019

Configurations (demo)

Web application (demo)

Core algorithms

Main contributions:

Citation

About

Releases

Packages

Languages

JingweiZuo/TSStreamMining

Folders and files

Latest commit

History

Repository files navigation

Incremental and Adaptive Feature Exploration over Time Series Stream

A demonstration video can be found here -> Demo Video [1]

You can find the paper here -> IEEE Big Data 2019

Configurations (demo)

Web application (demo)

Core algorithms

Main contributions:

Citation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages