FDDBenchmark

Benchmarking fault detection and diagnosis methods.

FDDBenchmark is a Python package containing three components: a dataset, a dataloader, and an evaluator.

Overview of the FDDBenchmark components (fddbench_overview.png).

The dataset provides sensor data that is used to create a dataloader. The dataloader generates samples of sensor data, and the evaluator takes ground-truth labels from the testing dataloader together with labels predicted by an FDD model and reports metric values. Thanks to the dynamic generation of samples, the package allows models to be trained and evaluated efficiently in terms of both memory and time.
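A minimal sketch of this pipeline is shown below. The class names FDDDataset, FDDDataloader, and FDDEvaluator, their arguments, and the iteration protocol are assumptions based on the component names above, not a confirmed API; see the tutorial for actual usage.

```python
# Sketch of the dataset -> dataloader -> evaluator pipeline. NOTE: the class
# names and signatures below are assumptions for illustration; consult the
# package tutorial for the actual API.
import numpy as np
import pandas as pd
from fddbenchmark import FDDDataset, FDDDataloader, FDDEvaluator  # assumed names

dataset = FDDDataset(name='small_tep')      # assumed: loads a bundled dataset

test_loader = FDDDataloader(                # assumed: windows are sliced from the
    dataframe=dataset.df,                   # sensor dataframe on the fly, so the
    mask=dataset.test_mask,                 # full sample set never sits in memory
    label=dataset.label,
    window_size=10,
    step_size=1,
)

preds, labels = [], []
for sample, index, label in test_loader:    # assumed iteration protocol
    # Placeholder "model": flags every window as normal (class 0).
    preds.append(pd.Series(np.zeros(len(label)), index=index))
    labels.append(pd.Series(label, index=index))

evaluator = FDDEvaluator(step_size=1)       # assumed evaluator constructor
evaluator.print_metrics(pd.concat(labels), pd.concat(preds))  # assumed method
```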

Datasets

FDDBenchmark provides four datasets:

  1. rieth_tep. A Tennessee Eastman Process dataset based on Rieth, C. A., Amsel, B. D., Tran, R., & Cook, M. B. (2017). Additional Tennessee Eastman Process Simulation Data for Anomaly Detection Evaluation (Version V1) [Computer software]. Harvard Dataverse. https://doi.org/10.7910/DVN/6C3JR1
  2. reinartz_tep. A Tennessee Eastman Process dataset based on Reinartz, C., Kulahci, M., & Ravn, O. (2021). An extended Tennessee Eastman simulation dataset for fault-detection and decision support systems. Computers & Chemical Engineering, 149, 107281. The dataset is available at https://web.mit.edu/braatzgroup/links.html.
  3. small_tep. A cropped version of rieth_tep.
  4. lessmeier_bearing. A bearing vibration dataset based on Lessmeier, C., et al. (2016). Condition monitoring of bearing damage in electromechanical drive systems by using motor current signals of electric motors: A benchmark data set for data-driven classification. PHM Society European Conference, 3(1). The dataset is available at http://groups.uni-paderborn.de/kat/BearingDataCenter/.
Dataset           | Sensors | States (faults) | Runs  | Run length (samples) | Total size
rieth_tep         | 52      | 21 (20)         | 21000 | 500, 960             | 1.84 GB
reinartz_tep      | 52      | 29 (28)         | 2800  | 2000                 | 1.88 GB
small_tep         | 52      | 21 (20)         | 210   | 500, 960             | 18.9 MB
lessmeier_bearing | 1       | 3 (2)           | 220   | 256,000              | 755 MB

Metrics

Evaluation metrics fall into three groups: detection metrics, diagnosis metrics, and clustering metrics (the latter used in the unsupervised setting).

Detection metrics

  • Detection TPR and Detection FPR. The true positive rate and false positive rate of the binary classification task in which all faulty samples are positive examples and all normal samples are negative examples.
  • ADD, Average Detection Delay. The average number of samples between the first faulty sample and the first detected faulty sample. The average is taken over all faulty runs, excluding runs whose faults are never detected (false negatives). A sketch of these computations is given after this list.
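
The following sketch computes the detection metrics with NumPy. The data layout (one 0/1 label per sample, runs given as separate arrays) is an assumption for illustration; the package's evaluator computes the official values.

```python
import numpy as np

def detection_metrics(y_true, y_pred):
    """Detection TPR/FPR over samples. y_true and y_pred are 0/1 arrays
    where 1 marks a faulty sample (layout assumed for illustration)."""
    y_true, y_pred = np.asarray(y_true), np.asarray(y_pred)
    tpr = (y_pred[y_true == 1] == 1).mean()   # detected faulty / all faulty
    fpr = (y_pred[y_true == 0] == 1).mean()   # false alarms / all normal
    return tpr, fpr

def average_detection_delay(runs):
    """ADD: mean delay (in samples) between the first faulty sample and the
    first detected faulty sample, averaged over runs where the fault is
    detected. `runs` is a list of (y_true, y_pred) pairs, one per faulty run."""
    delays = []
    for y_true, y_pred in runs:
        fault_start = np.argmax(np.asarray(y_true) == 1)  # first faulty sample
        detections = np.flatnonzero(np.asarray(y_pred)[fault_start:] == 1)
        if detections.size:                   # skip undetected faults
            delays.append(detections[0])
    return float(np.mean(delays)) if delays else float('nan')
```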

Diagnosis metrics

  • TPR_i, True Positive Rate, also known as Detection Rate. The number of detected faulty samples of type i divided by the total number of faulty samples of type i.
  • FPR_i, False Positive Rate, also known as False Alarm Rate. The number of false alarms of type i divided by the number of normal samples.
  • CDR, Correct Diagnosis Rate. The total number of correctly diagnosed faulty samples divided by the number of detected faulty samples. A sketch of these computations is given after this list.
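
A sketch of the per-fault diagnosis metrics is below, assuming integer labels where 0 denotes the normal state and i >= 1 denotes fault type i; this encoding is an assumption for illustration.

```python
import numpy as np

def diagnosis_metrics(y_true, y_pred, fault_type):
    """TPR_i and FPR_i for one fault type i (labels: 0 = normal, i = fault i;
    the encoding is assumed for illustration)."""
    y_true, y_pred = np.asarray(y_true), np.asarray(y_pred)
    tpr_i = (y_pred[y_true == fault_type] == fault_type).mean()
    fpr_i = (y_pred[y_true == 0] == fault_type).mean()
    return tpr_i, fpr_i

def correct_diagnosis_rate(y_true, y_pred):
    """CDR: correctly diagnosed faulty samples / detected faulty samples."""
    y_true, y_pred = np.asarray(y_true), np.asarray(y_pred)
    detected = (y_true > 0) & (y_pred > 0)    # faulty and flagged as faulty
    if not detected.any():
        return float('nan')
    return (y_pred[detected] == y_true[detected]).mean()
```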

Clustering metrics

  • ACC, Unsupervised Clustering Accuracy. The maximum classification accuracy over all one-to-one matchings between cluster indices and ground-truth labels.
  • NMI, Normalized Mutual Information. The mutual information between ground-truth labels and cluster indices, normalized by the average of their entropies.
  • ARI, Adjusted Rand Index. The Rand index counts, over all pairs of samples, those whose cluster co-assignment agrees with the ground-truth labels; ARI is this index corrected for chance. A sketch of these metrics is given after this list.
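
These metrics can be sketched with standard tools: the Hungarian algorithm (scipy) for the ACC matching, and scikit-learn for NMI and ARI. Using these libraries is a choice made here for illustration, not necessarily how the package computes them; integer labels in 0..n-1 are assumed.

```python
import numpy as np
from scipy.optimize import linear_sum_assignment
from sklearn.metrics import normalized_mutual_info_score, adjusted_rand_score

def clustering_accuracy(labels, clusters):
    """ACC: best accuracy over one-to-one matchings of cluster indices to
    labels, found with the Hungarian algorithm on the contingency matrix."""
    labels, clusters = np.asarray(labels), np.asarray(clusters)
    n = max(labels.max(), clusters.max()) + 1
    counts = np.zeros((n, n), dtype=int)
    for y, c in zip(labels, clusters):
        counts[c, y] += 1                       # contingency counts
    row, col = linear_sum_assignment(-counts)   # maximize matched counts
    return counts[row, col].sum() / labels.size

labels = np.array([0, 0, 1, 1, 2, 2])
clusters = np.array([1, 1, 0, 0, 2, 2])         # same partition, renamed clusters
print(clustering_accuracy(labels, clusters))            # 1.0
print(normalized_mutual_info_score(labels, clusters))   # 1.0
print(adjusted_rand_score(labels, clusters))            # 1.0
```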

Installing

FDDBenchmark can be installed using pip as follows:

pip install git+https://github.com/AIRI-Institute/fddbenchmark

Examples

Examples of using FDDBenchmark can be found in the tutorial.