(Translated by https://www.hiragana.jp/)
Deep Autoencoders for Unsupervised Anomaly Detection in Wildfire Prediction

Deep Autoencoders for Unsupervised Anomaly Detection in Wildfire Prediction

Abstract

Wildfires pose a significantly increasing hazard to global ecosystems due to the climate crisis. Due to its complex nature, there is an urgent need for innovative approaches to wildfire prediction, such as machine learning. This research took a unique approach, differentiating from classical supervised learning, and addressed the gap in unsupervised wildfire prediction using autoencoders and clustering techniques for anomaly detection. Historical weather and normalised difference vegetation index datasets of Australia for 2005 – 2021 were utilised. Two main unsupervised approaches were analysed. The first used a deep autoencoder to obtain latent features, which were then fed into clustering models, isolation forest, local outlier factor and one-class SVM for anomaly detection. The second approach used a deep autoencoder to reconstruct the input data and use reconstruction errors to identify anomalies. Long Short-Term Memory (LSTM) autoencoders and fully connected (FC) autoencoders were employed in this part, both in an unsupervised way learning only from nominal data. The FC autoencoder outperformed its counterparts, achieving an accuracy of 0.71, an F1-score of 0.74, and an MCC of 0.42. These findings highlight the practicality of this method, as it effectively predicts wildfires in the absence of ground truth, utilising an unsupervised learning technique.

\draftfalse\journalname

Earth and Space Science

School of Aerospace, Transport and Manufacturing, Cranfield University, UK Spire Global, Inc., Boulder, CO, United States

\correspondingauthor

Miguel Arana-Cataniamiguel.aranacatania@cranfield.ac.uk

{keypoints}

This research addresses a gap in the literature by exploring the advantages of unsupervised learning in wildfire prediction

This study focuses on Australia using a dataset comprising historical weather and normalized difference vegetation index

Our findings demonstrate the effectiveness of unsupervised learning using deep autoencoders for anomaly detection with an F1-score of 0.74

1 Introduction

Forests are essential to maintaining the planet’s ecological equilibrium, but they face threats from fires caused by both natural phenomena and human activities [Qiang \BOthers. (\APACyear2011)]. These wildfires are catastrophic events that negatively impact the environment, the economy, and the natural resources [Meddour-Sahar (\APACyear2015), Sayad \BOthers. (\APACyear2019)]. On the human front, they can lead to loss of life, respiratory ailments, and other health challenges. At the same time, economically, they can devastate communities, affecting livelihoods and leading to billions in damages. The impact on wildlife is also severe, threatening a diverse range of species with effects such as respiratory distress, altered behaviour, and even the threat of extinction. Simultaneously, the natural environment suffers degradation through soil erosion, loss of biodiversity, and the destruction of essential ecosystems such as forests [Sanderfoot \BOthers. (\APACyear2022)].

In recent years, the frequency of wildfires has surged, becoming a global concern that has drawn attention from various fields of study [Abid (\APACyear2021)]. A very recent example would be the wildfire episodes encompassing 25 million acres in Canada in 2023 [Canada (\APACyear\bibnodate)]. The growing threat of wildfire disasters is primarily attributed to climate change. These changes, characterised by higher temperatures and extended dry periods, have led to unprecedented bushfire activities, particularly within Australia [Vardoulakis \BOthers. (\APACyear2020)].

In 2019, Australia, the focus area of this research, encountered its hottest year on record, with the annual national mean temperature surpassing the average by 1.52°C. It also experienced its driest year on record, with notable heatwaves occurring in January and December [Yu \BOthers. (\APACyear2020)]. Between 2019 and 2020, Australia witnessed one of the largest wildfires in modern record. A minimum area of 46 million acres of land was burnt [Sulova \BBA Jokar Arsanjani (\APACyear2020)]. The 2019-20 wildfire is noted for having the strongest measures on record in terms of both magnitude and intensity [Zhang \BOthers. (\APACyear2021)]. Regarding their impacts on climate change and the environment, these fires directly claimed 33 human lives and over one billion indigenous animals [Norman \BOthers. (\APACyear2021)]. Furthermore, 417 individuals succumbed to smoke inhalation, and it is believed that 80% of Australia’s population was directly or indirectly affected by the fires [Borchers Arriagada \BOthers. (\APACyear2020), Norman \BOthers. (\APACyear2021), Ogie \BOthers. (\APACyear2022)]. Moreover, projections indicate that such wildfires will continue to occur due to multiple factors. Based on scientific estimates by the United Nations, extreme wildfires could increase by as much as 50% by the century’s end, spurred by the climate crisis and poor land management [Qiang \BOthers. (\APACyear2011)].

Simulating wildfires is challenging due to the complex interactions of weather conditions, fuel setups, and unpredictable fire movements. The wide geographical and time scales involved make this difficulty even greater, making it a problem that requires significant computational resources [Sayad \BOthers. (\APACyear2019)]. The difficulty in predicting wildfires emphasises the urgent need for innovative approaches, such as machine learning (ML) techniques. Indeed, numerous studies have been conducted to predict or detect wildfires in various scenarios using supervised learning techniques, such as support vector machines (SVM), random forests (RF), artificial neural networks (ANN), decision trees (DT), and more [Abid (\APACyear2021), Pang \BOthers. (\APACyear2022), Pérez-Porras \BOthers. (\APACyear2021), Shmuel \BBA Heifetz (\APACyear2022), Sulova \BBA Jokar Arsanjani (\APACyear2020)]. However, unsupervised learning in wildfire detection has not received much attention from researchers in the prediction phase, and only has been utilized in exploratory data analysis [Sayad \BOthers. (\APACyear2019)]. This research aims to bridge this gap by leveraging the advantages of unsupervised learning, such as accessibility to more data without the need for specific wildfire datasets, and the capability for broader and more flexible analysis without requiring labeled data. Advancements in unsupervised learning, exemplified by generative adversarial networks (GANs), a type of unsupervised autoencoder, have been shown to have potential for rapid, real-time applications in contexts such as wildfire detection [Zenati \BOthers. (\APACyear2018)]. This underscores the broader applicability and benefits of unsupervised learning techniques in this field. The benefits of using unsupervised learning methods are also shown in multiple and varied domains, requiring less computational power, less time consumption and use of less amount of data while yielding relatively good quality results in near-real time [Nhangumbe \BOthers. (\APACyear2023)], reducing the data processing effort by up to 97% while being reliable enough for the task requirements [Lehr \BOthers. (\APACyear2021)], showing strong scalability and interpretability [Chang \BOthers. (\APACyear2020)], achieving the task requirements while reducing the training cost [Tuo \BOthers. (\APACyear2023)], and sometimes producing results on par with supervised techniques [Haar \BOthers. (\APACyear2019), Pradhan \BOthers. (\APACyear2021)].

The study will specifically narrow its focus to Australia. This area is particularly prone to wildfires and has a significant history in this domain, making it a valuable subject for examination. The research aims to predict future wildfires based on anomaly detection before they happen by employing unsupervised learning techniques, using deep autoencoders and clustering. The dataset employed for this project is the combination of two separate datasets: historical weather and Normalised Difference Vegetation Index (NDVI) [IBM (\APACyear2020)]. It employs unsupervised learning techniques, specifically deep autoencoders and clustering, to predict wildfires through anomaly detection, utilizing a unique dataset comprising historical weather and normalized difference vegetation index data. The techniques employed are some of the most common unsupervised methods for anomaly detection [Li \BOthers. (\APACyear2021), Z. Cheng \BOthers. (\APACyear2021)]. This research addresses a gap in the literature by exploring the potential advantages of unsupervised learning in wildfire prediction, such as its reliance on readily available data and reduced computational demands [Nhangumbe \BOthers. (\APACyear2023), Lehr \BOthers. (\APACyear2021), Tuo \BOthers. (\APACyear2023)]. In addition to primary contributions, this study also addresses challenges in wildfire prediction and recommends ways to enhance predictive models in this domain.

The remaining sections of this paper are organised as follows: Section 2 offers a literature review; Section 3 provides details about the study area, dataset, and methodology; Section 4 presents results and findings; Section 5 discusses the results; and Section 6 provides conclusions.

2 Related Work

In this section we review relevant research to our project, beginning with a general study on wildfire prediction using machine learning, followed by specific reviews of the techniques employed in our work: anomaly detection, autoencoders, and the combination of both.

2.1 Wildfire Prediction Using ML Models

Various ML techniques and features have been applied and researched regarding forest fire prediction and detection. The most common and popular ML models used are the artificial neural networks (ANN), support vector machines (SVM), random forests (RF), XGBoost, and decision trees (DT), as highlighted by \citeAjain2020review in their review of 300 different papers.

A common focus in the works exploring ML techniques has been the joint use or comparison between SVM and ANN models. For instance, \citeAsayad2019predictive combined in a single pipeline SVM and ANN to forecast wildfires using a dataset based on the NDVI, land surface temperature (LST), and thermal anomalies from the Moderate Resolution Imaging Spectroradiometer (MODIS), and three characteristics relevant to the status of crops. The findings demonstrated high fire occurrence prediction accuracy (ANNs: 98.32%; SVM: 97.48%), defining it as the ratio of correctly predicted instances to the total instances. The study integrated NDVI, LST, and thermal anomalies with ground truth labelled data. However, the researchers recommended integrating weather data due to its strong relationship with wildfire incidence, development, spread, and extinction. They especially highlighted utilising certain variables which drastically affect wildfire incidences and likelihood: air temperature, wind, and soil moisture \citeAsayad2019predictive.

Another study included weather data, as suggested by \citeAsayad2019predictive. \citeAsakr_efficient_2011 compared SVM and ANN for predicting the likelihood of a fire using cumulative precipitation and relative humidity for several months of the year. SVM performed better in binary fire/no fire classification with the highest accuracy of 94.21%, while ANN obtained a maximum of 91.2%.

\citeA

gholamnia2020comparisons took a distinct approach and explored ML models for mapping wildfire susceptibility instead of predicting them. Using wildfire inventory data from global positioning systems (GPS) and MODIS fire occurrences, they considered sixteen conditioning factors, including meteorological parameters and a vegetation index. Among the eleven ML techniques compared, RF showed the highest accuracy of 88%, followed by SVM at 79%. The study is noteworthy for implementing the unsupervised self-organising maps (SOM) technique, which was however applied to a two-class data and has been adapted for use for classification purposes. The least accurate models were RBF, logistic regression, and SOM. Although not directly comparable to the subject of this paper, it serves as a valuable example of ML’s potential in the wildfire susceptibility mapping domain.

Turning attention to different geographical contexts, \citeAsulova2020exploratory conducted research similar to this study in terms of the study area, focusing on predicting wildfire risk in Australia over six months. They used topography, vegetation, infrastructure, meteorology, and socioeconomic data. The study employed Naïve Bayes (NB), Classification and Regression Tree (CART), and RF models, with RF achieving the highest accuracy of 96% and NB having the lowest accuracy of 64%.

\citeA

kondylatos2022wildfire aimed to accurately forecast wildfire danger using deep learning (DL) models such as LSTM and ConvLSTM and traditional models like RF and XGBoost. The research covered Greece, parts of the Balkan peninsula, and western Turkey. They utilised various datasets, which were including daily weather data from ERA-5 Land and satellite variables from MODIS, such as NDVI and Land Surface Temperature. Results from the study showed that LSTM and ConvLSTM achieved F1-scores greater than 0.8, outperforming RF and XGBoost in all metrics. ConvLSTM exhibited fewer false positives and higher precision, while LSTM had the lowest number of false negatives and highest recall.

\citeA

arpaci2014using examined fire occurrence data in Austria’s Tyrol area from 1993 to 2011, incorporating terrain, vegetation, climate, and socioeconomic factors. Their research intended to locate fire-prone locations and predict the geographical spread of fires. They used the Maximum Entropy (MaxEnt) and RF methods to identify the primary variables influencing the spread of fires. Climate change and population density have become essential factors in the susceptibility of a given area to wildfires. RF model was able to forecast the distribution of fire ignition with a classification accuracy of between 75 and 78%. Both MaxEnt and RF algorithms were usually equivalent when categorising fire susceptibility levels, except for the extremely high susceptibility class, where MaxEnt identified 28 places compared to 2 by RF [Arif \BOthers. (\APACyear2021)].

\citeA

malik2021wildfire conducted fire risk prediction in San Diego, California, using weather and fire history data, including temperature, soil moisture, relative humidity, and wind speed. SVM, XGBoost, and RF achieved accuracies of 98.18%, 98.23%, and 97.54%, respectively, while the F1-scores were 97%, 98%, and 91%, respectively.

2.2 Anomaly Detection in Wildfire Prediction

Anomaly detection aims to identify anomalous patterns, often known as outliers or anomalies, that significantly deviate from the rest of the data [Thudumu \BOthers. (\APACyear2020)]. Such a task can be performed using supervised or unsupervised learning techniques. In this study, an unsupervised learning version will be implemented. In extreme event scenarios, it is difficult to find appropriately labelled datasets because detecting anomalous observations requires effort and expertise. Therefore, supervised learning based on labelled data may not be a viable and suitable solution in all anomaly detection circumstances [Esmaeili \BOthers. (\APACyear2023)]. This underscores the value and contribution of unsupervised anomaly detection.

To the best of our knowledge, there is no study implementing anomaly detection directly in wildfire prediction using unsupervised learning techniques. However, there exists a closely related study working in wildfire risk prediction using anomaly detection method, even though it is not an exact match with this research’s topic. In their research, \citeAsalehi_dynamic_2016 set out to use weather data to better predict the risk of wildfires. They devised a new model named Context-Based Fire Risk (CBFR) that considers the changing patterns of weather over time using a context-based anomaly detection method. They used ECMWF ERA-5 weather data from the Blue Mountains in Australia. To assess the performance of their model, the authors compare it to the commonly used McArthur FFDI method. They looked at AUC and the straight-up accuracy rate. Their CBFR model performed better on both counts. Specifically, CBFR had an AUC of 0.96 and an accuracy of 0.94, while the FFDI method scored 0.89 for AUC and 0.87 for accuracy.

2.3 Autoencoders and Dimensionality Reduction

The first autoencoder neural network was developed to reduce dimensionality [Masci \BOthers. (\APACyear2011)], showing specific advantages over other types of traditional dimensionality reduction methods. For instance, since they are non-linear, autoencoders can generally outperform linear traditional methods in the context of dimensionality reduction [Bank \BOthers. (\APACyear2020)].

Although there have been no previous examples of such use of autoencoders in the wildfire prediction domain, there is one study focusing on post-fire assessment [Coca \BBA Datcu (\APACyear2021)]. The authors introduced a framework for burned area estimation using multispectral images. Relying on ESA’s Sentinel-2 (S2) sun synchronous satellite imagery, they focused on a wildfire-impacted area in Portugal from 2017. The authors employed an autoencoder to extract deep features from the data. They used the autoencoder’s capability to not only reduce dimensionality, but also extract deep features from input data. After attaining these features, one-class Support Vector Machine (OCSVM) [Shin \BOthers. (\APACyear2005)] is utilised for anomaly detection. The experiment processed both normal and abnormal patches from the dataset, achieving an accuracy of 0.71 at 7.9% abnormality.

2.4 Autoencoders and Anomaly Detection

2.4.1 Anomaly Detection Based on Reconstruction Error

Using autoencoders to detect anomalies using the reconstruction error is a technique well applied in several areas. The idea is as follows: a trained autoencoder would learn the latent subspace based on only normal (non-anomalous) instances. Once trained, the input data is reconstructed, and reconstruction errors, i.e., the difference between the original and reconstructed data, are calculated. As a result, while normal instances have low reconstruction errors, anomalies have high reconstruction errors [Bank \BOthers. (\APACyear2020)].

To the best of our knowledge, no existing literature has explored anomaly detection using only deep autoencoders and reconstruction error in the context of wildfire prediction. The forthcoming referenced studies, while they are from varying domains, share a methodology closely aligned with this research.

\citeA

cheng2022resnet utilised a ResNet autoencoder to identify anomalies in radar data. Their model incorporated both convolutional and LSTM layers. Convolutional layers captured features, while LSTM layers assessed temporal dependencies in radar data. The model achieved an accuracy potential of up to 85%. \citeAesmaeili2023anomaly trained deep autoencoders on normal data to spot anomalies in a similar domain. Using autoencoder-based prediction models on electrochemical aptasensor recordings with various signal lengths, they used reconstruction errors to set up a threshold for assigning anomalies and used kernel density estimation. They implemented vanilla, ULSTM, and Bi-LSTM autoencoder. The integrated ULSTM-vanilla model achieved about 80% accuracy for longer signal datasets, 65% and 40% for shorter ones.

\citeA

zhao2017spatio introduced a unique model called Spatio-Temporal Autoencoder (STAE) in a different domain that learns video representation and detects anomalies in the video by extracting features from spatial and temporal dimensions using deep autoencoders. The utilisation of reconstruction error shares common points with the approach employed by \citeAhasan2016learning. Once their model was trained, both studies produced a “regularity score” based on the reconstruction error, highlighting that video sequences with regular events had low errors. In contrast, anomalous sequences had high errors, allowing them to spot anomalies.

2.4.2 Anomaly Detection Based on Clustering through Latent Features

Clustering is organising unlabelled data into similar groupings known as clusters. A cluster is a group of data items that are similar to one another but differ from other clusters [Serra \BBA Tagliaferri (\APACyear2019)]. When labelled data is not available, clustering offers a valuable alternative to supervised learning techniques.

Nevertheless, most traditional clustering algorithms are sensitive to the dimension of the input data, making them suffer from “the curse of dimensionality” [Saxena \BOthers. (\APACyear2017)], which makes them less effective in high-dimensional spaces than in low-dimensional data [Aggarwal \BBA Reddy (\APACyear2013)]. In order to solve this problem, dimensionality reduction-based clustering algorithms have been developed. These techniques translate data to lower dimensions utilising techniques like PCA, kernel-based PCA, and autoencoders [Ma (\APACyear2022)]. For instance, \citeAtian2014learning described a simple method that involved employing stacked autoencoders to acquire the non-linear embedding of the features (latent features) and then feeding that information directly to k-means clustering. Similarly, \citeAma2022achieving examined several models for various clustering techniques on several datasets, including MNIST, HHAR, and REUTERS-10K. It has been noted that the Gaussian Mixture Model (GMM) alone only achieved 47.8% accuracy for the MNIST dataset. However, the accuracy rose to 79.34% when autoencoders and GMM were combined (AE+GMM), which might be a promising improvement. Like this, when AE and latent characteristics were included, normalised mutual information (NMI) findings rose from 38.24% to 81.41%.

3 Methodology

The code within this research has been developed using Python 3.10.10. The procedures related to autoencoder modelling, including training, compilation, fine-tuning, and adaptation, were developed using TensorFlow 2.10.0 [Abadi \BOthers. (\APACyear2016)]. In the context of clustering, the Scikit-learn 1.2.2 library was employed [Pedregosa \BOthers. (\APACyear2018)].

3.1 Data

This paper focuses on the region of Australia. It includes seven states: New South Wales (NSW), Northern Territory (NT), Queensland (QL), South Australia (SA), Victoria (VI), Western Australia (WA), and Tasmania (TA). The dataset used is a combination of 2 datasets. The first dataset, released by IBM, is the historical weather and vegetation index. It was released for the Call for Code Spot Challenge for Wildfires competition, and is publicly available [IBM (\APACyear2020)]. It includes historical weather and vegetation index information from 2005 to 2021 as daily aggregated values. The second dataset, the Wildfires dataset is a component of the main dataset [IBM (\APACyear2020)] and it is only used to evaluate the performances of the obtained model. Since we are following an unsupervised approach, the “wildfire” ground truth information obtained from the Wildfires dataset is not used during any training phase. Necessary information regarding the datasets is summarised in Table 1.

Table 1: Details of used datasets [IBM (\APACyear2020)].
Dataset Pre-processing details
Wildfires [\APACcitebtitleMODIS/Aqua+Terra Thermal Anomalies/Fire locations 1km FIRMS V0061 NRT (\APACyear2020)] Spatial averaging into seven regions in Australia Aggregation by day from 2005 to 2021 Only high-confidence fires (>>>75%) considered
Historical Weather - ERA5-Land hourly data [Muñoz Sabater (\APACyear2019)] Raw ERA5 data was processed using IBM PAIRS Geoscope Temporal aggregation, unit conversion, and transformations are applied to each parameter. Spatial aggregation using physical area weights of each pixel
NDVI - MOD13Q1 MODIS/Terra Vegetation Indices 16-Day L3 Global 250m SIN Grid V006 [Didan (\APACyear2015)] Spatial averaging into seven regions in Australia from 2005 to 2021

The data presented is aggregated daily and encompasses estimations by NASA, having undergone certain processing stages [IBM (\APACyear2020)].

Figure 1 provides an overview of fire area size across all Australian regions from 2005 to 2021. A significant surge in fire incidents stands out in 2006 and 2007. Remarkably, there was another notable increase in active fire occurrences during 2011 and 2012, followed by a subsequent rise in wildfires during 2019 and 2020.

Refer to caption
Figure 1: Fire area size over the years for all regions.

The two initial datasets (Weather and NDVI) were merged to be used as input data. Concerning the target variable, given that this is an unsupervised learning work, the actual wildfire ground truth is not utilised during the training and modelling phases. Nonetheless, to evaluate model performance and outcomes, a binary target variable labelled as“Fire” is generated from the Estimated Fire Area Size variable using the subsequent formula:

Fire={1,EstimatedFireArea00,EstimatedFireArea=0\mathrm{Fire}=\left\{\begin{matrix}1,\mathrm{Estimated\ Fire\ Area\ }{\neq}0\\ ~{}0,\mathrm{Estimated\ Fire\ Area\ }=0\end{matrix}\right.roman_Fire = { start_ARG start_ROW start_CELL 1 , roman_Estimated roman_Fire roman_Area ≠ 0 end_CELL end_ROW start_ROW start_CELL 0 , roman_Estimated roman_Fire roman_Area = 0 end_CELL end_ROW end_ARG (1)
Table 2: Details of features.
Dataset Features Range and Unit
Wildfires Date 2005-01-01 - 2021-01-23
Region [NSW, NT, QL, SA, VI, WA, TA]
Estimated Fire Area Size 0 \leq Estimated Fire Area Size (km2)<<<10120.94
(Estimated Fire Area Size is converted to a binary variable) Fire [0,1]
Historical Weather Date 2005-01-01 - 2021-01-23
Region [NSW, NT, QL, SA, VI, WA, TA]
Minimum, maximum, mean, and variance of each of these parameters: Precipitation 0 \leq Precipitation (mm/day) \leq 509.83
Relative humidity 0 \leq Relative humidity (mm/day) \leq 509.83
Soil water content 0 \leq Soil water content (m3 m-3) \leq 0.52
Solar radiation 0.41 \leq Solar radiation (MJ/day) \leq 35.69
Temperature -5.05 \leq Temperature (°C) \leq 41.73
Wind speed 0.25 \leq Wind speed (m/s) \leq 24.27
NDVI Date 2005-01-01 - 2021-01-23
Region [NSW, NT, QL, SA, VI, WA, TA]
Minimum, maximum, mean, and variance of: NDVI 0 \leq NDVI (index) \leq 1

Table 2 provides an overall list of all features and their range and units. In the final version of the dataset, there are 26,677 instances of wildfires and 14,299 instances of non-wildfires. This imbalance arises because the dataset is aggregated daily. If even a single wildfire occurs on a given day, that day is labelled as a “wildfire” day.

3.1.1 Feature Importance

Given that certain variables might be interrelated, and some may not provide significant value, RF models inherently select a subset of significant variables for classification, instead of using all variables, and thus enhancing the classification performance of high-dimensional data. This feature selection can be visualized through “Gini importance”, also known as mean decrease in impurity (MDI), which ranks the relevance of features [Sutera (\APACyear2021)]. RF was employed to get a feature importance plot with MDI values, as seen in Figure 2. Parameters used for the RF fit can be found in Table 3.

Table 3: Parameters used in the RF fit.
Parameter Value
Number of estimators 100
Criterion “Gini”
Min samples split 2
Min samples leaf 2
Refer to caption
Figure 2: Feature importance values generated by RF.

Figure 2 provides evidence that several variables, such as the Vegetation Index Min, Wind Speed Mean or Wind Speed Variance, contribute minimally to the model. This observation is particularly pronounced in the RF, highlighting numerous less impactful features.

In the deep autoencoder modelling part, two datasets will be analysed:

  1. 1.

    Dataset 1 includes all 28 variables without any exclusions.

  2. 2.

    Dataset 2 is composed of 17 features. Based on a close look at Figure 2, it is seen that there is a significant decrease in MDI after the Vegetation Index Max value. Yet, since many studies emphasize the significance of the Temperature variable in this field, Temperature Min is retained as well. Consequently, any variable with an MDI value below Temperature Min was manually removed for experimental purposes.

Comparison of these two dataset results will allow us to assess the models’ performance concerning variable contribution and dimensional variance. The complete set of features for both data versions can be found in Table 4. ‘Region’ and ‘Date’ features are omitted as they do not offer any insights into the model.

Table 4: Features for two versions of the dataset.
Dataset 1 Dataset 2
Precipitation_Max Vegetation_index_Mean
Precipitation_Mean Temperature_Max
Precipitation_Variance SoilWaterContent_Mean
RelativeHumidity_Max RelativeHumidity_Min
RelativeHumidity_Mean RelativeHumidity_Mean
RelativeHumidity_Min RelativeHumidity_Variance
RelativeHumidity_Variance Temperature_Variance
SoilWaterContent_Max SolarRadiation_Max
SoilWaterContent_Mean Vegetation_index_Std
SoilWaterContent_Min Vegetation_index_Variance
SoilWaterContent_Variance Temperature_Mean
SolarRadiation_Max SoilWaterContent_Max
SolarRadiation_Mean SoilWaterContent_Variance
SolarRadiation_Min SolarRadiation_Mean
SolarRadiation_Variance WindSpeed_Min
Temperature_Max Vegetation_index_max
Temperature_Mean Temperature_Min
Temperature_Min
Temperature_Variance
WindSpeed_Max
WindSpeed_Mean
WindSpeed_Min
WindSpeed_Variance
Vegetation_index_Mean
Vegetation_index_Max
Vegetation_index_Min
Vegetation_index_Std
Vegetation_index_Variance

3.1.2 Data Split

The data is split into three parts: training, validation, and testing. For training, data with only normal (non-wildfire) cases are used. A mix of data from both classes is needed for the validation and test parts. The original dataset has 14,299 non-wildfire (NW) cases (35%) and 26,677 wildfire (W) cases (65%). This imbalance within the classes arises due to the daily aggregation of data. If even a single minor wildfire is reported on a given day, that day is labelled as a “wildfire”, increasing the wildfire count.

The autoencoder is trained solely on non-wildfire data. Therefore, a substantial portion of NW from the original dataset (12,869 cases) is allocated to the train data, ensuring maximum use of the non-wildfire information present. The remaining NW cases are evenly divided between the test and validation datasets, with each receiving 715 NW cases. For test and validation datasets, wildfire cases are initially split into 13,338 and 13,339, respectively. However, given that the train data consisted of a total of 12,869 instances, these numbers seemed disproportionate. To address this, 1,000 wildfire cases are randomly dropped for both the test and validation datasets. Thus, each test and validation set consists of 1,000 wildfire cases with 715 non-wildfire cases, resulting in datasets with (W=1,000, NW=715).

This process ensures that the test and validation dataset sizes are not excessively larger than the train data due to the class imbalance. Consequently, the distribution for training, testing, and validation sets is roughly 80%, 10%, and 10%. Still, it is worth noting that, since some instances had to be dropped due to the imbalance in the original dataset, some information in the original data was not fully utilized during modelling, which could be listed as a limitation. This whole procedure is applied separately for both Dataset 1 and Dataset 2. Min-max scaling was employed to ensure that all features in the dataset have a uniform scale, allowing them to contribute equally to the model’s performance. Each dataset - train, validation, and test, was scaled based on its own statistics. Therefore, it was ensured that data leakage did not occur, preserving the integrity of the evaluation. Through this scaling procedure, it is ensured that every feature within the datasets is within the range of 0 to 1.

3.2 Methods

In this study, we focus on two main unsupervised approaches: using a deep autoencoder to obtain latent features for clustering models and using a deep autoencoder to reconstruct input data and identify anomalies through reconstruction errors. These strategies have been discussed extensively in the Related Work section, where their applications and effectiveness in different contexts were reviewed in detail. The main aim here is to fill the research gap in unsupervised learning for wildfire prediction using unlabelled data. Deep autoencoders are utilised to detect anomalies in data, classifying them as “wildfire” or “non-wildfire.” The model is trained on non-anomaly instances. Two strategies are explored:

  1. 1.

    Using the deep autoencoder for training, then reconstructing input data in lower dimensions and applying clustering methods to these latent features to differentiate wildfires.

  2. 2.

    Using the deep autoencoder to reconstruct input data, where high reconstruction errors signify anomalies. Here, both Fully Convolutional (FC) and Long-Short-Term-Memory (LSTM) autoencoders are employed, with the latter being adept at capturing temporal patterns in time-series data, a feature that FC architectures lack.

In the following sections, we start by explaining what a deep autoencoder is in Section 3.2.1, including its variants such as Fully Connected (FC) and LSTM Autoencoders. Then, in Sections 3.2.2 and 3.2.2, we detail the two main methods utilized in this paper that employ deep autoencoders for anomaly detection.

3.2.1 Deep Autoencoders

An autoencoder is a unique variant of feed-forward neural networks aiming to have the output closely mirror the input. Figure 3 shows that both the input and output layers have matching dimensions. An autoencoder consists of two main parts: an encoder and a decoder as provided in Figure 3.

Refer to caption
Figure 3: Architecture of a sample autoencoder [Kucharski \BOthers. (\APACyear2020)]

The encoder section comprises one or multiple hidden layers, in this case, FC or LSTM layers, that compress the input into a condensed representation. Conversely, the decoder uses its hidden layers, which can also be FC or LSTM layers, to reconstruct this condensed representation back to the original input’s dimensionality [Kucharski \BOthers. (\APACyear2020)]. The unsupervised training process optimises these hidden layers to minimise the difference between the reconstructed output and the original input. This study will explore autoencoders with FC and LSTM hidden layers to comprehensively understand their performance and capabilities.

LSTM Autoencoder.

As mentioned, a deep autoencoder can include hidden LSTM layers in its encoder and decoder components. Using LSTM layers provides the advantage of capturing temporal information, if any, in the time-series data. To utilise the LSTM autoencoder, the dataset must be reshaped into a format compatible with what the LSTM requires. Thus, for the training, testing, and validation datasets, sequences of a 10-day length were prepared. Table 5 showcases the precise shape details of the inputs prior to feeding them into the LSTM autoencoder. Table 6 provides detailed information on the final model’s hyperparameter tuning details and the library used for this.

Table 5: Shape of the datasets to be used in the LSTM autoencoder.
Train (batch, timesteps, feature) Test (batch, timesteps, feature) Validation (batch, timesteps, feature)
Shape (Dataset 1) (1286, 10, 28) (171, 10, 28) (171, 10, 28)
Shape (Dataset 2) (1286, 10, 17) (171, 10, 17) (171, 10, 17)
Table 6: Hyperparameter tuning details for LSTM autoencoder.
Parameter & Library Range Final Choice
Number of layers (tf.keras.layers.LSTM) [2, 3, 4, 5, 6, 7, 8] 5
Number of units (tf.keras.layers.LSTM) [1024, 512, 256, 128, 64, 32, 16, 8] [256, 128, 64, 32,16]
Activation function (tf.keras.layers.LSTM) [ReLU, Sigmoid, Tanh, Softmax] Tanh
Optimizer (tf.keras.optimizers) [Adam, SGD, RMSProp] Adam
Learning rate scheduler (tfa.optimizers) Exponential Decay, Polynomial Decay, Inverse Time Decay, Cosine Decay, Cyclical Learning Rate, None None
Batch size [16, 32, 64, 128, 256, 512] 32

Figure 4 represents the LSTM autoencoder architecture. The design uses a stacked LSTM structure, where each layer feeds its output into the subsequent layer as its input. Since it is an autoencoder, a mirrored architecture could be observed in the layers.

Refer to caption
Figure 4: Final LSTM autoencoder architecture.
Fully Connected (FC) Autoencoder

Like the LSTM autoencoder, the FC autoencoder incorporates hidden, fully connected layers in both its encoder and decoder parts. The hyperparameter tuning details for the FC autoencoder are detailed in Table 7, which shows the final model setup and the parameters explored. Like the LSTM autoencoder, the FC autoencoder also establishes mirrored architectures, formed with FC layers. Figure 5 represents the FC autoencoder architecture.

Table 7: Hyperparameter tuning details for FC autoencoder
Parameter Range Final Choice
Number of layers (tf.keras.layers.LSTM) [2, 3, 4, 5, 6, 7, 8] 5
Number of units (tf.keras.layers.LSTM) [1024, 512, 256, 128, 64, 32, 16, 8] [512, 256, 128, 64,32]
Activation function (tf.keras.layers.LSTM) [ReLU, Sigmoid, Tanh, Softmax] ReLU
Optimizer (tf.keras.optimizers) [Adam, SGD, RMSProp] Adam
Learning rate scheduler (tfa.optimizers) Exponential Decay, Polynomial Decay, Inverse Time Decay, Cosine Decay, Cyclical Learning Rate, None Cyclical Learning Rate
Batch size [16, 32, 64, 128, 256, 512] 128
Refer to caption
Figure 5: Final FC autoencoder architecture.

3.2.2 Anomaly Detection with Deep Autoencoders

This section delves into the anomaly detection methodology utilising the deep autoencoder models discussed in the preceding sections.

The choice of the two detection strategies is informed by their proven efficacy in the domain of anomaly detection, as highlighted in Section 2. The first approach, using latent features obtained from a deep autoencoder for clustering models, leverages the ability of autoencoders to reduce dimensionality and extract significant features from the data. This method has shown promise in various applications, including environmental monitoring and anomaly detection, as discussed by authors such as \citeAtian2014learning and \citeAma2022achieving. The clustering models used—isolation forest, local outlier factor (LOF), and one-class SVM—are well-established techniques for identifying outliers in high-dimensional data [Liu \BOthers. (\APACyear2012), Breunig \BOthers. (\APACyear2000), Schölkopf \BOthers. (\APACyear1999)].

The second approach, using reconstruction errors from a deep autoencoder to identify anomalies, exploits the inherent capability of autoencoders to learn the normal patterns in the data. When the autoencoder reconstructs data points, anomalies typically result in higher reconstruction errors, which can be flagged as outliers. This method has been successfully applied in various fields, including video surveillance and medical diagnostics, as demonstrated by researchers such as \citeAcheng2022resnet and \citeAesmaeili2023anomaly.

Implementation details of these two primary methods are explained in more detail in the following sections.

Anomaly Detection Based on Clustering through Latent Features.

In this case, the encoder segment of the FC autoencoder is employed to extract the latent features from the train and test datasets. These features are compressed from a 28-dimensional space down to an 8-dimensional one. An FC autoencoder is used, pre-trained on the train data (which consists of only non-wildfire cases). Specifically, to obtain latent features, the encoder component of the FC autoencoder is employed. The data dimension for both is reduced from 28 to 8. Then, latent train data is used to fit the cluster model. Three clustering techniques are adopted separately here: isolation forest, LOF, and one-class SVM. Finally, the prediction of the latent test data is done using the clustering model, and anomalies are identified as a result. Each clustering model has multiple parameters that require tuning.

Each method has unique characteristics that make it suitable for specific aspects of anomaly detection in high-dimensional data. The isolation forest method, which isolates anomalies rather than profiling normal instances, operates by constructing binary trees through random selection of features and split values, creating shorter paths for anomalies due to their rarity in the dataset, offering rapid anomaly detection efficiency [Liu \BOthers. (\APACyear2012)]. LOF measures the local density deviation of a data point relative to its neighbors, identifying anomalies in areas of varying density [Breunig \BOthers. (\APACyear2000)]. Lastly, one-class SVM is designed for scenarios where only the ‘normal’ class is present during training; it defines a decision boundary around this class, with deviations from this boundary treated as anomalies, making it commonly suitable for the preliminary phases of environmental anomaly detection of the unsupervised learning scenarios [Schölkopf \BOthers. (\APACyear1999)]. The incorporation of isolation forest, LOF, and one-class SVM in this study is designed to facilitate a comprehensive comparative analysis. By evaluating each method’s performance, the study aims to provide essential insights into their comparative strengths and weaknesses, thereby informing future research and practical applications in the field. To achieve this, all three methods are implemented concurrently, allowing for a direct and comprehensive comparison of their results.

Table 8 provides the parameters adjusted for each clustering model and the final choice based on the evaluation results. Only Dataset 1 is utilised for this approach.

Table 8: Hyperparameter tuning details for clustering models.
Parameter Range Final Choice
Isolation forest Number of estimators [50, 100, 200] 200
Max samples [0.5, 0.7, 0.9] 0.9
Contamination [‘auto’, 0.01, 0.02, 0.05, 0.1, 0.2,0.3, 0.4, 0.5] 0.5
LOF Number of neighbours [2, 3, 4, 5, 6, 7, 8, 10, 12, 15, 20] 8
Contamination [‘auto’, 0.01, 0.02, 0.05, 0.1, 0.2,0.3, 0.4, 0.5] 0.5
Distance metric [Euclidean, Manhattan, Minkowski] Manhattan
One class SVM Kernel [Linear, Polynomial, RBF, Sigmoid] Linear
Nu value [0.01, 0.05, 0.1, 0.2, 0.3,0.4,0.5,0.6] 0.6
Anomaly Detection Based on Reconstruction Error.

In this approach, both FC and LSTM autoencoders are employed independently, though the procedure for each remains the same. The hyperparameter tuning details for both FC and LSTM autoencoders are detailed in Table 9. The selected LSTM model is presented in table 10. This model uses a batch size of 32, Tanh activation functions for both the encoder, decoder, and Adam optimizer with LSTM layers [256, 128, 64, 32,16].

Table 9: FC and LSTM autoencoder final hyperparameter tuning range.
Parameter & Library Range
Activation function (tf.keras.layers.LSTM) [ReLU, Sigmoid, Tanh, Softmax]
Optimizer (tf.keras.optimizers) [Adam, SGD, RMSProp]
Learning rate scheduler (tfa.optimizers) Exponential Decay, Polynomial Decay, Inverse Time Decay, Cosine Decay, Cyclical Learning Rate, None
Batch size [16, 32, 64, 128, 256, 512]
Table 10: LSTM autoencoder model selected after hyperparameter tuning on the test data.
Batch Size Encoder Activation Function Decoder Activation Function Optimizer
32 Tanh Tanh Adam

Hyperparameter tuning was exclusively conducted for Dataset 1. Consequently, the same architecture of the chosen final model was also used when training Dataset 2. For the FC case, two models emerge as strong candidates for the final selection. These top-tier models will be referred to as Model A and Model B and are presented in table 11. Having the highest F1 scores, they indicate a balance between precision and recall and a decent overall classifier performance. It is decided to assess both and compare their results, before finalizing the choice. Each model is trained separately on both Dataset 1 and Dataset 2 (on the test data).

Table 11: FC autoencoder models selected after hyperparameter tuning on the test data.
Model Activation Function Batch Size Optimizer Learning Rate Scheduler
Model A ReLU 128 Adam Cyclical Learning Rate
Model B ReLU 32 RMSProp None
Training Process.

The training dataset only consists of non-wildfire instances, while both the test and validation datasets contain a mix of cases. After obtaining the best combination of parameters for the model and finalizing the autoencoder model, training is performed. During the training, validation data is used to observe the validation loss and errors for each iteration of the training process.

Initially, the train data is input into the FC/LSTM autoencoder, which has been previously trained on this data. The autoencoder then generates the reconstructed train data as its output. The reconstruction error, represented as the mean squared logarithmic error (MSLE), quantifies the deviation between the original and the reconstructed train data. Therefore, subsequently, MSLETrain values are calculated to calculate the difference between the original train data and reconstructed train data. Using the MSLETrain values, a threshold is set up based on the formula provided below, here termed as ThresholdTrain.

ThresholdTrain=μMSLE(Train)+2σMSLE(Train)subscriptThresholdTrainsubscript𝜇MSLETrain2subscript𝜎MSLETrain\mathrm{Threshold}_{\mathrm{Train}}=\mu_{\mathrm{MSLE}(\mathrm{Train})}+2% \sigma_{\mathrm{MSLE}(\mathrm{Train})}roman_Threshold start_POSTSUBSCRIPT roman_Train end_POSTSUBSCRIPT = italic_μ start_POSTSUBSCRIPT roman_MSLE ( roman_Train ) end_POSTSUBSCRIPT + 2 italic_σ start_POSTSUBSCRIPT roman_MSLE ( roman_Train ) end_POSTSUBSCRIPT (2)

For the procedure for test data reconstruction and anomaly detection, similarly, the test data is input into the same FC/LSTM autoencoder. The autoencoder then generates the reconstructed test data as its output. Subsequently, MSLETest values are calculated to calculate the difference between the original test data and reconstructed test data. Finally, in the classification phase, MSLETest values are compared with ThresholdTrain to assign anomalies. Basically, if MSLETest is higher than ThresholdTrain, then the data point is categorized as a wildfire, denoted by “1”. If not, it categorized as a non-wildfire, denoted by “0”, with the following formula:

FireTest={1,ThresholdTrainMSLETest0,ThresholdTrain>MSLETest\mathrm{Fire}_{\mathrm{Test}}=\left\{\begin{matrix}1,\mathrm{Threshold}_{% \mathrm{Train}}~{}{\leq}~{}\mathrm{MSLE}_{\mathrm{Test}}\\ ~{}0,\mathrm{Threshold}_{\mathrm{Train}}>\mathrm{MSLE}_{\mathrm{Test}}\end{% matrix}\right.roman_Fire start_POSTSUBSCRIPT roman_Test end_POSTSUBSCRIPT = { start_ARG start_ROW start_CELL 1 , roman_Threshold start_POSTSUBSCRIPT roman_Train end_POSTSUBSCRIPT ≤ roman_MSLE start_POSTSUBSCRIPT roman_Test end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL 0 , roman_Threshold start_POSTSUBSCRIPT roman_Train end_POSTSUBSCRIPT > roman_MSLE start_POSTSUBSCRIPT roman_Test end_POSTSUBSCRIPT end_CELL end_ROW end_ARG (3)
Performance Metrics

For both approaches, after getting the anomaly detection/classification results based on test data, the performance of the model on unseen data needs to be evaluated. For this, the model’s performance is evaluated against five metrics: Accuracy, precision, recall, F1-score, and Matthew’s coefficient (MCC). The following equations present formulas for calculating these metrics, where TP = True Positive, TN = True Negative, FP= False Positive, and FN = False Negative:

Accuracy=𝑇𝑃+𝑇𝑁P+NAccuracy𝑇𝑃𝑇𝑁𝑃𝑁\mathrm{Accuracy}=\frac{\mathit{TP}+\mathit{TN}}{P+N}roman_Accuracy = divide start_ARG italic_TP + italic_TN end_ARG start_ARG italic_P + italic_N end_ARG (4)
Precision=𝑇𝑃𝑇𝑃+𝐹𝑃Precision𝑇𝑃𝑇𝑃𝐹𝑃\mathrm{Precision}=\frac{\mathit{TP}}{\mathit{TP}+\mathit{FP}}roman_Precision = divide start_ARG italic_TP end_ARG start_ARG italic_TP + italic_FP end_ARG (5)
Recall=𝑇𝑃𝑇𝑃+𝐹𝑁Recall𝑇𝑃𝑇𝑃𝐹𝑁\mathrm{Recall}=\frac{\mathit{TP}}{\mathit{TP}+\mathit{FN}}roman_Recall = divide start_ARG italic_TP end_ARG start_ARG italic_TP + italic_FN end_ARG (6)
F1=2(Precision×Recall)(Precision+Recall)F12PrecisionRecallPrecisionRecall\mathrm{F1}=2\frac{(\mathrm{Precision}\times\mathrm{Recall})}{(\mathrm{% Precision}+\mathrm{Recall})}F1 = 2 divide start_ARG ( roman_Precision × roman_Recall ) end_ARG start_ARG ( roman_Precision + roman_Recall ) end_ARG (7)
Matthewscoefficient(MCC)=TP×TNFP×FN(TP+FP)(TP+FN)(TN+FP)(TN+FN)superscriptMatthewscoefficientMCCTPTNFPFNTPFPTPFNTNFPTNFN\mathrm{Matthew^{\prime}s\ coefficient\ (MCC)}=\frac{\mathrm{TP}\times\mathrm{% TN}-\mathrm{FP}\times\mathrm{FN}}{\sqrt{(\mathrm{TP}+\mathrm{FP})(\mathrm{TP}+% \mathrm{FN})(\mathrm{TN}+\mathrm{FP})(\mathrm{TN}+\mathrm{FN})}}roman_Matthew start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT roman_s roman_coefficient ( roman_MCC ) = divide start_ARG roman_TP × roman_TN - roman_FP × roman_FN end_ARG start_ARG square-root start_ARG ( roman_TP + roman_FP ) ( roman_TP + roman_FN ) ( roman_TN + roman_FP ) ( roman_TN + roman_FN ) end_ARG end_ARG (8)

A confusion matrix for each model [Demir (\APACyear2022)], based on test data, was also produced. Lastly, a ROC curve plot and AUC are plotted and calculated. A ROC curve plots the TP rate in comparison to the FP rate. These values shift relative to one another. Essentially, when the TP rate peaks, the FP rate is at its minimum, and vice versa [Maxion \BBA Roberts (\APACyear2004)]. AUC is a key metric for assessing a binary classifier’s performance. It ranges from 0.5 (equivalent to random guessing) to 1.0 (a flawless classifier). AUC evaluates the effectiveness of score classifiers by considering all potential classification thresholds. [Melo (\APACyear2013)].

4 Results

4.1 Anomaly Detection Based on Clustering through Latent Features

In this section, two different models and their results will be presented for each cluster model type (Isolation Forest, LOF, and one-class SVM). While the first one is the best model according to the hyperparameter tuning, the other one will have different parameters for the sake of the comparison of the parameters and performance.

4.1.1 Isolation Forest

Table 12: Parameter information and results for two models of isolation forest.
Model Number of estimators Max samples Contami-nation Accuracy Precision Recall F1-score MCC
Model (A) 100 0.9 0.5 0.612 0.664 0.675 0.67 0.242
Model (B) 100 0.9 0.2 0.536 0.717 0.337 0.459 0.264
Refer to caption
Figure 6: Confusion matrix comparison for Model (A) and (B) for isolation forest.

Based on Table 12, Model (A) assumes half of the data is anomalous (0.5), while Model (B) only assumes 0.2 of the data is anomalous due to the contamination parameter. This discrepancy leads to a significant difference in their performance.

Figure 6 illustrates Model (A) has a better balance between precision and recall, resulting in a higher F1-score of 67%. In contrast, Model (B) has a higher precision of 71.7%, suggesting that among the anomalies it detected, a greater percentage were true anomalies. However, its recall is considerably low at 33.7%, which means it failed to capture a significant proportion (around 66%) of the true anomalies present in the dataset.

4.1.2 LOF

Table 13: Parameter information and results for two models of LOF
Model Number of neighbours Contami-nation Distance Metric Accuracy Precision Recall F1-score MCC
Model (C) 20 0.5 Manhattan 0.65 0.677 0.766 0.719 0.329
Model (D) 12 0.3 Manhattan 0.631 0.72 0.6 0.655 0.361
Refer to caption
Figure 7: Confusion matrix comparison for Model (C) and (D) for LOF

In 13 and Figure 7, it is seen that Model (C) outperformed in recall (76.6% vs 60%) and F1-score (71.9% vs 65.5%), indicating its superiority in detecting real anomalies. In contrast, while Model (D) has a higher precision of 72%, it struggles to identify all real anomalies, as revealed by its low recall of 60%.

While drawing definitive conclusions on the influence of the number of neighbours from just two models is challenging, there is a trend: models with better accuracies consistently have a higher number of neighbours in LOF. Similarly with isolation forest, the contamination parameter significantly drops LOF’s performance. It typically reflects the proportion of outliers in the dataset and was highly influential for both isolation forest and LOF models.

4.1.3 One-Class SVM

Table 14: Parameter information and results for two models of one-class SVM
Model Kernel Nu value Accuracy Precision Recall F1-score MCC
Model (E) RBF 0.6 0.554 0.608 0.662 0.634 0.074
Model (F) RBF 0.3 0.482 0.604 0.322 0.42 0.032
Refer to caption
Figure 8: Confusion matrix comparison for Model (E) and (F) for one-class SVM.

Both models, (E) and (F), employ the RDF kernel, allowing us to compare the performance metrics due to the variations in the Nu parameter Table 14. The difference in the Nu values significantly impacts the models’ performances. Model (E), with a Nu value of 0.6, exhibits better overall performance across all metrics than Model (F), which has a Nu value of 0.3. Specifically, Model (F)’s performance lags, especially in terms of recall (32.2%), which is considerably lower than Model (E)’s recall (66.2%) as in Figure 8.

4.1.4 Visualization of the Results

Figure 9 represents the anomaly detection results of all three techniques on 3D maps with PCA features. PCA features are created here just for visualisation purposes. The actual clustering, fitting, and prediction processes utilise the original latent data generated through an autoencoder, detailed in Section 3.2.2. Latent test data, which is 8-dimensional, is reduced into 3-dimensional data using PCA so that it is possible to visualise the whole data in a 3D dimension plot instead of randomly choosing three variables.

Refer to caption
Figure 9: Anomaly detection results of isolation forest on 3D maps (PCA is used for only visualization purposes) for Model (A) and (B) (top row), (C) and (D) (middle row), and (E) and (F) (bottom row).

Upon reviewing Figure 9, it is clear that anomaly and normal data points do not lie in the different spaces, which makes the clustering approach challenging. Furthermore, the significant reliance of these models on specific parameters like contamination and Nu is evident, as indicated in previous sections. While these approaches showed initial promise, they might not be the most suitable choice for imbalanced datasets like this one.

4.2 Anomaly Detection Based on Reconstruction Error

4.2.1 LSTM autoencoder

The model definition used for the training was presented in Table 10. Both datasets training underwent training for 200 epochs, incorporating an early stopping mechanism to mitigate overfitting. Figure 10 displays the training and validation loss plots of the LSTM autoencoder for both Dataset 1 and Dataset 2.

Refer to caption
Figure 10: LSTM Autoencoder training and validation loss plots. Dataset 1 on the left, Dataset 2 on the right.

For Dataset 1, although the initial epochs show fluctuations, the model seems to stabilize and converge without any evident signs of overfitting or underfitting. Both the training and validation losses exhibit a consistent declining trend across the 200 epochs in Figure 10. Early stopping is not triggered, suggesting that extending the training might further reduce validation loss. For Dataset 2, the loss plots appear less consistent. Around the 125th epoch, there is a sudden spike in loss followed by a rapid decline. Even as training continues to the 175th epoch, the model does not attain much additional learning. Table 15 presents the obtained performance metrics of models on the test data for Dataset 1 and Dataset 2.

Table 15: Two LSTM autoencoder models based on Dataset 1 and Dataset 2, on test data.
Model Accuracy Precision Recall F1-score MCC
LSTM Autoencoder (Dataset 1) 0.563 0.581 0.897 0.705 -0.013
LSTM Autoencoder (Dataset 2) 0.457 0.526 0.692 0.598 -0.20
Refer to caption
Figure 11: Confusion matrices of LSTM autoencoder for Dataset 1 and Dataset 2.

From Table 15 and Figure 11, it is observed that the model trained on Dataset 1 demonstrates better performance in Accuracy, Precision, Recall, and F1-score compared to the model trained on Dataset 2, which retains only 17 variables. However, an important point for both models is that MCC is very low. While the first model’s MCC is closer to zero, suggesting a near-random prediction, the second model’s MCC is significantly lower, indicating a slightly worse prediction than random guessing. Analysing the ROC curve plots in Figure 12, the second model exhibits a better AUC value of 0.61 than the first model with 0.51. However, one could say that this type of ROC curve plot is not desirable.

Refer to caption
Figure 12: ROC curve plots of LSTM autoencoder Dataset 1 and Dataset 2.

In conclusion, while the first model, trained on Dataset 1, outperforms traditional performance metrics, the second model’s ROC curve seems better. Nevertheless, the overall quality of both models is not very promising, given that an AUC close to 0.5 implies nearly random prediction. However, one can note that this reflects the inherent difficulty in predicting wildfires with limited historical data.

4.2.2 FC Autoencoder

The models A and B trained in this scenario were presented in Table 11. Each model’s training process spanned 400 epochs, with an early stopping (patience criteria as 20) integrated to avoid potential overfitting.

Refer to caption
Figure 13: FC Autoencoder (Model A) training and validation loss plots. Dataset 1 on the left, Dataset 2 on the right.

Based on Figure 13, the FC Autoencoder Model A demonstrates steady convergence in its training and validation loss plots for both Dataset 1 and Dataset 2. This smooth progression suggests effective learning without erratic fluctuations, which is favourable. However, the early stopping could have been triggered earlier. This might indicate that the early stopping criteria, the patience parameter, which is 20, in this case, were set too early, which allowed the model to train longer than necessary without significant performance gains.

Refer to caption
Figure 14: FC Autoencoder (Model B) training and validation loss plots. Dataset 1 on the left, Dataset 2 on the right.

In Figure 14, the FC Autoencoder Model B shows a less consistent convergence than Model A across both datasets. Notable fluctuations in loss are observed, with early stopping triggered around the 120th epoch for Dataset 1 and the 100th epoch for Dataset 2. Additionally, a significant gap between training and validation losses in Dataset 1 suggests potential overfitting, contrasting with the performance observed in Model B.

Table 16 represents the obtained performance metrics of Models A and B on the test data of Dataset 1 and Dataset 2. Upon analysing the performance metrics from Table 16 and confusion matrices in Figures 15 and 16, several insights emerge regarding the efficacy of Models A and B across Datasets 1 and 2. For Dataset 1, both models show relatively similar performance metrics. However, a key distinction arises in the recall, where Model B has a noticeable advantage with 71.4% while Model A’s recall is 68%. On the contrary, the MCC value of Model A is slightly higher than Model B, which may be due to its use of the Cyclical Learning Rate scheduler, enabling more refined learning rate adjustments. For Dataset 2, although both models showcase very close precision, Model B compensates again with a better recall, offering 60.7% against Model A’s 55.5%. Model B also marginally outperforms Model A in accuracy and MCC.

Table 16: Model A and B performance metrics on both test data of Dataset 1 and Dataset 2.
Model Accuracy Precision Recall F1-score MCC
Model A (Dataset 1) 0.712 0.797 0.68 0.733 0.432
Model A (Dataset 2) 0.678 0.838 0.555 0.667 0.41
Model B (Dataset 1) 0.711 0.773 0.714 0.742 0.417
Model B (Dataset 2) 0.695 0.823 0.607 0.698 0.423
Refer to caption
Figure 15: Confusion matrices of FC Autoencoder (Model A) on the test data. Dataset 1 on the left, Dataset 2 on the right.
Refer to caption
Figure 16: Confusion matrices of FC Autoencoder (Model B) on the test data. Dataset 1 on the left, Dataset 2 on the right.
Refer to caption
Figure 17: ROC curve plots of FC Autoencoder (Model A). Dataset 1 on the left, Dataset 2 on the right.
Refer to caption
Figure 18: ROC curve plots of FC Autoencoder (Model B). Dataset 1 on the left, Dataset 2 on the right.

Figures 17 and 18 display the ROC curve plots for Models A and B across both Dataset 1 and Dataset 2 (using test data). All these plots fall within the 77% to 78% range, making it difficult to draw meaningful comparisons between the models and datasets. Nonetheless, the near-80% performance indicates that the models consistently show quite promising results across both datasets.

Analysing the influence of datasets on model performance reveals some significant insights. Both models demonstrate strong performance metrics even when applied to Dataset 2, which has been reduced to 17 variables. There is only a slight decrease in metrics when the models transition from Dataset 1 to Dataset 2. An interesting observation is that Model B’s accuracy on Dataset 2 drops slightly from 71.1% to 69.5%, yet its AUC remains consistent at 78%. This consistency is particularly impressive given that the models’ hyperparameters were specifically tuned for Dataset 1. Naturally, one might expect better results on Dataset 1 because of this tuning. However, the minor decline in performance on Dataset 2 suggests that this compact dataset effectively integrates the most influential variables.

In a direct comparison of results, the standout models still come from Dataset 1. Model A outperforms precision and MCC, whereas Model B is better in recall and F1-score. Given the importance of accurately detecting wildfires, a focus on recall becomes crucial. Hence, in such a scenario, Model B, with its superior recall, may be the more advantageous choice.

5 Discussion

In this research, two different unsupervised learning approaches to detect wildfires in Australia with deep autoencoders have been implemented successfully.

The first approach was anomaly detection based on clustering through latent features using deep autoencoders. Here, firstly an FC autoencoder is trained and applied to reduce the data dimension from 28 to 8. Then, this latent data is fed into three different clustering methods (isolation forest, LOF, and one-class SVM) to spot outliers. This approach presented several challenges. First, the performance metrics of all three methods were closely dependent on specific parameters, especially the contamination and Nu values. For instance, the recall value for the isolation forest model dropped sharply from 67.5% to 33.7% when contamination was decreased from 0.5 tom 0.2. Adjusting contamination and Nu values, while helpful, may only serve as a specific short-term solution. In addition, considering that these techniques are implemented in an “unsupervised learning” way, it is important to remember that a real-life scenario will not have ground truth labelled (wildfire or non-wildfire) data available for parameter adjustment. Another challenge related to how the anomaly data spread out. Thanks to 3D map visualisations done using PCA components, it was observed that anomaly data outliers did not cluster separately from the usual normal data points. Consequently, due to these challenges and results, clustering techniques alone were not sufficient, robust, or efficient in distinguishing anomalies in this wildfire dataset. This realisation triggered the exploration of alternative strategies, guiding us to the second approach: anomaly detection using deep autoencoders, primarily based on reconstruction error.

The second approach involved using deep autoencoders to detect anomalies by using the reconstruction error. The approach involved training the deep autoencoder on normal data points and then using this trained model on the reconstruction of test data. Anomalies have higher reconstruction errors while normal data points have low errors. Two different deep autoencoder type were implemented: LSTM and FC.

In the LSTM autoencoder case, a final model architecture was determined after hyperparameter tuning and was applied to both Dataset 1 and Dataset 2. The model trained on Dataset 1 showcased better performance metrics with an accuracy of 56.3%, whereas the model trained on Dataset 2 managed only 45.7%. A consistent issue with both models was their tendency to misclassify non-wildfire cases, often labelling them as wildfires. Ultimately, the performance of both models is not sufficient, as their AUC values close to 0.5 suggest nearly random predictions. Our initial assumption was that utilizing LSTM layers would be advantageous given the time series nature of the data, hoping to capture temporal information. However, this assumption did not hold true in practice. Potential reasons include the short sequence span of 10 days, which might not provide enough temporal context. Additionally, the daily aggregation of the data could have lost valuable temporal patterns that LSTM layers might have captured. Lastly, one can note that low values of accuracy are expected given the challenging nature of the data and the lack of historical ground truth. Eventually, the goal of this study is not to filter out methods with poorer performance but to provide a comprehensive comparison.

The final modelling phase employed an FC autoencoder. Two promising models obtained from hyperparameter tuning were utilized on both Dataset 1 and Dataset 2. The first model leveraged the Adam optimizer with a learning rate scheduler, while the second relied on RMSProp without any scheduler. Despite the compact 17-feature dimension of Dataset 2, the models maintained strong metrics on this dataset as well. However, Dataset 1’s models did perform better eventually. Most impressively, the top model - using RMSProp without any scheduler - recorded an accuracy of 71.1%, an F1-score of 74.2%, and an MCC of 0.417. These figures are significantly higher than what is achieved with earlier methods, highlighting the effectiveness of this approach and the use of FC layers in deep autoencoders.

6 Conclusion

In environmental science and disaster prevention, detection, and prediction of wildfires in time, especially in vulnerable regions such as Australia, remains a crucial concern. This research successfully employed several unsupervised learning techniques, in the context of anomaly detection with deep autoencoders, to predict wildfires in Australia. This approach fills a gap in the field of wildfire prediction.

Our research explored two primary methods. The first method focused on anomaly detection using latent features extracted from a deep autoencoder and demonstrated initial promising results. Utilizing the FC autoencoder enabled significant dimensionality reduction, and when paired with clustering techniques like isolation forest, LOF, and one-class SVM, yielded some encouraging results. However, while these methods showed potential, they were somewhat sensitive to specific parameters, which could affect their reliability in real-life scenarios without labels. Furthermore, the fact that anomaly data is not scattered as a different cluster, challenged these techniques even more.

The second method, using anomaly detection through reconstruction error, presented a clearer potential. The LSTM autoencoder, despite its theoretical applicability to time-series data, failed to yield the expected outcomes. Yet, the FC autoencoder showcased impressive results. The final FC autoencoder outperformed its counterparts, achieving an accuracy of 71.1%, an F1-score of 74.2%, and an MCC of 0.417 on Dataset 1.

This work proposed wildfire prediction by employing unsupervised learning approaches without the need for ground truth data of historical wildfires. Our findings demonstrate the feasibility of these methodologies in complex real-life scenarios linked to wildfire detection and filling a significant gap in the literature. The use of deep autoencoders for wildfire prediction shows the effectiveness of the method. Given the critical role of forests in the global ecosystem and the pressing need for efficient wildfire detection, this study aims to present a meaningful and practical solution in wildfire prediction, reinforcing the critical bridge between nature’s preservation and technological innovation using AI.

Several recommendations could be given for future work for researchers working in this area. To begin, while the LSTM autoencoder used in this research was limited to 10-day sequences due to computational limitations, longer sequences, such as weekly or monthly intervals, may improve its performance. Second, given the dataset’s temporal nature, Bi-LSTM autoencoders, which can capture bidirectional temporal information, might be explored as an alternative to the traditional LSTM. Furthermore, there is an opportunity for more extensive hyperparameter tuning to discover possible alternative optimum model configurations. Modification of the decision rule can provide a probability of wildfire instead of a detection outcome. Finally, given the encouraging improvements in the autoencoder area, experimenting with Variational Autoencoders (VAEs) might be a promising next step. These proposals can help to refine and improve the predictive ability of the models in the wildfire prediction domain using unsupervised ML and extend the applicability of the wildfire prediction methods to a global scale.

Open Research Section

The main data source is the dataset released by IBM for the Call for Code Spot Challenge for Wildfires competition [IBM (\APACyear2020)]. It is available at https://github.com/Call-for-Code/Spot-Challenge-Wildfires

The specific elements from the previous dataset used in this article have been processed from the following original sources: ERA5-Land hourly data from 1950 to present dataset [Muñoz Sabater (\APACyear2019)] available at the Copernicus Climate Change Service (C3S) Climate Data Store (CDS) https://cds.climate.copernicus.eu/cdsapp#!/dataset/reanalysis-era5-land ; MOD13Q1 MODIS/Terra Vegetation Indices 16-Day L3 Global 250m SIN Grid V006 [Didan (\APACyear2015)] available at NASA EOSDIS Land Processes Distributed Active Archive Center https://lpdaac.usgs.gov/products/mod13q1v006/ ; and [\APACcitebtitleMODIS/Aqua+Terra Thermal Anomalies/Fire locations 1km FIRMS V0061 NRT (\APACyear2020)] available at NASA FIRMS https://www.earthdata.nasa.gov/learn/find-data/near-real-time/firms/mcd14dl-nrt

References

  • Abadi \BOthers. (\APACyear2016) \APACinsertmetastarabadi_tensorflow:_2016{APACrefauthors}Abadi, M., Barham, P., Chen, J., Chen, Z., Davis, A., Dean, J.\BDBLZheng, X.  \APACrefYearMonthDay2016\APACmonth05. \APACrefbtitleTensorFlow: A system for large-scale machine learning. TensorFlow: A system for large-scale machine learning. \APACaddressPublisherarXiv. \APACrefnotearXiv:1605.08695 [cs] \PrintBackRefs\CurrentBib
  • Abid (\APACyear2021) \APACinsertmetastarabid2021survey{APACrefauthors}Abid, F.  \APACrefYearMonthDay2021. \BBOQ\APACrefatitleA survey of machine learning algorithms based forest fires prediction and detection systems A survey of machine learning algorithms based forest fires prediction and detection systems.\BBCQ \APACjournalVolNumPagesFire technology572559–590. \PrintBackRefs\CurrentBib
  • Aggarwal \BBA Reddy (\APACyear2013) \APACinsertmetastaraggarwal2013data{APACrefauthors}Aggarwal, C\BPBIC.\BCBT \BBA Reddy, C\BPBIK.  \APACrefYearMonthDay2013. \BBOQ\APACrefatitleData Clustering: Algorithms and Applications, ser Data clustering: Algorithms and applications, ser.\BBCQ \APACjournalVolNumPagesData Mining and Knowledge Discovery Series. CRC Press/Taylor & Francis Group. \PrintBackRefs\CurrentBib
  • Arif \BOthers. (\APACyear2021) \APACinsertmetastararif2021role{APACrefauthors}Arif, M., Alghamdi, K., Sahel, S., Alosaimi, S., Alsahaft, M., Alharthi, M.\BCBL \BBA Arif, M.  \APACrefYearMonthDay2021. \BBOQ\APACrefatitleRole of machine learning algorithms in forest fire management: A literature review Role of machine learning algorithms in forest fire management: A literature review.\BBCQ \APACjournalVolNumPagesJ. Robot. Autom5212–226. \PrintBackRefs\CurrentBib
  • Arpaci \BOthers. (\APACyear2014) \APACinsertmetastararpaci2014using{APACrefauthors}Arpaci, A., Malowerschnig, B., Sass, O.\BCBL \BBA Vacik, H.  \APACrefYearMonthDay2014. \BBOQ\APACrefatitleUsing multi variate data mining techniques for estimating fire susceptibility of Tyrolean forests Using multi variate data mining techniques for estimating fire susceptibility of tyrolean forests.\BBCQ \APACjournalVolNumPagesApplied Geography53258–270. \PrintBackRefs\CurrentBib
  • Bank \BOthers. (\APACyear2020) \APACinsertmetastarbank2020autoencoders{APACrefauthors}Bank, D., Koenigstein, N.\BCBL \BBA Giryes, R.  \APACrefYearMonthDay2020. \BBOQ\APACrefatitleAutoencoders. arXiv Autoencoders. arxiv.\BBCQ \APACjournalVolNumPagesarXiv preprint arXiv:2003.05991. \PrintBackRefs\CurrentBib
  • Borchers Arriagada \BOthers. (\APACyear2020) \APACinsertmetastarborchers2020unprecedented{APACrefauthors}Borchers Arriagada, N., Palmer, A\BPBIJ., Bowman, D\BPBIM., Morgan, G\BPBIG., Jalaludin, B\BPBIB.\BCBL \BBA Johnston, F\BPBIH.  \APACrefYearMonthDay2020. \BBOQ\APACrefatitleUnprecedented smoke-related health burden associated with the 2019–20 bushfires in eastern Australia Unprecedented smoke-related health burden associated with the 2019–20 bushfires in eastern australia.\BBCQ \APACjournalVolNumPagesMedical Journal of Australia2136282–283. \PrintBackRefs\CurrentBib
  • Breunig \BOthers. (\APACyear2000) \APACinsertmetastarbreunig2000lof{APACrefauthors}Breunig, M\BPBIM., Kriegel, H\BHBIP., Ng, R\BPBIT.\BCBL \BBA Sander, J.  \APACrefYearMonthDay2000. \BBOQ\APACrefatitleLOF: identifying density-based local outliers Lof: identifying density-based local outliers.\BBCQ \BIn \APACrefbtitleProceedings of the 2000 ACM SIGMOD international conference on Management of data Proceedings of the 2000 acm sigmod international conference on management of data (\BPGS 93–104). \PrintBackRefs\CurrentBib
  • Canada (\APACyear\bibnodate) \APACinsertmetastarCanada{APACrefauthors}Canada, N\BPBIR.  \APACrefYearMonthDay\bibnodate. \APACrefbtitleCanadian wildland fire information system. Canadian wildland fire information system. {APACrefURL} https://cwfis.cfs.nrcan.gc.ca/home \PrintBackRefs\CurrentBib
  • Chang \BOthers. (\APACyear2020) \APACinsertmetastarchang2020landslide{APACrefauthors}Chang, Z., Du, Z., Zhang, F., Huang, F., Chen, J., Li, W.\BCBL \BBA Guo, Z.  \APACrefYearMonthDay2020. \BBOQ\APACrefatitleLandslide susceptibility prediction based on remote sensing images and GIS: Comparisons of supervised and unsupervised machine learning models Landslide susceptibility prediction based on remote sensing images and gis: Comparisons of supervised and unsupervised machine learning models.\BBCQ \APACjournalVolNumPagesRemote Sensing123502. \PrintBackRefs\CurrentBib
  • D. Cheng \BOthers. (\APACyear2022) \APACinsertmetastarcheng2022resnet{APACrefauthors}Cheng, D., Fan, Y., Fang, S., Wang, M.\BCBL \BBA Liu, H.  \APACrefYearMonthDay2022. \BBOQ\APACrefatitleResNet-AE for Radar Signal Anomaly Detection Resnet-ae for radar signal anomaly detection.\BBCQ \APACjournalVolNumPagesSensors22166249. \PrintBackRefs\CurrentBib
  • Z. Cheng \BOthers. (\APACyear2021) \APACinsertmetastarcheng2021improved{APACrefauthors}Cheng, Z., Wang, S., Zhang, P., Wang, S., Liu, X.\BCBL \BBA Zhu, E.  \APACrefYearMonthDay2021. \BBOQ\APACrefatitleImproved autoencoder for unsupervised anomaly detection Improved autoencoder for unsupervised anomaly detection.\BBCQ \APACjournalVolNumPagesInternational Journal of Intelligent Systems36127103–7125. \PrintBackRefs\CurrentBib
  • Coca \BBA Datcu (\APACyear2021) \APACinsertmetastarcoca2021anomaly{APACrefauthors}Coca, M.\BCBT \BBA Datcu, M.  \APACrefYearMonthDay2021. \BBOQ\APACrefatitleAnomaly detection in post fire assessment Anomaly detection in post fire assessment.\BBCQ \BIn \APACrefbtitle2021 IEEE International Geoscience and Remote Sensing Symposium IGARSS 2021 ieee international geoscience and remote sensing symposium igarss (\BPGS 8620–8623). \PrintBackRefs\CurrentBib
  • Demir (\APACyear2022) \APACinsertmetastardemir_deep_2022{APACrefauthors}Demir, F.  \APACrefYearMonthDay2022. \BBOQ\APACrefatitleDeep autoencoder-based automated brain tumor detection from MRI data Deep autoencoder-based automated brain tumor detection from MRI data.\BBCQ \BIn \APACrefbtitleArtificial Intelligence-Based Brain-Computer Interface Artificial Intelligence-Based Brain-Computer Interface (\BPGS 317–351). \APACaddressPublisherElsevier. \PrintBackRefs\CurrentBib
  • Didan (\APACyear2015) \APACinsertmetastarndvi{APACrefauthors}Didan, K.  \APACrefYearMonthDay2015. \APACrefbtitleMOD13Q1 MODIS/Terra Vegetation Indices 16-Day L3 Global 250m SIN Grid V006 Mod13q1 modis/terra vegetation indices 16-day l3 global 250m sin grid v006 [dataset]. \APACaddressPublisherNASA EOSDIS Land Processes Distributed Active Archive Center. {APACrefURL} https://lpdaac.usgs.gov/products/mod13q1v006/ {APACrefDOI} 10.5067/MODIS/MOD13Q1.006 \PrintBackRefs\CurrentBib
  • Esmaeili \BOthers. (\APACyear2023) \APACinsertmetastaresmaeili2023anomaly{APACrefauthors}Esmaeili, F., Cassie, E., Nguyen, H\BPBIP\BPBIT., Plank, N\BPBIO., Unsworth, C\BPBIP.\BCBL \BBA Wang, A.  \APACrefYearMonthDay2023. \BBOQ\APACrefatitleAnomaly Detection for Sensor Signals Utilizing Deep Learning Autoencoder-Based Neural Networks Anomaly detection for sensor signals utilizing deep learning autoencoder-based neural networks.\BBCQ \APACjournalVolNumPagesBioengineering104405. \PrintBackRefs\CurrentBib
  • Gholamnia \BOthers. (\APACyear2020) \APACinsertmetastargholamnia2020comparisons{APACrefauthors}Gholamnia, K., Gudiyangada Nachappa, T., Ghorbanzadeh, O.\BCBL \BBA Blaschke, T.  \APACrefYearMonthDay2020. \BBOQ\APACrefatitleComparisons of diverse machine learning approaches for wildfire susceptibility mapping Comparisons of diverse machine learning approaches for wildfire susceptibility mapping.\BBCQ \APACjournalVolNumPagesSymmetry124604. \PrintBackRefs\CurrentBib
  • Haar \BOthers. (\APACyear2019) \APACinsertmetastarhaar2019comparison{APACrefauthors}Haar, L., Anding, K., Trambitckii, K.\BCBL \BBA Notni, G.  \APACrefYearMonthDay2019. \BBOQ\APACrefatitleComparison between Supervised and Unsupervised Feature Selection Methods. Comparison between supervised and unsupervised feature selection methods.\BBCQ \BIn \APACrefbtitleICPRAM Icpram (\BPGS 582–589). \PrintBackRefs\CurrentBib
  • Hasan \BOthers. (\APACyear2016) \APACinsertmetastarhasan2016learning{APACrefauthors}Hasan, M., Choi, J., Neumann, J., Roy-Chowdhury, A\BPBIK.\BCBL \BBA Davis, L\BPBIS.  \APACrefYearMonthDay2016. \BBOQ\APACrefatitleLearning temporal regularity in video sequences Learning temporal regularity in video sequences.\BBCQ \BIn \APACrefbtitleProceedings of the IEEE conference on computer vision and pattern recognition Proceedings of the ieee conference on computer vision and pattern recognition (\BPGS 733–742). \PrintBackRefs\CurrentBib
  • IBM (\APACyear2020) \APACinsertmetastarIBM{APACrefauthors}IBM.  \APACrefYearMonthDay2020. \APACrefbtitleCall for Code Spot Challenge for Wildfires competition dataset Call for code spot challenge for wildfires competition dataset [dataset]. \APACaddressPublisherhttps://github.com/Call-for-Code/Spot-Challenge-Wildfires. \PrintBackRefs\CurrentBib
  • Jain \BOthers. (\APACyear2020) \APACinsertmetastarjain2020review{APACrefauthors}Jain, P., Coogan, S\BPBIC., Subramanian, S\BPBIG., Crowley, M., Taylor, S.\BCBL \BBA Flannigan, M\BPBID.  \APACrefYearMonthDay2020. \BBOQ\APACrefatitleA review of machine learning applications in wildfire science and management A review of machine learning applications in wildfire science and management.\BBCQ \APACjournalVolNumPagesEnvironmental Reviews284478–505. \PrintBackRefs\CurrentBib
  • Kondylatos \BOthers. (\APACyear2022) \APACinsertmetastarkondylatos2022wildfire{APACrefauthors}Kondylatos, S., Prapas, I., Ronco, M., Papoutsis, I., Camps-Valls, G., Piles, M.\BDBLCarvalhais, N.  \APACrefYearMonthDay2022. \BBOQ\APACrefatitleWildfire danger prediction and understanding with Deep Learning Wildfire danger prediction and understanding with deep learning.\BBCQ \APACjournalVolNumPagesGeophysical Research Letters4917e2022GL099368. \PrintBackRefs\CurrentBib
  • Kucharski \BOthers. (\APACyear2020) \APACinsertmetastarkucharski_semi-supervised_2020{APACrefauthors}Kucharski, D., Kleczek, P., Jaworek-Korjakowska, J., Dyduch, G.\BCBL \BBA Gorgon, M.  \APACrefYearMonthDay2020\APACmonth03. \BBOQ\APACrefatitleSemi-supervised nests of melanocytes segmentation method using convolutional autoencoders Semi-supervised nests of melanocytes segmentation method using convolutional autoencoders.\BBCQ \APACjournalVolNumPagesSensors2061546. \PrintBackRefs\CurrentBib
  • Lehr \BOthers. (\APACyear2021) \APACinsertmetastarlehr2021supervised{APACrefauthors}Lehr, J., Philipps, J., Hoang, V\BPBIN., von Wrangel, D.\BCBL \BBA Krüger, J.  \APACrefYearMonthDay2021. \BBOQ\APACrefatitleSupervised learning vs. unsupervised learning: A comparison for optical inspection applications in quality control Supervised learning vs. unsupervised learning: A comparison for optical inspection applications in quality control.\BBCQ \BIn \APACrefbtitleIOP Conference Series: Materials Science and Engineering Iop conference series: Materials science and engineering (\BVOL 1140, \BPG 012049). \PrintBackRefs\CurrentBib
  • Li \BOthers. (\APACyear2021) \APACinsertmetastarli2021deep{APACrefauthors}Li, T., Wang, Z., Liu, S.\BCBL \BBA Lin, W\BHBIY.  \APACrefYearMonthDay2021. \BBOQ\APACrefatitleDeep unsupervised anomaly detection Deep unsupervised anomaly detection.\BBCQ \BIn \APACrefbtitleProceedings of the IEEE/CVF winter conference on applications of computer vision Proceedings of the ieee/cvf winter conference on applications of computer vision (\BPGS 3636–3645). \PrintBackRefs\CurrentBib
  • Liu \BOthers. (\APACyear2012) \APACinsertmetastarliu2012isolation{APACrefauthors}Liu, F\BPBIT., Ting, K\BPBIM.\BCBL \BBA Zhou, Z\BHBIH.  \APACrefYearMonthDay2012. \BBOQ\APACrefatitleIsolation-based anomaly detection Isolation-based anomaly detection.\BBCQ \APACjournalVolNumPagesACM Transactions on Knowledge Discovery from Data (TKDD)611–39. \PrintBackRefs\CurrentBib
  • Ma (\APACyear2022) \APACinsertmetastarma2022achieving{APACrefauthors}Ma, H.  \APACrefYearMonthDay2022. \BBOQ\APACrefatitleAchieving deep clustering through the use of variational autoencoders and similarity-based loss Achieving deep clustering through the use of variational autoencoders and similarity-based loss.\BBCQ \APACjournalVolNumPagesMathematical Biosciences and Engineering191010344–10360. \PrintBackRefs\CurrentBib
  • Malik \BOthers. (\APACyear2021) \APACinsertmetastarmalik2021wildfire{APACrefauthors}Malik, A., Jalin, N., Rani, S., Singhal, P., Jain, S.\BCBL \BBA Gao, J.  \APACrefYearMonthDay2021. \BBOQ\APACrefatitleWildfire Risk Prediction and Detection using Machine Learning in San Diego, California Wildfire risk prediction and detection using machine learning in san diego, california.\BBCQ \BIn \APACrefbtitle2021 IEEE SmartWorld, Ubiquitous Intelligence & Computing, Advanced & Trusted Computing, Scalable Computing & Communications, Internet of People and Smart City Innovation (SmartWorld/SCALCOM/UIC/ATC/IOP/SCI) 2021 ieee smartworld, ubiquitous intelligence & computing, advanced & trusted computing, scalable computing & communications, internet of people and smart city innovation (smartworld/scalcom/uic/atc/iop/sci) (\BPGS 622–629). \PrintBackRefs\CurrentBib
  • Masci \BOthers. (\APACyear2011) \APACinsertmetastarmasci2011stacked{APACrefauthors}Masci, J., Meier, U., Cireşan, D.\BCBL \BBA Schmidhuber, J.  \APACrefYearMonthDay2011. \BBOQ\APACrefatitleStacked convolutional auto-encoders for hierarchical feature extraction Stacked convolutional auto-encoders for hierarchical feature extraction.\BBCQ \BIn \APACrefbtitleArtificial Neural Networks and Machine Learning–ICANN 2011: 21st International Conference on Artificial Neural Networks, Espoo, Finland, June 14-17, 2011, Proceedings, Part I 21 Artificial neural networks and machine learning–icann 2011: 21st international conference on artificial neural networks, espoo, finland, june 14-17, 2011, proceedings, part i 21 (\BPGS 52–59). \PrintBackRefs\CurrentBib
  • Maxion \BBA Roberts (\APACyear2004) \APACinsertmetastarmaxion2004proper{APACrefauthors}Maxion, R\BPBIA.\BCBT \BBA Roberts, R\BPBIR.  \APACrefYearMonthDay2004. \BBOQ\APACrefatitleProper use of ROC curves in Intrusion/Anomaly Detection Proper use of roc curves in intrusion/anomaly detection.\BBCQ \APACjournalVolNumPagesSchool of Computing Science Technical Report Series. \PrintBackRefs\CurrentBib
  • Meddour-Sahar (\APACyear2015) \APACinsertmetastarmeddour2015wildfires{APACrefauthors}Meddour-Sahar, O.  \APACrefYearMonthDay2015. \BBOQ\APACrefatitleWildfires in Algeria: problems and challenges Wildfires in algeria: problems and challenges.\BBCQ \APACjournalVolNumPagesiForest-Biogeosciences and Forestry86818. \PrintBackRefs\CurrentBib
  • Melo (\APACyear2013) \APACinsertmetastarmelo2013area{APACrefauthors}Melo, F.  \APACrefYearMonthDay2013. \BBOQ\APACrefatitleArea under the ROC Curve Area under the roc curve.\BBCQ \APACjournalVolNumPagesEncyclopedia of systems biology2013. \PrintBackRefs\CurrentBib
  • \APACcitebtitleMODIS/Aqua+Terra Thermal Anomalies/Fire locations 1km FIRMS V0061 NRT (\APACyear2020) \APACinsertmetastarmcdl\APACrefbtitleMODIS/Aqua+Terra Thermal Anomalies/Fire locations 1km FIRMS V0061 NRT Modis/aqua+terra thermal anomalies/fire locations 1km firms v0061 nrt [dataset]. \APACrefYearMonthDay2020. \APACaddressPublisherNASA FIRMS. {APACrefURL} https://www.earthdata.nasa.gov/learn/find-data/near-real-time/firms/mcd14dl-nrt {APACrefDOI} 10.5067/FIRMS/MODIS/MCD14DL.NRT.0061 \PrintBackRefs\CurrentBib
  • Muñoz Sabater (\APACyear2019) \APACinsertmetastarera5land{APACrefauthors}Muñoz Sabater, J.  \APACrefYearMonthDay2019. \APACrefbtitleERA5-Land hourly data from 1950 to present Era5-land hourly data from 1950 to present [dataset]. \APACaddressPublisherCopernicus Climate Change Service (C3S) Climate Data Store (CDS). {APACrefURL} https://cds.climate.copernicus.eu/cdsapp#!/dataset/reanalysis-era5-land {APACrefDOI} 10.24381/cds.e2161bac \PrintBackRefs\CurrentBib
  • Nhangumbe \BOthers. (\APACyear2023) \APACinsertmetastarnhangumbe2023supervised{APACrefauthors}Nhangumbe, M., Nascetti, A., Georganos, S.\BCBL \BBA Ban, Y.  \APACrefYearMonthDay2023. \BBOQ\APACrefatitleSupervised and unsupervised machine learning approaches using Sentinel data for flood mapping and damage assessment in Mozambique Supervised and unsupervised machine learning approaches using sentinel data for flood mapping and damage assessment in mozambique.\BBCQ \APACjournalVolNumPagesRemote Sensing Applications: Society and Environment32101015. \PrintBackRefs\CurrentBib
  • Norman \BOthers. (\APACyear2021) \APACinsertmetastarnorman2021apocalypse{APACrefauthors}Norman, B., Newman, P.\BCBL \BBA Steffen, W.  \APACrefYearMonthDay2021. \BBOQ\APACrefatitleApocalypse now: Australian bushfires and the future of urban settlements Apocalypse now: Australian bushfires and the future of urban settlements.\BBCQ \APACjournalVolNumPagesnpj Urban Sustainability112. \PrintBackRefs\CurrentBib
  • Ogie \BOthers. (\APACyear2022) \APACinsertmetastarogie2022twitter{APACrefauthors}Ogie, R., Moore, A., Wickramasuriya, R., Amirghasemi, M., James, S.\BCBL \BBA Dilworth, T.  \APACrefYearMonthDay2022. \BBOQ\APACrefatitleTwitter data from the 2019–20 Australian bushfires reveals participatory and temporal variations in social media use for disaster recovery Twitter data from the 2019–20 australian bushfires reveals participatory and temporal variations in social media use for disaster recovery.\BBCQ \APACjournalVolNumPagesScientific reports12116914. \PrintBackRefs\CurrentBib
  • Pang \BOthers. (\APACyear2022) \APACinsertmetastarpang2022forest{APACrefauthors}Pang, Y., Li, Y., Feng, Z., Feng, Z., Zhao, Z., Chen, S.\BCBL \BBA Zhang, H.  \APACrefYearMonthDay2022. \BBOQ\APACrefatitleForest fire occurrence prediction in China based on machine learning methods Forest fire occurrence prediction in china based on machine learning methods.\BBCQ \APACjournalVolNumPagesRemote Sensing14215546. \PrintBackRefs\CurrentBib
  • Pedregosa \BOthers. (\APACyear2018) \APACinsertmetastarpedregosa_scikit-learn:_2018{APACrefauthors}Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O.\BDBLDuchesnay, E.  \APACrefYearMonthDay2018\APACmonth06. \APACrefbtitleScikit-learn: machine learning in python. Scikit-learn: machine learning in python. \APACaddressPublisherarXiv. \APACrefnotearXiv:1201.0490 [cs] \PrintBackRefs\CurrentBib
  • Pérez-Porras \BOthers. (\APACyear2021) \APACinsertmetastarperez2021machine{APACrefauthors}Pérez-Porras, F\BHBIJ., Triviño-Tarradas, P., Cima-Rodríguez, C., Meroño-de Larriva, J\BHBIE., García-Ferrer, A.\BCBL \BBA Mesas-Carrascosa, F\BHBIJ.  \APACrefYearMonthDay2021. \BBOQ\APACrefatitleMachine learning methods and synthetic data generation to predict large wildfires Machine learning methods and synthetic data generation to predict large wildfires.\BBCQ \APACjournalVolNumPagesSensors21113694. \PrintBackRefs\CurrentBib
  • Pradhan \BOthers. (\APACyear2021) \APACinsertmetastarpradhan2021computational{APACrefauthors}Pradhan, P., Meyer, T., Vieth, M., Stallmach, A., Waldner, M., Schmitt, M.\BDBLBocklitz, T.  \APACrefYearMonthDay2021. \BBOQ\APACrefatitleComputational tissue staining of non-linear multimodal imaging using supervised and unsupervised deep learning Computational tissue staining of non-linear multimodal imaging using supervised and unsupervised deep learning.\BBCQ \APACjournalVolNumPagesBiomedical Optics Express1242280–2298. \PrintBackRefs\CurrentBib
  • Qiang \BOthers. (\APACyear2011) \APACinsertmetastarqiang{APACrefauthors}Qiang, Z., Ze Meka, E., C. Anderson, R.\BCBL \BBA Kakabadse, Y.  \APACrefYearMonthDay2011. \APACrefbtitleOur planet: Forests nature at your service. Our planet: Forests nature at your service. {APACrefURL} https://www.unep.org/resources/report/our-planet-forests-nature-your-service \PrintBackRefs\CurrentBib
  • Sakr \BOthers. (\APACyear2011) \APACinsertmetastarsakr_efficient_2011{APACrefauthors}Sakr, G\BPBIE., Elhajj, I\BPBIH.\BCBL \BBA Mitri, G.  \APACrefYearMonthDay2011\APACmonth08. \BBOQ\APACrefatitleEfficient forest fire occurrence prediction for developing countries using two weather parameters Efficient forest fire occurrence prediction for developing countries using two weather parameters.\BBCQ \APACjournalVolNumPagesEngineering Applications of Artificial Intelligence245888–894. \PrintBackRefs\CurrentBib
  • Salehi \BOthers. (\APACyear2016) \APACinsertmetastarsalehi_dynamic_2016{APACrefauthors}Salehi, M., Rusu, L\BPBII., Lynar, T.\BCBL \BBA Phan, A.  \APACrefYearMonthDay2016\APACmonth08. \BBOQ\APACrefatitleDynamic and robust wildfire risk prediction system: an unsupervised approach Dynamic and robust wildfire risk prediction system: an unsupervised approach.\BBCQ \BIn \APACrefbtitleProceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (\BPGS 245–254). \APACaddressPublisherSan Francisco California USAACM. \PrintBackRefs\CurrentBib
  • Sanderfoot \BOthers. (\APACyear2022) \APACinsertmetastarsanderfoot2022review{APACrefauthors}Sanderfoot, O., Bassing, S., Brusa, J., Emmet, R., Gillman, S., Swift, K.\BCBL \BBA Gardner, B.  \APACrefYearMonthDay2022. \BBOQ\APACrefatitleA review of the effects of wildfire smoke on the health and behavior of wildlife A review of the effects of wildfire smoke on the health and behavior of wildlife.\BBCQ \APACjournalVolNumPagesEnvironmental Research Letters1612123003. \PrintBackRefs\CurrentBib
  • Saxena \BOthers. (\APACyear2017) \APACinsertmetastarsaxena2017review{APACrefauthors}Saxena, A., Prasad, M., Gupta, A., Bharill, N., Patel, O\BPBIP., Tiwari, A.\BDBLLin, C\BHBIT.  \APACrefYearMonthDay2017. \BBOQ\APACrefatitleA review of clustering techniques and developments A review of clustering techniques and developments.\BBCQ \APACjournalVolNumPagesNeurocomputing267664–681. \PrintBackRefs\CurrentBib
  • Sayad \BOthers. (\APACyear2019) \APACinsertmetastarsayad2019predictive{APACrefauthors}Sayad, Y\BPBIO., Mousannif, H.\BCBL \BBA Al Moatassime, H.  \APACrefYearMonthDay2019. \BBOQ\APACrefatitlePredictive modeling of wildfires: A new dataset and machine learning approach Predictive modeling of wildfires: A new dataset and machine learning approach.\BBCQ \APACjournalVolNumPagesFire safety journal104130–146. \PrintBackRefs\CurrentBib
  • Schölkopf \BOthers. (\APACyear1999) \APACinsertmetastarscholkopf1999support{APACrefauthors}Schölkopf, B., Williamson, R\BPBIC., Smola, A., Shawe-Taylor, J.\BCBL \BBA Platt, J.  \APACrefYearMonthDay1999. \BBOQ\APACrefatitleSupport vector method for novelty detection Support vector method for novelty detection.\BBCQ \APACjournalVolNumPagesAdvances in neural information processing systems12. \PrintBackRefs\CurrentBib
  • Serra \BBA Tagliaferri (\APACyear2019) \APACinsertmetastarserra2019unsupervised{APACrefauthors}Serra, A.\BCBT \BBA Tagliaferri, R.  \APACrefYearMonthDay2019. \APACrefbtitleUnsupervised Learning: Clustering. Encyclopedia of Bioinformatics and Computational Biology. S. Ranganathan, M. Gribskov, K. Nakai and C. Schönbach. Unsupervised learning: Clustering. encyclopedia of bioinformatics and computational biology. s. ranganathan, m. gribskov, k. nakai and c. schönbach. \APACaddressPublisherOxford, Academic Press. \PrintBackRefs\CurrentBib
  • Shin \BOthers. (\APACyear2005) \APACinsertmetastarshin2005one{APACrefauthors}Shin, H\BPBIJ., Eom, D\BHBIH.\BCBL \BBA Kim, S\BHBIS.  \APACrefYearMonthDay2005. \BBOQ\APACrefatitleOne-class support vector machines—an application in machine fault detection and classification One-class support vector machines—an application in machine fault detection and classification.\BBCQ \APACjournalVolNumPagesComputers & Industrial Engineering482395–408. \PrintBackRefs\CurrentBib
  • Shmuel \BBA Heifetz (\APACyear2022) \APACinsertmetastarshmuel2022global{APACrefauthors}Shmuel, A.\BCBT \BBA Heifetz, E.  \APACrefYearMonthDay2022. \BBOQ\APACrefatitleGlobal wildfire susceptibility mapping based on machine learning models Global wildfire susceptibility mapping based on machine learning models.\BBCQ \APACjournalVolNumPagesForests1371050. \PrintBackRefs\CurrentBib
  • Sulova \BBA Jokar Arsanjani (\APACyear2020) \APACinsertmetastarsulova2020exploratory{APACrefauthors}Sulova, A.\BCBT \BBA Jokar Arsanjani, J.  \APACrefYearMonthDay2020. \BBOQ\APACrefatitleExploratory analysis of driving force of wildfires in Australia: An application of machine learning within Google Earth engine Exploratory analysis of driving force of wildfires in australia: An application of machine learning within google earth engine.\BBCQ \APACjournalVolNumPagesRemote Sensing13110. \PrintBackRefs\CurrentBib
  • Sutera (\APACyear2021) \APACinsertmetastarsutera_importance_2021{APACrefauthors}Sutera, A.  \APACrefYearMonthDay2021\APACmonth06. \APACrefbtitleImportance measures derived from random forests: characterisation and extension. Importance measures derived from random forests: characterisation and extension. \APACaddressPublisherarXiv. \APACrefnotearXiv:2106.09473 [cs, stat] \PrintBackRefs\CurrentBib
  • Thudumu \BOthers. (\APACyear2020) \APACinsertmetastarthudumu2020comprehensive{APACrefauthors}Thudumu, S., Branch, P., Jin, J.\BCBL \BBA Singh, J.  \APACrefYearMonthDay2020. \BBOQ\APACrefatitleA comprehensive survey of anomaly detection techniques for high dimensional big data A comprehensive survey of anomaly detection techniques for high dimensional big data.\BBCQ \APACjournalVolNumPagesJournal of Big Data71–30. \PrintBackRefs\CurrentBib
  • Tian \BOthers. (\APACyear2014) \APACinsertmetastartian2014learning{APACrefauthors}Tian, F., Gao, B., Cui, Q., Chen, E.\BCBL \BBA Liu, T\BHBIY.  \APACrefYearMonthDay2014. \BBOQ\APACrefatitleLearning deep representations for graph clustering Learning deep representations for graph clustering.\BBCQ \BIn \APACrefbtitleProceedings of the AAAI conference on artificial intelligence Proceedings of the aaai conference on artificial intelligence (\BVOL 28). \PrintBackRefs\CurrentBib
  • Tuo \BOthers. (\APACyear2023) \APACinsertmetastartuo2023supervised{APACrefauthors}Tuo, K., Li, W., Deng, S.\BCBL \BBA Zhu, Y.  \APACrefYearMonthDay2023. \BBOQ\APACrefatitleSupervised, semi-supervised, and unsupervised learning of the Domany-Kinzel model Supervised, semi-supervised, and unsupervised learning of the domany-kinzel model.\BBCQ \APACjournalVolNumPagesarXiv preprint arXiv:2309.13990. \PrintBackRefs\CurrentBib
  • Vardoulakis \BOthers. (\APACyear2020) \APACinsertmetastarvardoulakis2020lessons{APACrefauthors}Vardoulakis, S., Marks, G.\BCBL \BBA Abramson, M\BPBIJ.  \APACrefYearMonthDay2020. \BBOQ\APACrefatitleLessons learned from the Australian bushfires: climate change, air pollution, and public health Lessons learned from the australian bushfires: climate change, air pollution, and public health.\BBCQ \APACjournalVolNumPagesJAMA Internal Medicine1805635–636. \PrintBackRefs\CurrentBib
  • Yu \BOthers. (\APACyear2020) \APACinsertmetastaryu2020bushfires{APACrefauthors}Yu, P., Xu, R., Abramson, M\BPBIJ., Li, S.\BCBL \BBA Guo, Y.  \APACrefYearMonthDay2020. \BBOQ\APACrefatitleBushfires in Australia: a serious health emergency under climate change Bushfires in australia: a serious health emergency under climate change.\BBCQ \APACjournalVolNumPagesThe Lancet Planetary Health41e7–e8. \PrintBackRefs\CurrentBib
  • Zenati \BOthers. (\APACyear2018) \APACinsertmetastarzenati2018adversarially{APACrefauthors}Zenati, H., Romain, M., Foo, C\BHBIS., Lecouat, B.\BCBL \BBA Chandrasekhar, V.  \APACrefYearMonthDay2018. \BBOQ\APACrefatitleAdversarially learned anomaly detection Adversarially learned anomaly detection.\BBCQ \BIn \APACrefbtitle2018 IEEE International conference on data mining (ICDM) 2018 ieee international conference on data mining (icdm) (\BPGS 727–736). \PrintBackRefs\CurrentBib
  • Zhang \BOthers. (\APACyear2021) \APACinsertmetastarzhang2021analysis{APACrefauthors}Zhang, J., Sheng, Z., He, Y., Zuo, X., Jin, B.\BCBL \BBA He, M.  \APACrefYearMonthDay2021. \BBOQ\APACrefatitleAnalysis of the Impact of the 2019–20 Australian Bushfire Season on the Atmospheric Environment Analysis of the impact of the 2019–20 australian bushfire season on the atmospheric environment.\BBCQ \APACjournalVolNumPagesFrontiers in Earth Science9566891. \PrintBackRefs\CurrentBib
  • Zhao \BOthers. (\APACyear2017) \APACinsertmetastarzhao2017spatio{APACrefauthors}Zhao, Y., Deng, B., Shen, C., Liu, Y., Lu, H.\BCBL \BBA Hua, X\BHBIS.  \APACrefYearMonthDay2017. \BBOQ\APACrefatitleSpatio-temporal autoencoder for video anomaly detection Spatio-temporal autoencoder for video anomaly detection.\BBCQ \BIn \APACrefbtitleProceedings of the 25th ACM international conference on Multimedia Proceedings of the 25th acm international conference on multimedia (\BPGS 1933–1941). \PrintBackRefs\CurrentBib