Published November 13, 2020 | Version v1.0
Dataset Open

Characterization of Industrial Smoke Plumes from Remote Sensing Data

  • 1. University of St. Gallen
  • 2. Sociovestix Labs Ltd.
  • 3. ISS Inc.

Description

Characterization of Industrial Smoke Plumes from Remote Sensing Data
 

This data set contains imaging data acquired by ESA's Sentinel-2 Earth-observing satellite constellation for a sample of industrial sites that were picked based on emission information provided by the European Pollutant Release and Transfer Register. The images contain scenes of mainly industrial sites, some of which are actively emitting smoke plumes.

This data set was created to investigate whether it would be possible to train a deep learning model to automatically identify and segment smoke plumes from remote sensing image data. Please refer to the acknowledgements section for more on information on this project.


Description

Each image is provided in the GeoTIFF file format, contains a total of 13 bands and georeferencing information, and has a shape of 120 x 120 pixels (corresponding to a square area with an edge length of 1.2 km on the ground). The bands are extracted from Sentinel-2 Level-2A products, except for band 10, which has been extracted from the
corresponding Level-1C product (this band has not been utilized in the underlying work).

This repository contains a total of 21,350 images. Based on manual annotation, the image sample was split into a sample of 3,750 positive images that contain industrial smoke plumes, and 17,600 negative images that do not contain smoke plumes. Furthermore, this repository contains a collection of JSON files that hold manual segmentation labels for smoke plumes present in 1,437 images. Segmentation labels were generated using label-studio. Please note that polygon edge coordinates have to be scaled by a factor of 1.2 to fit the images.


Content

The following tarballs are contained in this repository:

  • README.md - this file
  • images.tar.gz [6.0GB] - contains 21,350 GeoTIFF images
  • segmentation_labels.tar.gz [350KB] - contains 1,437 JSON files

Acknowledgement

If you use this data set, please cite our publication:

Mommert, M., Sigel, M., Neuhausler, M., Scheibenreif, L., Borth, D., "Characterization of Industrial Smoke Plumes from Remote Sensing Data", Tackling Climate Change with Machine Learning workshop at NeurIPS 2020.

Please refer to this publication for additional information on the data set.

The code used for this publication is available at github.

This data set contains modified Copernicus Sentinel data acquired in 2019, processed by ESA.

 

Responsible Author

Michael Mommert
University of St. Gallen, Institute of Computer Science
Chair Artificial Intelligence and Machine Learning
michael.mommert ( at ) unisg.ch

Files

README.md

Files (6.4 GB)

Name Size Download all
md5:d87992cf622920dc0135e16bac9e2cfd
6.4 GB Download
md5:859d07bb43ab99fcd7790522717aba08
2.8 kB Preview Download
md5:73e6124bc285b0cff100613ef9e28668
340.6 kB Download

Additional details

Related works

Has part
Software: https://github.com/HSG-AIML/IndustrialSmokePlumes (URL)
Conference paper: arXiv:2011.11344 (arXiv)