The SpaceNet 7 Multi-Temporal Urban Development Challenge: Dataset Release

Preface: SpaceNet LLC is a nonprofit organization dedicated to accelerating open source, artificial intelligence applied research for geospatial applications, specifically foundational mapping (i.e., building footprint & road network detection). SpaceNet is run in collaboration by co-founder and managing partner CosmiQ Works, co-founder and co-chair Maxar Technologies, and our partners including Amazon Web Services (AWS), Capella Space, Topcoder, IEEE GRSS, the National Geospatial-Intelligence Agency and Planet.
The SpaceNet partners are proud to announce the release of the SpaceNet 7 dataset. The SpaceNet 7 Multi-Temporal Urban Development Challenge dataset features a deep temporal stack of imagery and labels in over 100 locations across all six inhabited continents. Up-to-date high-fidelity maps are crucial for many applications (such as disaster response or humanitarian efforts, discussed in great detail in our previous blog), and this dataset will aid efforts to improve automated mapping and overhead change detection methods.
This is the first time that SpaceNet has used Planet imagery at ~4 meter resolution, as well as the first time that the temporal dimension will be explicitly incorporated into a challenge. The selected locations boast significant change over the two year data collection timeline. The dynamic nature of the data cubes permits tracking of urban development, specifically: building footprint evolution as well as address propagation. In the sections below we discuss the many facets of this unique open source dataset, distributed under a permissive CC BY-SA 4.0 license.
1. Geographic Coverage
The SpaceNet 7 dataset contains ~100 locations, spread out across the globe. Many of the locations were selected to highlight signifiant change, though some were selected to overlap with previous SpaceNet locales, and some were selected due to geopolitical interest. 60 of the data cubes are released as training examples (with both imagery and attendant labels), and 20 are released as test_public examples (imagery only). The remainder of the data cubes are reserved for final testing purposes. Figure 1 links to an interactive map that allows interested readers to explore the dataset locations.

2. Data Cubes — Imagery
At each location, monthly mosaics are curated over a timespan of two years. This lengthy time span captures multiple seasons and atmospheric conditions, as well as the commencement and completion of multiple construction projects, see Figure 2.

Cloud cover or (rarely) sensor mis-calibrations render some regions of images unusable. Accordingly, each image has an attendant unusable data mask (UDM) in standard GeoJSON format, see Figure 3.

3. Data Cubes — Building Footprint Labels
For each monthly mosaic, the SpaceNet labeling team painstakingly outlined the footprint of each building. These GeoJSON vector labels permit tracking of individual building locations (i.e. addresses) over time, hence the moniker: SpaceNet 7 Urban Development Challenge. See Figure 4 for an example of the building footprint labels in one of the training cities.

While building masks are useful for visualization (and for training deep learning segmentation algorithms) the precise vector labels of the SpaceNet 7 dataset permit the assignment of a unique identifier (i.e. address) to each building. Matching these building addresses between time steps is a central theme of the SpaceNet 7 challenge. In Figure 5 we display these building address changes.

4. Aggregate Stats

5. Dataset Access
As always, the SpaceNet data is freely available on AWS. As with previous data releases (e.g. SpaceNet 6) all you need is an AWS account and the AWS CLI installed and configured. Once you’ve done that, simply run the command below to download the training dataset to your working directory:
aws s3 cp s3://spacenet-dataset/spacenet/SN7_buildings/tarballs/SN7_buildings_train.tar.gz .
6. Conclusions
Detecting change in overhead imagery is a difficult task, greatly complicated by seasonal, atmospheric, and lighting effects. Yet the ability to localize and track the change in building footprints over time is an important facet in a number of applications, from disaster response to disease preparedness to environmental monitoring. Furthermore, this task poses interesting technical challenges for the computer vision community.
The SpaceNet 7 dataset provides a solid foundation for advances in these areas, via a large corpus of imagery and labels spanning over 100 distinct locations, greater than 40,000 square kilometers of observed area, and over 10 million labeled buildings.
Stay tuned for further updates on the upcoming SpaceNet 7 challenge and metric that will accompany this dataset (also see spacenet.ai for further resources). This challenge launches August 31, 2020, and will be featured as a competition at the 2020 NeurIPS conference in December.
* Special thanks to Jesus Martinez Manso for spearheading dataset creation.
2020-Aug-10 correction: The introduction was amended to specify the SpaceNet 7 dataset license (CC BY-SA 4.0)