Arctic Wintertime Sea Ice Lead Detection From Sentinel-1 SAR Images
Summary
1. The article presents a new method called SILDET for detecting sea ice leads in the Arctic using Sentinel-1 SAR imagery.
2. SILDET can detect both open leads and newly frozen leads covered by thin ice during winter months (January-April).
3. The method uses four main components:
- A segmentation module using PSPNet
- A balance module to handle class imbalance
- An optimization module utilizing lead shape features
- A mask module to remove land, landfast ice, etc.
4. SILDET was evaluated against visual interpretation of SAR images, achieving 97.80% overall accuracy and a Kappa coefficient of 0.88.
5. It was also compared to lead maps from Sentinel-2, MODIS, and another SAR-based method, showing generally good agreement.
6. The method was applied to generate Arctic-wide lead frequency maps for January-April 2023 at 40m resolution.
7. Lead width distributions derived from SILDET followed a power law, consistent with previous studies.
8. Some limitations include difficulty detecting very narrow leads (<360m wide) and potential misclassification in certain ice conditions.
9. Overall, SILDET is presented as an effective automated tool for monitoring Arctic sea ice lead distribution at high resolution during winter.
Figures and Tables
1. Figure 1: Flowchart of the preprocessing and the proposed lead detection method (SILDET)
2. Figure 2: Variation of the backscatter from sea ice and open water with incidence angle
3. Figures 3-15: Various images and plots related to the study (details not provided in the given excerpt)
Tables mentioned:
1. Table I: Details of the Training Dataset
2. Table II: List of the Sentinel-1 Scenes Selected for Evaluation
3. Table III: Precision, Recall, Kappa Coefficient, and OA for SILDET Results
4. Table IV: Length and Area of Retrieved Leads from Different Methods
5. Table V: Exponent of the Power-Law Fitting
6. Table VI: Length and Area of Retrieved Leads Based on SILDET for January and April 2023
7. Table VII: Results of the Ablation Study on SILDET
8. Tables VIII-X: Additional tables in appendices (details not provided)
Figure 1 is a flowchart illustrating the preprocessing steps and the SILDET method for sea ice lead detection. It consists of two main parts:
1. Pre-processing:
- Shows the steps from Sentinel-1 product to the final Sentinel-1 HH image
- Steps include thermal noise removal, border noise removal, radiometric calibration, terrain correction, and incidence angle normalization
2. SILDET Method:
- Consists of four modules:
(1) Segmentation module: Uses a neural network (likely PSPNet) with convolutional layers and a pyramid pooling module
(2) Balance module: Shows loss functions including dice loss and cross-entropy loss with OHEM
(3) Optimization module: Removes regions wider than 30km and non-linear regions using medial axis transform, eccentricity, and solidity
(4) Mask module: Removes land, large open water areas, and landfast ice using land mask, ASI ice concentration, and NIC ice chart data
The flowchart provides a comprehensive overview of the entire process from raw satellite data to the final sea ice lead detection output.
Artifacts
Artifacts Used:
1. Sentinel-1 SAR Images:
- Level-1 Ground Range Detected (GRD) products
- Extra Wide (EW) swath mode
- HH polarization
- 40m x 40m pixel spacing
2. Ancillary Data:
- Sea Ice Concentration (SIC) data from AMSR2
- Landfast ice maps from U.S. National Ice Center (NIC) ice charts
- Land mask from OpenStreetMap
- ERA5 wind data
- Ice drift data from NSIDC
3. Comparison Data:
- Sentinel-2 multispectral images
- MODIS Ice Surface Temperature (IST) data
- Lead maps from previous studies (Murashkin method, Qu et al. dataset)
Artifacts Produced:
1. SILDET Algorithm:
- A new method for detecting sea ice leads from Sentinel-1 SAR images
2. Lead Detection Maps:
- Binary maps showing the presence of open and frozen leads
3. Arctic-wide Lead Frequency Maps:
- Monthly maps from January to April 2023
- 40m spatial resolution
4. Lead Width and Length Statistics:
- Distributions following power-law relationships
5. Evaluation Metrics:
- Precision, recall, Kappa coefficient, and overall accuracy for the SILDET results
6. Comparative Analysis:
- Comparisons between SILDET results and other lead detection methods
7. Training Dataset:
- 211 subimages from 23 Sentinel-1 scenes with manually labeled categories
Availability
1. Sentinel-1 SAR Images: These are available through the Copernicus Open Access Hub or the Alaska Satellite Facility, as mentioned in the article. Copernicus Open Access Hub closed at the end of October 2023. Copernicus Sentinel data are now fully available in the Copernicus Data Space Ecosystem
2. Ancillary Data:
- AMSR2 Sea Ice Concentration data: Likely available from the University of Bremen website.
- NIC ice charts: Typically available from the U.S. National Ice Center website.
- OpenStreetMap land mask: Publicly available from the OpenStreetMap project.
- ERA5 wind data: Available through the Copernicus Climate Data Store.
- NSIDC ice drift data: Likely available on the NSIDC website.
3. Comparison Data:
- Sentinel-2 images: Available through the Copernicus Open Access Hub.
- MODIS IST data: Likely available through NASA's data portals.
4. The SILDET algorithm itself is not mentioned as being publicly available, but the article provides a detailed description of its methodology.
5. The specific lead detection maps, Arctic-wide lead frequency maps, and other derived products from this study are not mentioned as being publicly available.
6. The training dataset created for this study is not mentioned as being publicly shared.
Authors
1. Shiyi Chen
- Affiliation: School of Geospatial Engineering and Science, Sun Yat-sen University, Guangzhou, China
- Pursuing PhD, focusing on Arctic sea ice leads monitoring with remote sensing data
2. Mohammed Shokr
- Affiliation: Environment Canada
- PhD from Cairo University
- Extensive experience in sea ice physics and remote sensing since 1989
- Co-authored a book "Sea Ice Physics and Remote Sensing"
- Visiting Scientist at several Chinese universities
3. Lu Zhang
- Affiliation: School of Geospatial Engineering and Science, Sun Yat-sen University
- Pursuing PhD, focusing on thermal infrared remote sensing and Arctic sea ice
4. Zhilun Zhang
- Affiliation: School of Geospatial Engineering and Science, Sun Yat-sen University
- PhD focused on microwave remote sensing of sea ice
5. Fengming Hui
- Affiliation: School of Geospatial Engineering and Science, Sun Yat-sen University
- Professor of Polar Remote Sensing and Climate Change
- Published over 70 articles on polar remote sensing
6. Xiao Cheng
- Affiliation: Dean of School of Geospatial Engineering and Science, Sun Yat-sen University
- Previously at Beijing Normal University and Chinese Academy of Sciences
- Over 100 peer-reviewed articles on polar remote sensing and climate change
7. Peng Qin
- Affiliation: Sun Yat-sen University
- Pursuing PhD in cartography and geographic information systems
8. Dmitrii Murashkin
- Affiliation: German Aerospace Center (DLR) and University of Bremen
- PhD from University of Bremen
- Experience in sea ice studies and SAR image analysis for polar regions
Many of the authors have prior work related to remote sensing of sea ice and polar regions. The team combines expertise in various aspects of remote sensing, geospatial science, and polar climate studies, with a strong focus on Arctic sea ice monitoring and analysis.
Citation
S. Chen et al., "Arctic Wintertime Sea Ice Lead Detection From Sentinel-1 SAR Images," in IEEE Transactions on Geoscience and Remote Sensing, vol. 62, pp. 1-19, 2024, Art no. 4302419, doi: 10.1109/TGRS.2024.3444045.
Abstract: Leads are almost linear fractures within the ice pack, which are commonly observed in polar regions. In wintertime, leads promote energy flux from the underlying ocean to the atmosphere. Synthetic aperture radar (SAR) can monitor leads at a finer spatial resolution than other spaceborne datasets, regardless of solar illumination and atmospheric conditions. However, the SAR-based lead detection methods proposed to date are restricted to some specific areas, instead of the entire Arctic.
In this article, we present a generalized deep learning-based approach for automatic sea ice lead detection (SILDET) in the Arctic wintertime using Sentinel-1 SAR images. The validation results show that SILDET has the capability of detecting open and frozen leads at different stages of development. Compared with the visual interpretation of Sentinel-1 images, the overall detection accuracy is 97.80% and the Kappa coefficient is 0.88. The lead map of a regional study obtained from SILDET was compared to that from a previous SAR-based lead detection method and a lead dataset based on Moderate Resolution Imaging Spectroradiometer (MODIS) data. The lead map was also validated using Sentinel-2 images.
The result shows that SILDET can provide a more detailed distribution of leads and a better estimation of lead width and area. SILDET was applied to present the Arctic-wide lead distribution from January to April 2023 with a spatial resolution of 40 m. The Arctic-wide lead width distribution follows a power law with an average exponent of 1.65. The SILDET approach can be expected to provide long-term high-resolution lead distribution records.
keywords: {Ice;Lead;Sea ice;Sentinel-1;Arctic;Backscatter;Spatial resolution;Arctic ocean;deep learning;sea ice lead;segmentation;Sentinel-1},
URL: https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10637413&isnumber=10354519
Published in: IEEE Transactions on Geoscience and Remote Sensing ( Volume: 62)
Funding Agency:
Introduction
Sea ice leads are semi-linear fractures in the ice pack, which are usually formed during the divergence of ice cover. Lead width can vary from meters to kilometers. In wintertime, leads refreeze within a short time under the cold atmospheric temperature. Leads are composed of water and thin ice of a thickness of usually less than 30 cm (new and young ice, according to the World Meteorological Organization (WMO) [1]). Although open leads cover only 1%–2% of the total ice cover area in wintertime in the Arctic [2], they contribute more than 70% to the heat fluxes to the atmosphere [2]. Turbulent heat fluxes over open leads can be two orders of magnitude higher than those over the ice surface in winter [3]. The heat flux over a lead decreases as the frozen ice thickness increases, but is still significantly higher than that over the surrounding ice pack [4], [5]. Therefore, open and frozen leads play important roles in heat transfer over the Arctic Ocean, which further impacts the evolution of sea ice. Since Arctic sea ice has become thinner and more mobile in recent years [6], [7], [8], [9], an increasing role of leads in heat transfer over the Arctic Ocean can be expected. The areas of leads from midwinter to late spring could be a predictor for summer Arctic sea ice extent [10].
Satellite
sensors can provide long time series and wide coverage for Arctic sea
ice lead monitoring, with several products already available based on
moderate-resolution thermal infrared images [11], [12], microwave images [13], [14], and radar altimetry [15]. However, the relatively coarse spatial resolution (
Synthetic aperture radar (SAR) is capable of monitoring sea ice, independent of daylight and cloud coverage conditions, at a large scale and high spatial resolution. Murashkin et al. [20] and Murashkin and Spreen [21] used a random forest classifier to detect sea ice leads from Sentinel-1 dual-polarization images, with the algorithm based on polarization features and textural features derived from the gray-level co-occurrence matrix. Deep learning models, which can extract high-level feature information, are beneficial for various image analysis tasks [22], [23]. Advanced deep learning models have been widely used in detecting sea ice leads. For example, Murashkin and Frost [24] adjusted the UNet++ convolutional neural network architecture to classify sea ice types from Sentinel-1 images, including leads with low surface roughness and with wind-roughened surfaces; Zakhvatkina et al. [25] used texture characteristics as well as the difference and ratio between the co- and cross-polarizations (HH and HV) from Sentinel-1 images to detect leads; Murashkin [26] used U-Net [27] to detect leads from Sentinel-1 dual-polarization images, where, compared with the approach based on the random forest classifier [20], it obtained more robust results [26]; and Liang et al. [28] developed an algorithm based on an improved U-Net model [27]—the entropy-weighted network (EW-Net)—to detect open leads under low wind speed conditions from Sentinel-1 images. However, most of the existing SAR-based methods were tested on only a few scenes. To the best of our knowledge, there are no available Arctic-wide lead products based on the classification of SAR images, as a range of factors hinders the generalization and robustness of the SAR-based detection methods.
As sea ice leads cover only small parts of the ice cover area in the Arctic [2],
the number of observations belonging to leads is significantly lower
than that belonging to the other classes. With a deep learning approach,
this class imbalance problem affects the convergence during the
training phase and the generalization of the model in segmentation
performance [29], [30].
In addition, the variations in thickness and surface roughness that are
characteristics of frozen leads strongly affect the backscatter
signature. Leads covered with calm open water have the lowest
backscatter intensities. The backscatter coefficient (i.e.,
In this article, we aim to enhance the generalizability and robustness of a deep learning-based method for sea ice lead detection (SILDET). We propose a robust and efficient approach for SILDET in the Arctic wintertime (January to April) from Sentinel-1 HH polarization images. Note that the openings between ice floes are also included under the term “lead.” Three classes of surface cover are considered: 1) open leads; 2) frozen leads with thin ice (<0.30 m thick); and 3) background. The proposed method focuses on detecting the leads that exist in high ice concentration sea ice cover and with widths <30 km. To improve the performance of lead detection, the pyramid scene parsing network (PSPNet) [39] utilizing global scene category clues is applied. A new loss function is also proposed to alleviate the class imbalance problem. In addition, we utilize the narrow/elongated feature of leads to optimize the detection results. Finally, ancillary data are used to remove landfast ice and land, as well as the area outside the ice cover. In this article, we compare the lead maps from SILDET with those from visual interpretation and other existing lead detection methods. We then discuss the advantages and limitations of the proposed method. Furthermore, we describe how Arctic-wide lead distribution maps from January to April 2023 were generated utilizing SILDET. We then analyze the spatial distribution and width distribution of leads at a smaller scale than in previous studies.
Data
A. Sentinel-1 Images
Sentinel-1A/B
is a constellation of two polar-orbiting satellites, developed and
operated by the European Space Agency (ESA). The Sentinel-1A and
Sentinel-1B satellites were launched in April 2014 and April 2016,
respectively. Unfortunately, Sentinel-1B has not been available since
December 23, 2021. Each satellite carries a C-band SAR system with
dual-polarized channels. The sensor has four standard operational modes:
stripmap (SM), interferometric wide swath (IW), extra wide swath (EW),
and wave (WV). Sentinel-1 images are available free of charge from the
Alaska Satellite Facility and Copernicus Data Space Ecosystem. In this
study, we used the Level-1 ground range detected (GRD) product. All the
scenes were acquired in EW mode from January to April over the Arctic
sea ice. Only the HH polarization images were used in this study. The EW
mode GRD product has an approximately 400-km swath width, with an
incidence angle ranging from 18.9° to 47° and a spatial resolution of
B. Ancillary Data
The following ancillary data were used in the SILDET: sea ice concentration (SIC) data, landfast ice maps, and a land mask. The SIC data were derived from the Advanced Microwave Scanning Radiometer-2 (AMSR2) using the Arctic Radiation and Turbulence Interaction Study (ARTIST) sea ice (ASI) algorithm, with a 6.25-km spatial resolution for the dataset [40]. The ASI SIC data were provided by the Institute of Environmental Physics (IUP) at the University of Bremen, Germany. In addition, the landfast ice maps were derived from the weekly ice charts produced by the U.S. National Ice Center (NIC). NIC ice charts provide information about ice types, including landfast ice distribution. The charts are based on the integration of SAR data, visible and infrared imagery, and meteorological data. In addition, a shapefile including land polygons was derived from the OpenStreetMap project.1
The 10-m wind parameters, as described in Section IV-A, were obtained from the ERA5 hourly data at single levels from 1979 to the present [41].
The ice drift speeds were derived using ice motion data from the National Snow and Ice Data Center (NSIDC), and are used for reference in Section IV-B.
C. Comparison Data
Murashkin [26] utilized the U-Net convolutional neural network to detect sea ice leads from both the HH and HV SAR channels of Sentinel-1 EW scenes. This algorithm produces binary maps with a lead classification at the same pixel spacing (40 m) as SILDET. Two types of leads are considered in this method. One type is the leads covered by water and thin ice, with a smooth surface and low backscatter in both HH and HV polarization images. The other type is the open leads with a rough surface affected by wind, leading to high backscatter in the HH images but low backscatter in the HV images. The lead fractions obtained from this method are close to the observed open-water fractions derived from airborne ice thickness observations [42] during the Multidisciplinary drifting Observatory for the Study of Arctic Climate (MOSAiC) expedition [43]. This implies that this method detects predominantly open water and leads covered by smooth thin ice.
Qu et al. [44] produced daily sea ice lead maps for April from 2001 to 2020 in the Beaufort Sea using temperature anomalies from Moderate Resolution Imaging Spectroradiometer (MODIS) thermal imagery. Two types are presented in the maps: lead (5% uncertainty) and a potential lead class (50% uncertainty). This dataset was compared with the other existing lead datasets and Landsat 8 images, and showed a better performance in identifying leads with open water and frozen thin ice. The MYD29 MODIS ice surface temperature (IST) daily L3 product with a spatial resolution of 1 km is used for reference in the lead map comparison in Section IV-B.
A lead map was generated from Sentinel-2 images based on a threshold of reflectance. The Sentinel-2 mission comprises a constellation of two polar-orbiting satellites in the same sun-synchronous orbit, with a ten-day repeat cycle for each satellite. The payload of the Sentinel-2 satellites is a single MultiSpectral Instrument (MSI) composed of 13 bands, including blue, green, and red channel images (bands 2–4). The true-color (red: band 4, green: band 3, blue: band 2) composite images with a spatial resolution of 10 m were used in this study. We downloaded the Sentinel-2 Level-2A surface reflectance images from the Copernicus Open Access Hub.
In this article, for evaluation purposes, we compare the lead map from SILDET for April 20, 2019 in the Beaufort Sea with three other maps from the datasets mentioned above.
Methodology
In this section, we present a detailed description of the proposed method for SILDET from Sentinel-1 HH polarization SAR imagery. The description covers the preprocessing of SAR images, the generation of the training dataset, the settings for the training models, the framework of SILDET, and the evaluation of the detection results. A flowchart for the SAR image preprocessing and SILDET is illustrated in Fig. 1. SILDET consists of four modules: 1) a segmentation module; 2) a balance module; 3) an optimization module; and 4) a mask module.
A. Sentinel-1 Image Preprocessing
The
Sentinel-1 image preprocessing included border noise removal, thermal
noise removal, radiometric calibration with conversion to decibels,
terrain correction, and incidence angle normalization. All the
preprocessing was performed in the sentinel application platform (SNAP),
except for incidence angle normalization. The correction factors that
account for the incidence angle are different for sea ice and water [45].
We analyzed the dependence of the incidence angle for sea ice and water
using 480 sample points for each class from a set of 24 Sentinel-1
scenes acquired in the Arctic from January to April over the period of
2016–2022 (Table VIII
in Appendix A). Considering the effect of wind on the backscatter in HH
images, we visually selected samples of wind-free water surfaces in the
sea ice leads. The variation of backscatter for ice and water with
respect to incidence angle is shown in Fig. 2. The incidence angle slopes of the linear fits for sea ice and water are −0.26 and −0.11 (
B. Generation of the Training Dataset and Settings for the Training Models
The
sea ice lead training dataset consisted of subimages of Sentinel-1 HH
polarization and the corresponding categories. Each subimage was
Open Leads: These are openings within the ice cover or between ice floes with the lowest backscatter intensities (see the region with label A in Fig. 3). Although the backscatter intensity of open water depends on the surface wind, the wind effect on leads within sea ice cover is less than that over the open ocean, due to the wind shadowing by ice floes [47]. Therefore, we do not consider the wind effect on the identified leads in this article.
Frozen Leads: The leads with new ice (<0.10 m thick) and young ice (0.10–0.30 m thick), according to WMO nomenclature [48], were labeled as “frozen lead.” A frozen lead surface usually has increasing backscatter as the new ice thickness increases [49], [50] (see the region with label B in Fig. 3). The backscatter of the leads at the different stages of development is different. There is often a “stream line” texture (see the region with label C in Fig. 3) representing different ice growth stages in the frozen leads. In addition, all the frozen leads are characterized by a linear shape and higher IST than the surrounding ice pack. MODIS IST data were used to confirm the manual interpretation of leads in the SAR images, as shown in Fig. 3(c).
Background: This category included all objects that did not lead. Ice floes in leads were also classified as “background.” The training dataset consisted of scenes with open and frozen leads at different development stages.
Considering the computational cost, the samples from the training dataset were cropped from
We employed the PyTorch framework to implement the networks, which were trained on an NVIDIA GeForce RTX 3090 Ti GPU. The training was conducted for 100 epochs with a batch size of 16. The Adam optimization algorithm [51] was selected as the optimizer. The initial learning rate was set as 0.0001 and was decreased by a ratio of 0.91 after each epoch.
C. SILDET Approach
The components of SILDET (see Fig. 1) are explained in the following.
1) Segmentation Module:
The segmentation module uses PSPNet [39] to obtain the initial segmentation result from the Sentinel-1 input images, which are
2) Balance Module:
The training samples in the training dataset are extremely imbalanced among the different classes listed in Table I,
which could have degraded the performance of the networks. To deal with
this problem, data augmentation methods are usually employed, such as
geometrical transformations of the images (e.g., horizontal flipping and
multiscale strategy) or using a random occlusion mask on the images or
feature maps. On the other hand, some loss functions have been shown to
be able to tackle class imbalance. We use dice loss as well as
cross-entropy (CE) loss with online hard example mining (OHEM) to train
the model in SILDET. CE loss is the entropy that calculates the
prediction probability and the real situation based on each pixel in the
dataset. Dice loss is different from CE loss in that the former is
based on the dice overlap coefficient between the predicted results and
real situations [53],
and can thus alleviate class imbalance. OHEM is a training strategy
that focuses on hard samples during the training process and applies
higher weights to them [54].
This strategy can enable the model to concentrate more on hard samples,
which can be applied to further optimize the CE loss function and make
the training more effective and efficient. The hard sample pixels are
determined by a probability threshold
The overall loss function for the segmentation module is defined as
3) Optimization Module:
The optimization module quantifies features to identify the surviving regions as leads in the initial segmentation result. First, the objects with a width >30 km are removed. This value was referred from [44]. The medial axis distance is used in this step. The medial axis of an object is the set of all the points having more than one closest point on the object’s boundary [55]. Medial axis transform is used to compute the width of the objects. The ice floes misclassified as frozen leads are then removed. Sea ice leads have linear-like structures. In this step, the eccentricity and solidity are used. Eccentricity is the ratio of the distance between the foci of the ellipse and its major axis length. The value is between 0 and 1. An ellipse with an eccentricity is 0 is a circle, while an ellipse with an eccentricity of 1 is a line segment. However, a circular curve or circle would have similar eccentricity, so solidity is used as an additional feature to distinguish them. Solidity represents the ratio of pixels in the region to pixels of the convex hull image. According to the statistics based on the training dataset (Fig. 13 in Appendix B), the objects with eccentricity <0.95 and solidity >0.55 are not considered to be frozen leads.
4) Mask Module:
In this module, land, landfast ice, and the area outside the ice cover in the SAR images are removed. The land is removed using the land shapefile. In the coastal zones, landfast ice has a smooth surface and is often misclassified as open lead due to its similarity in low backscatter [56]. Landfast ice is masked using maps from the NIC ice charts. Daily ice cover maps from AIS SIC data are used to mask out the large and contiguous open-water areas. The daily ice cover is identified through three steps. In the first step, the regions with SIC >87% are generated to initiate the ice cover mask. This approach was based on the study by Guo et al. [46]. In order to eliminate the effects of ice motion during the time gap between SIC data and SAR data, the edge of the mask is reduced by 12.5 km in the second step. In the third step, leads with low ice concentration in the pack ice are filled using morphological operations. This process employs the hole-filling function, which identifies and fills enclosed background regions, ensuring a continuous ice cover. Lead detection results outside the ice cover maps are removed.
D. Evaluation of SILDET
To evaluate the performance and robustness of SILDET, we compared the SILDET results with Sentinel-1 visual interpretation. For comparison, eight subimages were obtained from different locations (Table II), which are shown in Fig. 4. Scenes (red boxes) from the Beaufort Sea, Chukchi Sea, East Siberian Sea, Laptev Sea, Kara Sea, and Greenland Sea were selected. In each month from January to April, two scenes were acquired. Finally, to test the robustness of the method, we ensured that the selected scenes covered different stages of lead freezing and sea ice conditions.
In this article, to quantitatively characterize the comparison between the SILDET results and the visual interpretation of Sentinel-1, we use four parameters: precision, recall, Kappa coefficient, and the overall accuracy (OA), based on the confusion matrix [57], [58]. The confusion matrix is formulated from four indices: true positives (TPs), true negatives (TNs), false positives (FPs), and false negatives (FNs). Precision is the number of correct results (TP) relative to the total number of results. Recall is the number of correct results relative to the number of visual interpretation results. OA denotes the proportion of correctly classified pixels in all the pixels. The Kappa coefficient [59] is a useful evaluation metric when dealing with imbalanced data
To further assess the reliability of the SILDET results obtained by SILDET, a lead map from Sentinel-2 scenes was generated for comparison. The blue box in Fig. 4 shows the location of the Sentinel-2 mosaic in the Beaufort Sea, which is characterized by high lead frequency [44].
E. Lead Width and Length Statistics
We analyzed the lead width distribution from SILDET and the other lead detection methods. Previous studies using submarine measurements and remote sensing data have shown that the lead width distribution follows a power-law distribution [60], [61], [62], [63]:
The method presented in Qu et al. [18]
was used to calculate the lead width and length from the lead maps. The
width for every pixel was determined by an orthogonal system (vertical
and horizontal). However, assuming a parallel lead boundary, this method
tends to overestimate lead width by up to 41.4% [18]. The overall length
Note that the two types—“open lead” and “frozen lead”—in the lead detection results are combined as one type in the lead width and length statistics.
Results
A. Comparison With Visual Interpretation of Sentinel-1
Fig. 5 shows subimages from the four Sentinel-1 HH polarization scenes (Nos. 2, 3, 5, and 7), the locations of which are shown in Fig. 4, along with the corresponding lead identification obtained using visual interpretation, as well as the SILDET results. The results for the other scenes (Nos. 1, 4, 6, and 8) are shown in Fig. 12. The scenes cover different lead conditions with different stages of development. The numerical indicators that characterize the comparison between SILDET and the visual interpretation results are presented in Table III. The OA and Kappa coefficients for the SILDET detection results are 97.80% and 0.88, respectively, which means a good match between SILDET and the visual interpretation results. In addition, the frozen lead class has a precision of 95.66% and a recall of 79.05%, which indicates that the detected frozen leads are mostly classified correctly by SILDET, but some frozen leads in the visual interpretation images are ignored. For the open leads, the precision and recall values are 83.17% and 90.58%, respectively. The low recall of frozen leads and the precision of the open leads can be mainly attributed to the misclassification of some frozen leads as open leads, because of their low backscatter. In general, SILDET can correctly detect most of the visible open and frozen leads from the SAR images. However, it does still generate some incorrect identifications due to backscatter overlapping.
Scene No. 7 in Fig. 5(a) shows the first-year ice (FYI) and open leads in the Sentinel-1 imagery. Most of the visible open leads are detected using SILDET. The evaluation parameters for this scene are high (Table III). However, some narrow leads with width <5 pixels are incompletely detected, such as the area marked by the blue ellipse in Fig. 5(b). The open leads present as discontinuous punctiform or linear objects in the SILDET detection results.
For Scene No. 5, many of the leads contain frozen sections. Most of the open and frozen sections are identified by SILDET. Fig. 5(b) shows an enlargement of the area in the yellow box in Scene 2. The average 10-m wind speed from ERA5 at 20:00 GMT was 4.31 m/s. In the area marked by the blue ellipse in Fig. 5(b), there is likely thin ice forming in the leads, which is being advected downwind. Some frozen parts of the leads with similar backscatter and texture to the surrounding FYI are ignored by SILDET. This results in a decrease in the recall for the frozen leads to 71.37%.
Scenes 2 and 3 (Table II) feature multiyear ice (MYI), FYI, open leads, and frozen leads. In Scene 3, there are ice floes within the frozen leads, as shown in the area with label A. Although the ice floes have a backscatter similar to that of the surrounding frozen leads, their texture is different. Both SILDET and visual interpretation can distinguish the ice floes from the background of frozen leads well (areas with white within the blue ellipse with label A). This can be attributed to the global context utilized by PSPNet in SILDET. In Scene No. 4, some frozen lead pixels are not detected (the blue ellipse with label B), due to their small backscatter difference with the surrounding ice. In addition, smooth FYI is misclassified as open lead in the area marked by the blue ellipses with label C. This causes the precision to take on a relatively low value of 67.22% for the open leads, which is the lowest value in Table III.
B. Comparison With the Lead Maps From Other Methods
To further assess the reliability of the SILDET results obtained by SILDET, we compared the lead results overlaid on a mosaicked true-color image assembled from four Sentinel-2 scenes (the location of each Sentinel-2 scene is shown in Fig. 13 in Appendix D). The four Sentinel-2 scenes were acquired at 21:22 on April 20, 2019, 22:31 on April 21, 2019, and 21:40 on April 21, 2019 in the Beaufort Sea. The locations of the frames are shown in Fig. 4 (blue box). Fig. 6(d)–(g) shows examples of the lead detection maps from the four methods: d) SILDET based on Sentinel-1 images; e) Murashkin [26] method based on Sentinel-1 images; f) the method based on a threshold for the reflectance in Sentinel-2 images; and g) the Qu, et al. [44] method based on MODIS images. These maps are referred to as Fig. 6(d)–(g) in the subsequent parts of this section. All the lead maps are overlaid on the Sentinel-2 images.
The Sentinel-1 scene from 15:42 on April 20, 2019 is shown in Fig. 6(a).
The time difference between the acquisition of the Sentinel-1 and
Sentinel-2 scenes ranges from 6 h to one day. The average ice drift
speed on April 20, 2019 within the frame of Fig. 6
was 1.71 km/day. In this case, the open and frozen leads are clearly
visible in the pack ice region. The open leads are represented by low
backscatter in the HH polarization images. The frozen leads show a wide
range of
Compared with the Sentinel-2 images, SILDET captures the majority of the leads and shows the detailed spatial distribution of open leads, frozen leads, as well as ice floes in leads. Although the open leads shown in the SAR image could have frozen during the Sentinel-2 overpass, the open leads in our detection results are consistent with the areas of low reflectance in the Sentinel-2 images. Meanwhile, Fig. 6(d) is also basically consistent with the area that has a relatively high IST in the MODIS image. However, we found that SILDET misses some small leads in the Sentinel-2 images. The reason for this could be the limitations of SILDET, the spatial resolution differences, or ice motion due to the time difference between the data acquisition. Fig. 6(e) shows more small leads but fewer frozen leads than Fig. 6(f). Meanwhile, some FYI with low backscatter is misclassified as leads. The lead detection results of Fig. 6(g) identify most of the leads, compared to the Sentinel-2 retrievals. However, it appears that the edges of the leads are beyond the bounds of the leads in the Sentinel-2 images. This is probably because this dataset is based on MODIS thermal infrared images from many orbits within the same day.
We estimated and compared the lead width distributions of these lead maps. Note that we only include the “lead” type in Fig. 6(g) in this section. Fig. 7 shows the width distribution of the leads and their power-law fittings (the frame of Fig. 6). The lead width distribution from Fig. 6(d) is close to that from Fig. 6(f).
However, SILDET has limitations in detecting narrow leads. For the
narrow leads, the maximum area is associated with a lead width of around
360 m (9 pixels). A possible source of error in the distribution could
be the receptive field of PSPNet with a theoretical size of
We compared the power-law fittings of the lead width “x” of the three lead maps Fig. 6(d)–(g). They are presented as follows. Note that the result from Fig. 6(e) does not fit the power-law function.
Fig. 6(d):
Fig. 6(f):
Fig. 6(g):
The SILDET result from Sentinel-1 has the lowest power-law exponent, which is likely due to the limitation of SILDET in detecting narrow leads. The lead map from Qu et al. [44] has the lowest spatial resolution among the three sensors but the highest power-law exponent for the data. It is possible that the lead retrieval method from MODIS images is sensitive to narrow leads with a width smaller than the pixel width (<1 km). The power-law exponent of Sentinel-2 is close to that for MODIS. The methods for detecting leads from the MODIS and Sentinel-2 images are based on thresholds, and show better results for narrow leads, whereas SILDET is based on a segmentation network. The limitations of the segmentation network approach are discussed in Section V-B.
Furthermore, we list the length and area of the lead categories with different width ranges for these three methods in Table IV. We calculated the lengths and areas for the five categories of leads as (see Table IV): width
The total length of the leads from Fig. 6(d) is 1254.38 km. The Category B leads contribute most to the total length, with about 36%. The total length of the leads from Fig. 6(f)
is 9209.64 km, with the Category A leads constituting about 70% of the
total number of leads. The total length of the leads from Fig. 6(g) is close to that of Fig. 6(d), with 1415.46 km. Fig. 6(d) and (g) clearly underestimate the total length of the leads, compared with Fig. 6(f).
The main reason for this is that the lengths of the Category A and B
leads are underestimated in SILDET and ignored in the method of Qu et
al. [44].
Although Sentinel-1 has a nearly 100-m resolution, the measured lead
Category A accounts for only 6.44% of the total length. MODIS has a low
resolution and cannot provide information for these narrow leads. Fig. 6(g) is sensitive to the small leads with a width
As for the lead area from the three lead maps, Fig. 6(d) and (f) have close values of the total lead area, with 2922.65 and 3792.50 km2, respectively. The total area of leads is overestimated by 68% in Fig. 6(g), compared to Fig. 6(f).
For the three lead maps, the highest contributions are from the
Category E leads, with more than 50%. For the Categories A and B leads, Fig. 6(d) shows an underestimation of their contribution to the total lead area, compared with Fig. 6(f). For the lead categories with widths
C. Application of SILDET
To further explore the applicability of SILDET, we generated Arctic-wide lead frequency (including open and frozen leads) maps based on Sentinel-1 data from January to April 2023 with a spatial resolution of 40 m (Fig. 8). Lead frequency is defined as the number of scenes during a specified period (a month for this figure) in which a pixel is found to be covered by lead, relative to the number of available Sentinel-1 scenes. A total of 1108, 1045, 1169, and 1139 Sentinel-1 scenes were used to generate the maps for each month from January to April 2023, respectively. The image of the Beaufort Sea is enlarged in Fig. 9. The spatial distribution of the number of Sentinel-1 scenes from January to April 2023 is shown in Fig. 15 in Appendix E. Due to the retirement of the Sentinel-1B satellite in December 2021, there are fewer scenes than before in the Arctic. However, most of the regions in the area covered by the Sentinel-1 images have more than ten coverages per month from January to April 2023.
Fig. 8 shows that the lead frequency is less than 20% in most Arctic regions. This should be considered in the context of rapid ice motion during the time between observations and the short lead lifetime. Regions with a relatively high lead frequency include the Beaufort Sea, Baffin Bay, Fram Strait, Kara Sea, and the Barents Sea. The relatively high lead frequency in the Beaufort Sea is found in regions closer to land. In addition, compared with January and February, the lead frequency increases in March and April, as found in Baffin Bay, Fram Strait, and the Barents Sea. These regions are close to the marginal ice zone, where the pack ice becomes less dense in March and April.
The lead width distributions from January to April 2023 in the Arctic and Beaufort Sea are presented in Fig. 10. The number of pixels for each lead width is fit with a power-law distribution. Considering the limitation of SILDET in detecting narrow leads, the points with widths <360 m are not included in the power-law fitting (see Section IV-B). The exponents of the power-law fitting for the lead width distribution are listed in Table V. For the Arctic Ocean, the differences in the lead width distribution between January, February, March, and April 2023 are small, while the exponent increases toward early spring with an average of 1.65. This could be related to the changes in internal ice strength. A tendency of the exponent to increase throughout the winter season during the MOSAiC expedition was also reported [64]. Regarding the Beaufort Sea, the lead width distribution across the four months mainly differs in the leads with large widths. In addition, the exponent varies from 1.49 to 1.96.
We further analyzed the length and area of the leads in January and April. The leads were divided into five categories, which were the same as those in Section IV-B. The length and area for these categories are presented in Table VI.
For the Arctic, compared with January, the detected lead length and area in April are higher by 42% and 44%, respectively. The contributions to total lead length and area for the different lead categories are similar for the two months. The highest contribution to the total length is from Category B, with about 50%. The contribution of Category A length to the total length in January and April is about 13%. In addition, wide leads (Category E) account for only 2% of the total lead length. As for the area, the contribution of Category D to the total area is the highest among all the lead categories, at about 40%. Category A leads account for only 1%, which is the smallest value in Table VI.
In the Beaufort Sea, a higher total lead length but a lower total lead area is found in April, compared with January. However, the contributions of the lead categories to lead length and area are small between January and April. Category D makes the most contribution to the total lead area, with more than 35%. The second highest and lowest contributions belong to categories E and A, respectively, which is consistent with the Arctic.
Discussion
A. Ablation Experiments for SILDET
To evaluate the effectiveness of each module within SILDET, we conducted ablation experiments with various settings, including the loss function (CE loss and the loss function in the balance module) and the selection of network (i.e., PSPNet, U-Net, and DeepLabv3+) in the segmentation module, with and without the optimization module. Given that the mask module is a necessary component in the segmentation task based on remote sensing data, we used this module in all the experiments. All the experiments were evaluated with the scenes in the evaluation dataset, and the results are presented in Table VII. SILDET performs the best in all the experiments across all the metrics, indicating the importance of each module. In the following, we compare the different experiments to evaluate the role of the settings for each module in SILDET.
First, to evaluate the balance module, we compare Experiments 1 and 4. The segmentation network (PSPNet) was the same in both cases, but it was trained by the widely applied CE loss in Experiment 1 and by the loss function in the balance module in Experiment 4. By training the model with the loss function in the balance module, the detection of the open lead class, which accounts for a relatively small proportion of the training dataset, is improved. In Experiment 4, more open leads are detected and the recall is increased by 4%.
Second, we compared three deep learning networks in the segmentation module, i.e., U-Net [27], DeepLabv3+ [65], and PSPNet, which were used in Experiments 2–4, respectively. DeepLabv3+ is referred to as “DeepLab” for simplicity in Table VII. All the networks were trained by the loss function in the balance module. In addition, the optimization module was not used in Experiments 2–4. Experiment 4 using PSPNet achieves a Kappa coefficient improvement of 0.05–0.07, compared to the experiments based on the other networks. PSPNet shows the highest precision (90.58%) and recall (78.72%) for the frozen lead class. The detection performance using U-Net is slightly better than that for DeepLabv3+.
Finally, the contribution of the optimization module in SILDET could be estimated by comparing Experiments 4 and 7. Both experiments used PSPNet in the segmentation module and balance module. The optimization module was added in Experiment 7. This module improves the precision of frozen leads by filtering some incorrect regions. The results show that the optimization module significantly improves the precision for the frozen lead class by 6% and the Kappa coefficient by 0.02 in SILDET. Experiments 2 and 3 used U-Net and DeepLabv3+, respectively, in the segmentation module as well as the balance module. It was the same for Experiments 5–6 (respectively), but the optimization module was used in these two cases. The comparison of Experiments 2–3 and 5–6 demonstrates the effectiveness of the optimization module when other deep learning networks are used, with an increasing precision and Kappa coefficient for the frozen leads. These results indicate that the optimization module of SILDET utilizing the linear feature of leads can contribute to improved lead detection.
B. Method Constraints and Potential
SILDET is a lead detection algorithm which has applicability for SAR images containing open and frozen leads. However, there are four aspects of SILDET that should be further improved.
First, it is difficult to detect narrow leads with SILDET, especially leads with a width <360 m (see Section IV-B). One of the possible reasons for this could be the mixed pixels of narrow leads [15], [63]. When the lead area is equivalent to the area of one pixel, the lead is not always covered by a single pixel. An observation from a pixel covering a lead sometimes includes a contribution from surface types other than lead. This complicates the interpretation of the observation from a mixed pixel. Furthermore, SAR observations can contain noise, as well as overlap of the backscatter from the lead and other surface types. Methods based on a threshold for the backscatter cannot handle these situations. Compared with the threshold approach, deep learning networks incorporate more image information to improve the performance of detecting leads. Texture plays an important role in detecting leads in SAR images. However, a narrow lead usually covers a limited number of pixels in the image, allowing for less texture information. In addition, PSPNet in SILDET utilizes the top-most feature maps computed by ResNet-101, which shows a better performance than the other ResNets (Table X in Appendix F). The top feature maps have a large receptive field and rich semantic information but with a coarse resolution. There is little information left in the top-most features for small objects, which may compromise the segmentation performance for narrow leads. Improving the detection of small leads could allow more accurate lead distribution monitoring.
The second issue is the wind effect on the water surface in the leads, which can cause misclassification as ice [20], [28]. In this study, we did not consider this effect on open leads. However, the effect needs to be considered when the leads are very wide (a few kilometers). This issue could be resolved by utilizing cross-polarization SAR data [66]. Therefore, a strong and robust method of removing the thermal noise in cross-polarization images needs to be explored in the future.
Third, SILDET can misclassify some classes when their backscatter and texture are similar. For example, in Scene 8, the ice has a lower backscatter than the surrounding ice, resulting in its misclassification as an open lead. It is possible that the ice surface (whether bare or snow-covered) becomes wet. Furthermore, when a lead freezes while the ice develops under calm water conditions, it returns very low backscatter and smooth texture and hence may be misclassified as open lead, even though the ice becomes thick. Some physical conditions hinder accurate lead identification. A more comprehensive and balanced training dataset, with scenes covered by leads with different backscatter and texture, as well as different ice conditions, should allow a more comprehensive assessment of the performance of the network. Meanwhile, it is difficult to distinguish ridges and leads with frost flowers in SAR images. This can lead to an overestimation of Kappa and accuracy when ridges are classified as leads in the visual interpretation and SILDET results. Due to the lack of field data, we cannot consider these factors quantitatively.
Finally, in the preprocessing of Sentinel-1 images, we use a threshold to separate water and ice and determine the normalization coefficients for each type in the incidence angle normalization. Although this approach is simple and rapid, it may bring prior error information for SILDET. Meanwhile, SILDET may encounter difficulties when the same type of sea ice is located on different edges of the range swath of SAR images because the incidence angle dependence might not be normalized perfectly.
Based on SILDET, we derived the lead frequency distribution in the Arctic from January to April 2023. The regions with a relatively high lead frequency are consistent with previous studies [19], [67]. Meanwhile, the Arctic-wide lead width distribution follows a power law, with an average exponent of 1.65 from January to April 2023. This is in agreement with the results from previous studies, e.g., Wadhams [68] and Lindsay and Rothrock [69], which were, respectively, based on data from a submarine mission and Advanced Very High Resolution Radiometer (AVHRR) satellite data, with a power-law exponent of 2.0 and around 1.6, respectively. There was also a previous study based on the CryoSat-2 satellite which yielded an exponent of about 2.47 [15]. The difference in the exponents can be attributed to the different measurement systems (satellite and submarine), lead definitions, and detection methods.
The results presented in this article demonstrate the potential of SILDET for monitoring Arctic-wide sea ice leads during wintertime. However, as Sentinel-1B has been retired, the frequency of the Sentinel-1 coverage has decreased. In the future, the launch of the Sentinel-1C and Sentinel-1D satellites should enable more frequent coverage. In addition, the development of a cross-platform method for lead detection based on SAR images is necessary. This would provide extensive spatial and temporal coverage for Arctic sea ice lead monitoring based on the data from multiple satellites. Cross-calibration between SAR systems will be needed in this case.
Conclusion
In this article, we have proposed an automatic method to detect open and newly frozen sea ice leads covered by thin ice in wintertime (from January to April) in the Arctic, using Sentinel-1 HH polarization images. The method, called SILDET, is composed of four modules: 1) a segmentation module; 2) a balance module; 3) an optimization module; and 4) a mask module. In the segmentation module, PSPNet utilizes global content to improve the accuracy of the detection and the generalization ability. A loss function is then introduced to alleviate the class imbalance problem in the balance module. In the optimization module, the shape features of leads are utilized to improve the precision evaluation for frozen leads. Land, landfast ice, and areas outside the ice cover are removed in the mask module.
By comparing the SILDET results with the lead results from visual interpretation of SAR imagery covering different ice conditions and development stages of leads, the OA and Kappa were 97.80% and 0.88, respectively. This shows that the sea ice lead maps detected by SILDET agree with the visual interpretation well. Furthermore, SILDET showed a good identification performance for both water and thin ice in the sea ice leads. SILDET achieved detection accuracy for the open lead class with an overall precision of 83.17% and recall of 90.58%. For the frozen lead class, the overall precision and recall from SILDET were 95.66% and 79.05%, respectively. The ablation experiments demonstrated that the segmentation module, balance module, and optimization module in SILDET contribute equally to the improvement of the detection performance. The SILDET detection results were compared with the lead maps from Sentinel-2 images, lead maps from an existing SAR-based lead detection method, and a lead dataset based on MODIS images. SILDET captured most of the open leads and frozen leads shown in the Sentinel-2 maps. Compared with the lead maps from the other two methods, the lead width distribution and area from SILDET were closer to those from Sentinel-2. We derived the lead frequency distribution in the Arctic from January to April 2023 using SILDET and analyzed the lead width distribution. The Arctic lead width derived from the SILDET detection result followed a power-law distribution, with the fit exponents varying from 1.57 to 1.71. Overall, SILDET is an effective and robust method for the automated detection of Arctic sea ice leads with their open and frozen components in wintertime using Sentinel-1 HH polarization imagery. The method can be used as a tool to monitor sea ice lead distribution at a fine resolution.
ACKNOWLEDGMENT
The authors would like to thank the Alaska Satellite Facility Data Active Archive Center, the European Centre for Medium-Range Weather Forecasts (ECMWF), the NASA Goddard Space Flight Center, the Copernicus Open Access Hub, the University of Bremen, and the U.S. National Ice Center for providing the Sentinel-1, ERA5, MODIS, Sentinel-2, SIC, ice drift, and ice chart data, respectively. They would also like to thank Dr. M. Qu from the Polar Research Institute, China, for providing the lead dataset based on MODIS images.
No comments:
Post a Comment