Multiple Mobile Target Detection and Tracking in Active Sonar Array Using a Track-Before-Detect Approach
Notes and Comments on the Paper
Summary
The paper presents a new method for detecting and tracking multiple underwater mobile targets using active sonar with a hydrophone array. Key points:
- It uses a track-before-detect approach to overcome high false alarm rates due to low signal-to-clutter ratio (SCR).
- For each transmitted chirp signal, a 2D angle-distance matrix is created by pulse compression and delay-and-sum beamforming. This is filtered with a 2D CFAR detector to identify potential targets.
- Closely spaced detections from multiple transmissions are merged into blobs. A nearly constant velocity model is used to track the blobs over time.
- Position and velocity are estimated using a debiased converted measurement Kalman filter.
- Simulations and sea experiments tracking tagged fish show the method provides good track continuity and accuracy that is robust to detection threshold choice. It outperforms two benchmark schemes, especially in low SCR conditions.
- The key advantages are utilizing the expected smooth target motion to reduce false alarms, and its low computational complexity suitable for real-time onboard processing. Further work will address array ambiguity and exploitation of multiple target reflections.
Functional Block Diagram
Figure 2 presents a block diagram that illustrates the main steps of the proposed underwater target detection and tracking method. The process flow in the FBD is quite conventional for an active sonar or radar, as follows:
- R-Theta 2D Matrix samples
- Polar to Cartesian conversion
- Detection to track correlation
- Track initiation or Detection to track assignment
- Track update
- Track Confirmation
The process flow described in section 4 does not follow the FBD very closely, and can be described as follows:
- Pulse Compression: The received hydrophone signals are first processed using a matched filter adapted to the transmitted chirp signal. This step compresses the chirp reflections into short pulses, improving the signal-to-noise ratio and range resolution.
- Beamforming: The compressed signals from each hydrophone are combined using a delay-and-sum beamformer to form a set of beams covering different azimuth angles. This step enhances the spatial selectivity of the system and allows estimation of the target's bearing.
- 2D CFAR Detection: The beamformed data is organized into a 2D matrix with dimensions of range and azimuth. A 2D constant false alarm rate (CFAR) detector is applied to this matrix to identify potential targets while adaptively setting the detection threshold based on the background noise and clutter levels.
- Blob Detection: The CFAR detector output is a binary matrix, where '1' indicates a potential target detection. Connected regions of '1's are grouped into "blobs" and their centroids are calculated to estimate the range and azimuth of each potential target.
- 2D Kalman Filter: The range and azimuth measurements of the detected blobs are fed into a 2D Kalman filter for tracking. The filter uses a nearly constant velocity (NCV) model to predict the target's motion between measurements and update its estimated state (position and velocity) based on the new observations.
- Track Management: The tracking process includes logic for initiating, confirming, and deleting tracks based on the consistency of blob measurements over time. Confirmed tracks are output as the final target trajectories.
The block diagram in Figure 2 provides a high-level overview of the processing chain, showing how the received sonar signals are transformed and filtered at each stage to extract target information and maintain tracks over time. This modular representation helps to break down the complexity of the system and identify the key functional components.
Test
The paper describes a sea experiment conducted in Šibenik, Croatia in January 2024 to validate the proposed detection and tracking method. The experimental setup included:
Equipment:
- - A self-built floater with a 4-hydrophone planar array and a single projector positioned above the hydrophones
- - The hydrophones were spaced about 25 cm apart
- - The projector was about 40 cm above the hydrophones
- - The floater maintained a depth of 20 m using buoyancy control
- - A small surface buoy with GPS was attached to the floater to monitor its position
Location:
- - The experiment was conducted in the sea near Šibenik, Croatia
- - Water depth was approximately 30 m
- - The seafloor was relatively flat and muddy
- - The water had a freshwater layer in the top 3 m from a nearby river
Data gathered:
- - Linear frequency modulated chirp signals were emitted in the 31-41 kHz band with 10 ms duration
- - 1 second interval between emissions to avoid interference
- - Hydrophone data was sampled at 96 kHz, 2 bytes per sample
- - Maximum detection range was set to 500 m
- - 2 gilt-head seabream fish were released and tracked as targets
- - Each fish was attached to a surface float with GPS (2 m accuracy) via a 20 m line to record ground truth tracks
- - 3 tracking experiments were conducted with the fish swimming freely
The GPS-tagged fish provided ground truth tracks to compare against the sonar tracking estimates. Ethical approval was obtained for the experiment.
Test Sonar Parameters and Analysis
The paper specifies the following parameters for the transmitted waveform in the sea experiment (Section VI-B1):
- - Waveform type: Linear frequency modulated (LFM) chirp signals
- - Frequency band: 31 kHz to 41 kHz
- - Bandwidth: 10 kHz (implied by the frequency band)
- - Pulse duration: 10 ms
- - Repetition interval: 1 second between adjacent chirp transmissions
The relatively long repetition interval of 1 second was chosen to avoid interference between consecutive pulse transmissions, as the maximum detection range was set to 500 m. This ensures that reflections from distant targets will be received before the next pulse is transmitted.
Range Resolution
The wide bandwidth of 10 kHz provides good range resolution after pulse compression, while the 10 ms pulse duration allows sufficient energy to be transmitted for detecting small targets like fish at long ranges. The chosen frequency band in the 30-40 kHz range is suitable for medium-range underwater sensing applications.
In this case, the pulse duration is 10 ms (0.01 s) and the bandwidth is 10 kHz (10,000 Hz). The time-bandwidth product (TBP) is:
TBP = Pulse duration × Bandwidth = 0.01 s × 10,000 Hz = 100
The pulse compression gain is typically expressed in decibels (dB) and is given by:
Pulse compression gain (dB) = 10 × log10(TBP) = 10 × log10(100) = 20 dB
This means that the pulse compression operation will provide a processing gain of approximately 20 dB, which can significantly enhance the signal-to-noise ratio (SNR) of the received echoes and improve the detection performance of the sonar system.
The long duration and wide bandwidth of the transmitted chirp pulse allow for this high compression gain, which is one of the key advantages of using pulse compression techniques in active sonar systems. The compressed pulse will have a much shorter duration (inversely proportional to the bandwidth) compared to the original pulse,
The compressed pulse width is inversely proportional to the bandwidth of the transmitted chirp signal. It can be calculated using the following formula:
Compressed pulse width = 1 / Bandwidth
In this case, with a bandwidth of 10 kHz (10,000 Hz), the compressed pulse width would be:
Compressed pulse width = 1 / 10,000 Hz = 0.1 ms (0.0001 s)
To convert this compressed pulse width to range resolution, we need to consider the speed of sound in water, which is approximately 1500 m/s. The range resolution is given by:
Range resolution = (Speed of sound × Compressed pulse width) / 2
The factor of 2 in the denominator accounts for the two-way travel time of the acoustic signal from the sonar to the target and back.
Plugging in the values:
Range resolution = (1500 m/s × 0.0001 s) / 2 = 0.075 m = 7.5 cm
Therefore, the sonar system in this study has a range resolution of approximately 7.5 cm after pulse compression. This means that the system can distinguish between two targets that are separated by at least 7.5 cm in range.
The high range resolution achieved through pulse compression enables accurate localization and tracking of small targets, such as individual fish, as demonstrated in the sea experiments described in the paper.
Azimuth Resolution
The relatively small array and limited description of beamforming does not allow a thorough analysis of direction resolution and accuracy. The paper does not explicitly state the number of beams formed or their exact angles relative to broadside. Azimuth resolution beyond 50 meters will be much coarser than the range resolution. This is critical in blob formation and measurement covariance for kalman filtering. However, we can infer some information from the provided details.
In Section IV-A, the authors mention that the beamforming process creates "U spatial domain beams" and forms an "angle-distance 2D matrix" with dimensions U × Ls, where Ls is the number of range samples. This suggests that the system forms a discrete set of beams covering different azimuth angles.
Figure 8b shows the azimuth beam pattern of the 4-element hydrophone array used in the sea experiment. From this plot, we can observe that the main lobe of the beam pattern is relatively wide, with a 3 dB beamwidth of approximately 30 degrees. The plot also shows some side lobes at around ±45 degrees and ±90 degrees from the main lobe direction.
Given the wide main lobe and the presence of side lobes, it is likely that the system forms a set of overlapping beams to cover the entire 180-degree azimuth range in front of the array. The exact number of beams and their angular spacing are not specified, but a common approach is to space the beams at intervals of half the 3 dB beamwidth to ensure adequate coverage and overlap between adjacent beams.
Assuming a 3 dB beamwidth of 30 degrees and a half-beamwidth spacing, we can estimate that the system might form around 12 beams (180 degrees / 15 degrees) to cover the full azimuth range. However, this is only an estimate based on the available information.
The beam angles relative to broadside would depend on the chosen beam spacing and the desired coverage area. For example, with 12 beams and a 15-degree spacing, the beam angles might be: -90, -75, -60, -45, -30, -15, 0 (broadside), 15, 30, 45, 60, and 75 degrees.
Without more specific details from the authors, it is difficult to determine the exact beam configuration used in this study. Nonetheless, the beamforming process is a crucial step in the proposed detection and tracking method, as it allows the system to estimate the azimuth angle of the detected targets and form the angle-distance matrix for further processing.
Authors
The paper has five authors from institutions in Israel, Croatia, and the United States:
- Avi Abu - Hatter Department of Marine Technologies, University of Haifa, Israel
- Nikola Mišković - Faculty of Electrical Engineering and Computing, University of Zagreb, Croatia
- Oleg Chebotar - Hatter Department of Marine Technologies, University of Haifa, Israel
- Neven Cukrov - Division for Marine and Environmental Research, Ruder Bošković Institute, Croatia
- Roee Diamant - Hatter Department of Marine Technologies, University of Haifa, Israel; Faculty of Electrical Engineering and Computing, University of Zagreb, Croatia; Project CETI, New York, USA (corresponding author)
Roee Diamant (Senior Member, IEEE) received the B.Sc. and M.Sc. degrees from the Technion, Israel Institute of Technology, Haifa, Israel, in 2002 and 2007, respectively, and the Ph.D. degree from the Department of Electrical and Computer Engineering, University of British Columbia, Vancouver, BC, Canada, in 2013. He is currently an Associate Prof. with the Department of Marine Technologies, University of Haifa, Haifa, Israel, and also Heads the underwater Acoustic and Navigation Laboratory (ANL). From 2001 to 2009, he was with Rafael Advanced Defense Systems, Israel, as a Project Manager and Systems Engineer, where he developed a commercial underwater modem with network capabilities. In 2015 and 2016, he was a Visiting Professor with the University of Padova, Padova, Italy.
His research interests include underwater acoustic communication, underwater localization and navigation, object detection and classification, and sonar signal processing. He was the recipient of the NSERC Vanier Canada Graduate Scholarship in 2010, Israel Excellent Worker First Place Award from the Israeli Presidential Institute in 2019, three Best Paper awards, and the EU ERA-Chair appointment to the University of Zagreb, Zagreb, Croatia. He is also an Associate Editor for IEEE Journal of Ocean Engineering. He was the coordinator of the EU H2020 project SYMBIOSIS (BG-14 track).
The research was supported by the Schmidt Marine Foundation, the European Union's Horizon Europe program (project UWIN-LABUST), and Project CETI which received funding from several philanthropic organizations.
The paper cites several relevant works by other researchers on topics such as:
- - Underwater target detection and tracking applications
- - Active sonar systems and processing techniques
- - Constant false alarm rate (CFAR) detectors
- - Kalman and particle filtering for tracking
- - Probability hypothesis density filters for multitarget tracking
- - Sonar image processing and beamforming with arrays
This suggests the authors are building upon established techniques while introducing novel elements like the track-before-detect approach and low-complexity implementation suitable for real-time operation on autonomous platforms like floats and gliders. The sea experiment also demonstrates their capability to conduct realistic validation tests.
Computer Science > Sound
We present an algorithm for detecting and tracking underwater mobile objects using active acoustic transmission of broadband chirp signals whose reflections are received by a hydrophone array. The method overcomes the problem of high false alarm rate by applying a track-before-detect approach to the sequence of received reflections. A 2D time-space matrix is created for the reverberations received from each transmitted probe signal by performing delay and sum beamforming and pulse compression. The result is filtered by a 2D constant false alarm rate (CFAR) detector to identify reflection patterns corresponding to potential targets. Closely spaced signals for multiple probe transmissions are combined into blobs to avoid multiple detections of a single object. A track-before-detect method using a Nearly Constant Velocity (NCV) model is employed to track multiple objects. The position and velocity is estimated by the debiased converted measurement Kalman filter. Results are analyzed for simulated scenarios and for experiments at sea, where GPS tagged gilt-head seabream fish were tracked. Compared to two benchmark schemes, the results show a favorable track continuity and accuracy that is robust to the choice of detection threshold.
Submission history
From: Oleg Chebotar [view email][v1] Tue, 16 Apr 2024 06:48:41 UTC (1,240 KB)
Multiple Mobile Target Detection and Tracking in Active Sonar Array Using a Track-Before-Detect Approach
Avi Abu,
Nikola Mišković,
Oleg Chebotar,
Neven Cukrov,
Roee Diamant
This research was supported by the
Schmidt Marine Foundation via the Global Fisheries Tech Initiative, by
the Horizon Europe programme of the European Union under the UWIN-LABUST
project (project number 101086340), and by Project CETI via grants from
Dalio Philanthropies and Ocean X; Sea Grape Foundation; Rosamund
Zander/Hansjorg Wyss, Chris Anderson/Jacqueline Novogratz through The
Audacious Project: a collaborative funding initiative housed at TED.
Hatter Department of Marine Technologies, University of Haifa, Israel
Faculty of Electrical Engineering and Computing, University of Zagreb, Croatia
Division for marine and environmental research, Ruder Bošković Institute, Croatia
Project CETI, New York, NY 10003, USA
Corresponding author, email: roee.d@univ.haifa.ac.il
Abstract
We present an algorithm for detecting and tracking underwater mobile objects using active acoustic transmission of broadband chirp signals whose reflections are received by a hydrophone array. The method overcomes the problem of high false alarm rate by applying a track-before-detect approach to the sequence of received reflections. A 2D time-space matrix is created for the reverberations received from each transmitted probe signal by performing delay and sum beamforming and pulse compression. The result is filtered by a 2D constant false alarm rate (CFAR) detector to identify reflection patterns corresponding to potential targets. Closely spaced signals for multiple probe transmissions are combined into blobs to avoid multiple detections of a single object. A track-before-detect method using a Nearly Constant Velocity (NCV) model is employed to track multiple objects. The position and velocity is estimated by the debiased converted measurement Kalman filter. Results are analyzed for simulated scenarios and for experiments at sea, where GPS tagged gilt-head seabream fish were tracked. Compared to two benchmark schemes, the results show a favorable track continuity and accuracy that is robust to the choice of detection threshold.
Index Terms:
Sonar images, multitarget tracking, adaptive beamforming, track-before-detect, underwater target detection, biodiversity and biomass estimation.
I Introduction
The detection and tracking of underwater objects is required for applications such as the monitoring of marine fauna [1], the identification of schools of fish in commercial ventures[2] and the detection of prey of sperm whales (Physeter macrocephalus) for behavioral analysis [3]. Monitoring is done from vessels and underwater by floaters or gliders. Real-time analysis enables an immediate response to either report a detection event or approach the source to achieve a better signal-to-noise ratio (SNR). Two specific applications we are focusing on are the monitoring of deep-sea giant squid (Architeuthis dux) by autonomous gliders to draw conclusions about the behavior of sperm whales, for which squid are a major food source, and the monitoring of fish populations for biodiversity analysis. This is needed both for regulators who need to set data-driven policy and for sustainable fisheries to reduce bycatch by making fishermen aware of the location of fish schools. The first application is considered within a project called CETI, where we have set the ambitious goal of understanding the language of sperm whales by tracking their activity and vocalizations. The second application is carried out as part of a project called SOUND, where we deploy a swarm of autonomous floaters to monitor the water column in search of schools of fish. In both cases, the monitoring is done by active acoustics. In particular, by emitting acoustic signals and analyze their reflections to find moving targets. Pictures of our glider and floater platforms can be found in Fig 1a and 1b respectively. In this paper, we focus on the task of target detection and localization.
The detection of underwater objects is dominated by the signal-to-clutter ratio (SCR), which includes all reflections that do not associate with the target. Due to the low target strength of fish (in the order of -40 dB, [4]), the SCR is likely to be in the order of 0 dB. Therefore, in active acoustic detection, an array of hydrophone receivers is used to utilize the spatial domain by e.g., beamforming [5]. In our case, we use an array of four hydrophones positioned in the plane. Here, the challenge is to handle the ambiguity in the angle estimation, which reaches 15 for a plenary array [6]. Another challenge lies in the fast mobility of the tracked object and the possible existence of multiple targets.
Our multi-target detection and localization solution is based on the track-before-detect approach to analyze the reflections of a sequence of broadband chirp signals received by an acoustic array. For each transmitted chirp signal, a 2D angle-distance matrix is formed by pulse compression and delay-and-sum beamforming. The samples of this matrix are thresholded by a constant false alarm detector (CFAR) to eliminate weak reflections that may not be associated with the search targets. The remaining samples are grouped into blobs, followed by further filtering to remove detected regions that are too small or too large. To track objects, we employ the debiased converted measurement Kalman filter[7] with a near constant velocity dynamical model(NCV). The result is a scheme for 2D target tracking whose main advantage is the ease of handling objects with low SCR in a high false alarm rate environment.
The main contributions of this paper are twofold:
-
A 2D multitarget scheme for identifying moving fish by active acoustics while attenuating noise.
-
Merging adaptive beamforming and tracking into a single track-before-detect approach to reduce the false alarm rate. Incorporating the beamformer into a track-before-detect scheme makes it possible to overcome low SCR values while utilizing the expected smooth motion of the target.
The results of numerical simulations show the applicability of our method to overcome low SCR values in multi-target scenarios and its advantages over two benchmark schemes. The results of experiments at sea, where live fish were tagged with a GPS receiver and detected by our float, show the applicability of our approach in realistic scenarios and the accuracy achieved in a real marine environment.
The remaining of this paper is organized as follows. In Section II, we survey the state-of-the-art for object detection and tracking of underwater objects. System model and main assumptions are presented in Section III. Our target detection scheme is introduced in Section IV, and our multitarget tracking is outlined in Section V. Numerical and experimental results are discussed in Section VI. Concluding remarks are offered in Section VII.
II Literature Review
Target detection using the constant false alarm rate (CFAR) method is a common application for underwater monitoring tasks [8]. For clutter mitigation, a cell-averaged CFAR detector is offered in [9], where multiple cells are averaged to set the detection threshold. Other similar CFAR approaches are the accumulated cell-averaged CFAR [10], in which the accumulated sample matrix is calculated to reduce the computational cost in calculating the detection threshold, and the order statistics CFAR [11], in which noisy samples are attenuated by sorting the samples and selecting only the dominant samples. A detection method that is robust to complex environments is provided in [12], where the 2D matrix is clustered using the K-Means algorithm and the objects are identified using the resulting binary matrix.
For target tracking, the extended Kalman filter [13] is used due to its simplicity in tracking nonlinear dynamic or measurement models. The method relies on a motion model for the target, and a linearization of the measurement model. Versions of Kalman tracking for object tracking is the debiased converted measurement Kalman filter [7], in which the nonlinear polar measurements are converted into a linear system of Cartesian coordinates taking into account the bias due to the nonlinearity of the conversion.
For multitarget tracking, data association of detections into tracks is necessary. Methods include the multi-hypothesis tracking (MHT), in which all possible mappings between detections and tracks are considered, and joint probabilistic data association (JPDA) [14], in which a track is assigned to multiple detections by a probability measure. To reduce the complexity of the MHT filter, a Gaussian mixture probability hypothesis density (GM-PHD) filter is introduced in [15]. This filter has closed-form recursions for the propagation of the state vector and its covariance and its advantage lies in its practicability for underwater scenarios with multiple targets. In [16], CA-CFAR with threshold segmentation is used to reduce false alarms. The method applies iterative threshold segmentation to reduce false alarms. Another multi-target approach for underwater applications is the sequential Monte Carlo density algorithm [17]. For object detection, the authors offered a combination of CA-CFAR and the K-means detector. To reduce the number of missed tracks, a continuous lost frame threshold is added. More specifically, if the predicted states of a track are not correlated with an object across multiple sonar frames, the state vector and its covariance are replaced by the last updated state of the track. However, the results are limited to high SCR.
Acoustic arrays are used to increase the SCR. In [18], detection and tracking of underwater objects is performed using a multi-beam sonar. Detection is performed by segmenting the sonar image, while the position of the object after detection is determined by its contour position. Object tracking is done using a particle filter to model the posterior by a series of weighted samples. In [19], the Gaussian mixture cardinalized probability hypothesis density filter is applied to multistatic sonar data. The posterior is estimated by a Gaussian mixture and the state vector and its covariance are predicted by the EKF. The Gaussian mixture is also used in [20] for a cardinalized likelihood hypothesis filter over real sonar data obtained from a sea trial. This filter has the advantage that it can well handle clutter. A linear motion model is used with a non-linear measurement model. However, due to the high false alarm rate in real sonar data, there is a large track breakage rate. In [21], three optimization algorithms for probabilistic maximum likelihood data association are introduced. The formalization includes the multi-pass grid search, the genetic algorithm and the developed directed subspace search. However, solving directly involves high computational costs. Although the aforementioned methods provide good results, the complexity in tracking or detection is high and the implementation on an on-board computer may be limited to short missions. We identify two remaining challenges for the task of multitarget detection and tracking multiple targets: 1) high complexity and 2) high rate of missing tracks. The latter is especially true in scenarios with high false alarm rates and low SCR environment.
III System Model
Our goal is to detect and track mobile underwater objects while achieving a favorable trade-off between precision and recall as well as high accuracy in target localization. To manage a small array suitable for small platforms such as a floater or a glider, we focus only on 2D localization. Our scenario involves a Lagrangian floater or glider that drifts with the water current, changing only its depth to a target profile. This can be achieved by a piston-based buoyancy control, as is the case with the system used in our results below, or by a bladder [22].
For active acoustic detection, we use a single projector that emits a sequence of chirp signals of large bandwidth-duration product for high pulse compression. The reflections are recorded by an array of hydrophones ( in our results) mounted in a plenary array around the platform, as shown in Fig. 1. Further details can be found in [23]. The data received from the sensors is transferred to the angle-distance domain by applying a matched filter followed by a beamforming. The result is a 2D matrix in which each sample is associated with a time-of-arrival, which can be converted to a distance if the speed of sound is known, and an angle of arrival. The time domain input of the m hydrophone with sources is modeled as follows
(1) |
where , , and stand for the signal waveform, the time delay of the n source with azimuth denoted by , and the clutter noise. The time delay is given by
(2) |
where is sound velocity at water, is the Cartesian location of the m hydrophone, and is the spatial direction of the n source given by
(3) |
We assume that the speed of sound in water is known or measured by the profiling platform. Following [24], we assume that the angle-distance samples are independent and identically distributed (i.i.d.) and follow the Rayleigh distribution. We assume that the polar measurements are uncorrelated and corrupted by an additive white Gaussian noise with known or measured covariance. We neglect multipath reflections from the target through the seafloor and assume that the dominant reflection is given by the direct path. We argue that this is a reasonable assumption since the planar array does not account for elevation angle and most reflections are received from the same bearing. The dynamic path of the targets is assumed to follow a model with nearly constant velocity. The process noise of the dynamic target model is assumed to be Gaussian noise with zero mean and a known standard deviation .
IV Underwater Target Detection
Referring to the block diagram in Fig. 2, our target detection involves four steps: 1) pulse compression using a matched filter, 2) beamforming spatial domain beams and forming an angle-distance 2D matrix, 3) applying a constant 2D false alarm rate (CFAR) detector, and 4) binary map extraction and blob detection.
IV-A Forming the 2D angle-distance Matrix
For each emission of chirp signals, we create a 2D angle-distance matrix from the received reflections. The process of creating these matrices is as follows. First, a buffer of length is extracted at each of the array elements. Multiplying by half the speed of sound reflects the maximum detection range of the system. The received buffer is filtered separately at the output of each hydrophone with a matched filter that is adapted to the transmitted chirp signal. The compressed signals are beamformed into beams using the delay-and-sum procedure [25]. Formally, for a beam azimuth , the associated delay is . The data after beamforming for a reflection received at time instance , , is given by
(4) |
The angle-distance matrix is obtained by
(5) |
IV-B Blob Detection
The corresponding angle-distance 2D matrix for each emission is filtered to reduce clutter. The detection threshold is calculated according to a 2D CFAR detector to yield a binary matrix samples that passed (’1’) or didn’t pass (’0’) the threshold. The process involves estimation of the 2D angle-distance sample distribution parameters to calculate the CFAR detection threshold. The Rayleigh distribution,
(6) |
where is the law-specific parameter, is adopted to model the clutter distribution [24]. Parameter is estimated by the maximum-likelihood [26]
(7) |
where stands for the group of samples surrounding the sample under test in that belong to the reference region. The CFAR threshold is given by
(8) |
where is a target probability of false alarm.
The CFAR detection binary map is obtained by
(9) |
Adopting the 4-connectivity method [27], we identify in several regions of interest (ROI) who contain potential targets and that form a connected region. We refer to such region as a blob. The range and azimuth of the -th blob is estimated by
(10) |
where is the -th coordinate of the center mass of the -th blob, and is the sampling frequency. The azimuth is given by
(11) |
When the number of elements in the acoustic array is small, the beamwidth is large. As a result, the beamforming grating lobes are also large, and a target can be received at adjacent beams. To manage this ambiguity, we merge blobs that are closely spaced. Let be the group of all blobs obtained from a single emission. Let , be two separate blobs, which are merged if
(12a) | ||||
(12b) |
where and are the merging thresholds in range and azimuth, respectively. The threshold in range can be obtained by utilizing knowledge about the expected size of the mobile target to be detected, and the threshold in azimuth can be set by the position of the grating lobes of the array. The range and azimuth of the -th merged blob at the emission number is given by
(13a) | ||||
(13b) |
where is the group of all blobs that fulfill conditions in (IV-B). A tracking process is now applied on the merged blobs to detect a mobile target with a nearly constant velocity dynamical model. To keep the tracking process efficient with low complexity, we apply a linear Kalman filter, with a debiased polar to Cartesian measurement conversion as a fast and efficient method to handle a non linear measurement model.
IV-C Polar to Cartesian Conversion
Following [7], we convert the polar measurements () into a Cartesian space. The covariance matrix of the polar measurement noise is
(14) |
where and are the standard deviations of the range and the azimuth measurements, respectively. The debiased converted measurement in Cartesian coordinates is given by [7]
(15) |
V Underwater Target Tracking
The final step in our target detection is tracking, where we confirm valid mobile targets through detected blobs that follow a dynamical model of constant velocity. The dynamical model is given by
(16) |
where , and are the position vector of a target in X and Y cartesian frame and , are the Cartesian velocities, respectively, and is the process noise. Matrix in (16) is given by
(17) |
matrix is given by
(18) |
and is the time of the -th blob.
The measurement equation is given by
(19) |
with
(20) |
and is the measurement noise. The goal of our tracking is to estimate the position and velocity trajectories of a target. For this, we use Kalman filtering. The Kalman filter includes predict and update steps. The predicted step of the state vector and its covariance for the -th track are given by
(21a) | ||||
(21b) |
where is the process noise covariance matrix.
The update stage of the state vector and its covariance matrix is given by:
(22a) | ||||
(22b) | ||||
, |
where
(23) |
and . The term is the covariance of the debiased converted measurement noise [24].
V-A Detection to Track Correlation
To decrease computation time, we first apply a coarse gating between tracks and blobs that originate from one emission. Formally, only a blob that fulfills the following two conditions with at least one track is allowed, otherwise a new track is created. The gating between the -th blob and the -th track in range and azimuth is given by
(24a) | ||||
(24b) |
where and are the gating thresholds in range and azimuth, respectively. These thresholds can be set by utilizing the knowledge about the maximum velocity of the mobile target and the time between adjacent emissions. Terms , and , are the Cartesian coordinates of the -th track for emission number . Only tracks and blobs that meet constraint (V-A), are considered for the correlation
(25) |
The result is further thresholded,
(26) |
to ensure that a correct blob lies within the gate with a certain probability, which in turn is a function of the gate threshold . Blobs that are not correlated with any track are initiated as a new track. Correlated blobs are further considered for a detection to track assignment.
V-B Detection to Track Assignment
For each chirp emission, the global assignment process determines the best association of blobs and tracks. Let be the association matrix, where and are the number of tracks and blobs, respectively, at emission number , respectively. is given by
(27) |
Let be a set of tracks and blobs containing indices for possible assignments. The global assignment is obtained by finding the blobs-tracks assignment with the minimum global statistical distance, namely
(28) |
To solve (28), we use the auction algorithm [28] due to its simplicity and low computational load. A blob that is not assigned to a track (in the case ) opens a new track. A track is removed if it has not been assigned to a blob at least times during the last emissions.
V-C Track Confirmation
A track is confirmed if the number of blobs assigned to it is greater than . Formally,
(29) |
where is an indicator function that is 1 if the argument is true and 0 otherwise.
V-D Complexity Analysis
The complexity of our detection and tracking method per single emission is
(30) |
This is composed of a complexity of for the debiased Kalman filter [29], where is the size of the state vector , and stands for the 2D CFAR detection. While the complexity of the particle filter is , and the complexity of the PHD filter is [30]. In noisy environment, , which implies that the proposed detection and tracking method has much lower complexity compared to the particle and PHD filters.
VI Performance Evaluation
In this section, we analyze the performance of our detection and tracking algorithm with both simulative and real data. We compare our method with benchmarks that focus on underwater acoustic detection and tracking. In particular, the methods in [31], referred below as Lo, and the scheme in [32], referred below as Karoui, which are commonly used underwater detection and tracking schemes designed specifically for noisy underwater environment.
The waveform is a chirp signal with a bandwidth of 10 kHz and a duration of 0.01 s. The sampling frequency is set to 50 kHz. The merging thresholds and are set to 10 m and , respectively. The standard deviations for the range and azimuth and are set to 0.3 m and , respectively. The standard deviation of the process noise is set to m/s. The gating thresholds for range and azimuth, and , are set to 10 m and , respectively, and the threshold value for gating, , is set to 0.1. The threshold value for the track confirmation is set to . The track is deleted if the number of unassigned blobs exceeds 7 during 15 signal emissions.
VI-A Results on Simulated Data
In this section, we analyze the performance of our proposed detection and tracking method using simulated data. We simulate a single target with an initial position of m and m and a velocity of m/s and m/s.
An example of the detection and tracking results can be found in Fig. 3. The binary map is shown in Fig. 3a, where the yellow regions indicate the detected blobs. Fig. 3b shows the track map. The solid line indicates the track that belongs to the real target and the dotted lines represent the false tracks. Fig. 3c and Fig. 3d show the position and velocity estimation of the target, respectively. We observe a convergence of velocity after 32 emissions. Fig. 4 shows an example of the samples received from the emission after beamforming. The target is marked with the arrow, and the SCR is 3 dB.
VI-A1 Comparison with the Benchmarks
In this section, we compare the performance of our detection and tracking method with the Lo and Karoui benchmarks. For comparison, we measure the continuity of the estimated track. The track continuity is defined as the ratio between the duration of the track and the lifetime of the target. A track continuity equal to ’1’ would mean that the target is detected and continuously tracked for all signal emissions.
Results are shown for an SCR of 3 dB in Fig 5a and for an SCR of 5 dB in Fig 5b. For the latter figure, we note that the results for Lo are of less range than the other two schemes since it showed confirmed tracks only after a threshold corresponding to 4 false tracks. We observe that the results of Lo show the lowest track continuity. This is because this scheme does not merge closely spaced detection, which increases the false alarm rate and thus decreases the track continuity. We find that our method outperforms the benchmark for both tested SCR values and achieves a track continuity of more than 0.9 with less than two false traces. We notice the SCR has an impact on the tracking performance. For an SCR of 3 dB, our method achieves a track continuity of one with 7 false tracks, while, for SCR of 5 dB, the same tracking continuity is achieved with 2 false tracks.
VI-A2 Confirmation of Threshold Analysis
In this section, we analyze the continuity of tracks and the number of false tracks for different values of the track confirmation thresholds, . In this analysis, we set the detection threshold to 0.2 and the SCR to 3 dB. The results are shown in Fig. 6. When the confirmation threshold is small, more tracks are validated and the number of false tracks increases. However, the track associated with the true target is reported with less validation time. When the confirmation threshold is high, the validation time is also high and consequently the number of false tracks decreases, but the track associated with the true target is reported with a higher latency. As expected, increasing the confirmation threshold does not affect the continuity of the tracks, but rather reduces the number of false tracks.
VI-B Sea Trial Results
VI-B1 Experimental Setup
To further validate the performance of our proposed detection and tracking method, we now present the results of a designated sea experiment conducted in Šibenik, Croatia, in January 2024. The experiment comprised a self-built floater with a self-built simultaneously sampled four hydrophone planar array and a self-built single projector (see Fig. 7a). The hydrophones were about 25 cm apart and the projector was positioned about 40 cm above the hydrophones. The measurements of all elements to a reference point on the floater were fed into the multidimensional scaling (MDS) algorithm [33] to obtain a 3D estimate of the position of the array. A schematic representation of our system can be found in Fig. 8a, where the hydrophones are labeled ch1 to ch4. The positions of ch1, ch2, ch3 and ch4 are (-38.5,0), (0,39.5), (40.5,0) and (0,-40.5) respectively. The effective beam pattern of this constellation is shown in Fig. 8b.
In our experiment, the floater was set to maintain a depth of 20 m with 1 m overshoot and the water depth was approximately 30 m. The position of the floater was monitored by attaching it to a small surface buoy with a GPS logger (2 m accuracy). The sea was calm, the bathymetry was approximately flat and the seabed consisted mainly of muddy sediment. The sound speed profile was 1500 m/sec in the first 3 m, due to a freshwater layer from the Krka River, and was fixed at 1520 m/sec below this layer. The emitted signals were linear frequency modulated chirps in the frequency band of 31 kHz-41 kHz and a duration of 10 ms. A guard interval of 1 second between adjacent emissions was used to suppress inter-pulse-interference, while the maximum distance for creating the distance-angle 2D matrices was 500 m. The sampling rate was set to 96 k samples per second at 2 bytes per sample.
We experimented with 2 gilt-head seabream as target fish. The fish were released at the same time near the floater buoy and swam freely away from it. The fish were attached with a 20 m fishing line to a float equipped with a GPS logger (2 m accuracy) to obtain information about the actual fish location. Due to the length of the fishing line and the inaccuracies in the GPS position, the inaccuracy in the true position of the fish is at most 24 m. We confirm that ethical approval for this experiment was obtained from the Ruđer Bošković Institute, Croatia, and the experiments were conducted in accordance with the approved guidelines and EU ethical regulations. The methods used followed the ARRIVE guidelines (https://arriveguidelines.org).
VI-B2 Experiment Results
Fig. 9 shows the results for the tracks obtained during the three experiments. The location estimates for fish #1 and for fish #2 are shown in Fig. 9a and Fig. 9c respectively. The velocity estimates for fish #1 and for fish #2 are shown in Fig. 9b and Fig. 9d respectively. Comparing the figures, we observe a different swimming behavior for the two fish, with fish #2 swimming faster and more smoothly. The two fish we experimented with for Exp 3 swam in different directions, but at approximately the same speed.
We note that the results of our tracking method are close to the GPS ground truth data, where in both cases the error was below the maximum ambiguity. For fish #1, convergence of position was achieved after 45 seconds, and the error in velocity estimation was about 0.2 m/s in both coordinates. Convergence of the position of fish #2 was achieved after 10 seconds and the velocity error was 0.2 m/s and 0.1 m/s for the X and Y coordinates, respectively.
Fig. 10 introduces a track continuity comparison for the sea trial results between our method and the two explored benchmarks. The track was associated with the true fish in the sea trial by the GPS data. Regarding the false detection, besides the tagged fish, more valid targets could exist. Thus, with no ground truth information for the number of false tracks, we measure the track continuity for different detection thresholds by changing the target false alarm rate as defined in (8). The results confirm our observations from the simulations in Fig. 5. Low track continuity is observed for the ”Lo” benchmark, whereas our proposed approach show robuswtness to different choice of threshold.
VII Conclusion
We presented a detection and tracking method for underwater mobile targets. High acoustic reflections from clutter often increase the number of false alarms, which significantly affects the performance of underwater tracking. To overcome this challenge, we offered a track-before-detect mechanism that clusters sets of binary maps, and performs tracking on blobs rather that on individual detections. Towards an application of real-time analysis, we aimed for a low complexity tracking solution. Results on simulated data as well as on real data from a sea experiment reveal that our method effectively reduces the number of false tracks while keeping the track continuity high. Accurate tracking in location and velocity estimate is shown in the presence of low signal-to-clutter ratio that exists two benchmark schemes. Further work will extend this work for handling the spatial ambiguity of the array as well as to utilize the multiple arrivals reflected from the mobile target.
References
- [1] P. R. De La Torre, E. L. Smith, A. Sancheti, K. N. Salama, and M. L. Berumen, “isat: The mega-fauna acoustic tracking system,” in 2013 MTS/IEEE OCEANS - Bergen, 2013, pp. 1–6.
- [2] J. Park, G. Kim, J. Seok, and J. Hong, “Pulsed active sonar using generalized sinusoidal frequency modulation for high-speed underwater target detection and tracking,” IEEE Access, vol. 11, pp. 143 081–143 091, 2023.
- [3] J. Andreas, G. Beguš, M. M. Bronstein, R. Diamant, D. Delaney, S. Gero, S. Goldwasser, D. F. Gruber, S. de Haas, P. Malkin et al., “Toward understanding the communication in sperm whales,” Iscience, vol. 25, no. 6, 2022.
- [4] C. Rautureau, C. Goulon, and J. Guillard, “In situ ts detections using two generations of echo-sounder, ek60 and ek80: the continuity of fishery acoustic data in lakes,” Fisheries Research, vol. 249, p. 106237, 2022.
- [5] B. Ferguson, “Improved time-delay estimates of underwater acoustic signals using beamforming and prefiltering techniques,” IEEE Journal of Oceanic Engineering, vol. 14, no. 3, pp. 238–244, 1989.
- [6] E. Dubrovinskaya, V. Kebkal, O. Kebkal, K. Kebkal, and P. Casari, “Underwater localization via wideband direction-of-arrival estimation using acoustic arrays of arbitrary shape,” Sensors, vol. 20, no. 14, 2020.
- [7] D. Lerro and Y. Bar-Shalom, “Tracking with debiased consistent converted measurements versus ekf,” IEEE Transactions on Aerospace and Electronic Systems, vol. 29, pp. 1015–1022, 1993.
- [8] B. Kalyan and A. Balasuriya, “Sonar based automatic target detection scheme for underwater environments using cfar techniques: a comparative study,” in Proceedings of the 2004 International Symposium on Underwater Technology (IEEE Cat. No.04EX869), 2004, pp. 33–37.
- [9] R. Blum and S. Kassam, “Distributed cell-averaging cfar detection in dependent sensors,” IEEE Transactions on Information Theory, vol. 41, no. 2, pp. 513–518, 1995.
- [10] G. G. Acosta and S. A. Villar, “Accumulated ca–cfar process in 2-d for online object detection from sidescan sonar data,” IEEE Journal of Oceanic Engineering, vol. 40, no. 3, pp. 558–569, 2015.
- [11] H. Rohling, “Ordered statistic cfar technique - an overview,” in 2011 12th International Radar Symposium (IRS), 2011, pp. 631–638.
- [12] T. Zhou, J. Si, L. Wang, C. Xu, and X. Yu, “Automatic detection of underwater small targets using forward-looking sonar images,” IEEE Transactions on Geoscience and Remote Sensing, vol. 60, pp. 1–12, 2022.
- [13] D. Zhan, H. Zheng, and W. Xu, “Tracking control of autonomous underwater vehicles with acoustic localization and extended kalman filter,” Applied Sciences, vol. 11, no. 17, 2021.
- [14] Y. Bar-Shalom, T. Kirubarajan, and X. Lin, “Probabilistic data association techniques for target tracking with applications to sonar, radar and eo sensors,” IEEE Aerospace and Electronic Systems Magazine, vol. 20, no. 8, pp. 37–56, 2005.
- [15] D. Clark, B.-N. Vo, and J. Bell, “GM-PHD filter multitarget tracking in sonar images,” in Signal Processing, Sensor Fusion, and Target Recognition XV, I. Kadar, Ed., vol. 6235, International Society for Optics and Photonics. SPIE, 2006, p. 62350R.
- [16] T. Zhou, Y. Wang, L. Zhang, B. Chen, and X. Yu, “Underwater multitarget tracking method based on threshold segmentation,” IEEE Journal of Oceanic Engineering, vol. 48, no. 4, pp. 1255–1269, 2023.
- [17] T. Zhou, Y. Wang, B. Chen, J. Zhu, and X. Yu, “Underwater multitarget tracking with sonar images using thresholded sequential monte carlo probability hypothesis density algorithm,” IEEE Geoscience and Remote Sensing Letters, vol. 19, pp. 1–5, 2022.
- [18] M. Li, H. Ji, X. Wang, L. Weng, and Z. Gong, “Underwater object detection and tracking based on multi-beam sonar image processing,” in 2013 IEEE International Conference on Robotics and Biomimetics (ROBIO), 2013, pp. 1071–1076.
- [19] R. Georgescu and P. Willett, “The gm-cphd tracker applied to real and realistic multistatic sonar data sets,” IEEE Journal of Oceanic Engineering, vol. 37, no. 2, pp. 220–235, 2012.
- [20] R. Georgescu, S. Schoenecker, and P. Willett, “Gm-cphd and mlpda applied to the seabar07 and tno-blind multi-static sonar data,” in 2009 12th International Conference on Information Fusion, 2009, pp. 1851–1858.
- [21] W. R. Blanding, P. K. Willett, Y. Bar-Shalom, and R. S. Lynch, “Directed subspace search ml-pda with application to active sonar tracking,” IEEE Transactions on Aerospace and Electronic Systems, vol. 44, no. 1, pp. 201–216, 2008.
- [22] Y. Katz and M. Groper, “On the development of a mid-depth Lagrangian float for littoral deployment,” Journal of Marine Science and Engineering, vol. 10, no. 12, p. 2030, 2022.
- [23] “[online] available:,” https://drive.google.com/file/d/1ZfhY855AR0EVQHAloQH-CNKnPj-KZr1B/view?usp=sharing, accessed: 2023-09-10.
- [24] I. Quidu, L. Jaulin, A. Bertholom, and Y. Dupas, “Robust multitarget tracking in forward-looking sonar image sequences using navigational data,” IEEE Journal of Oceanic Engineering, vol. 37, no. 3, pp. 417–430, 2012.
- [25] A. Madanayake, C. Wijenayake, S. Wijayaratna, R. Acosta, and S. I. Hariharan, “2-d-iir time-delay-sum linear aperture arrays,” IEEE Antennas and Wireless Propagation Letters, vol. 13, pp. 591–594, 2014.
- [26] P. Grieve, “The optimum constant false alarm probability detector for relatively coherent multichannel signals in gaussian noise of unknown power,” IEEE Transactions on Information Theory, vol. 23, no. 6, pp. 708–721, 1977.
- [27] H. Samet and M. Tamminen, “Efficient component labeling of images of arbitrary dimension represented by linear bintrees,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 10, no. 4, pp. 579–586, 1988.
- [28] D. P. Bertsekas, Network Optimization. MA, Belmont:Athena Scientific, 1998.
- [29] M. Raitoharju and R. Piché, “On computational complexity reduction methods for kalman filter extensions,” IEEE Aerospace and Electronic Systems Magazine, vol. 34, no. 10, pp. 2–19, 2019.
- [30] S. Cong and L. Hong, “Computational complexity analysis for multiple hypothesis tracking,” in Proceedings of the 36th IEEE Conference on Decision and Control, vol. 5, 1997, pp. 4991–4996 vol.5.
- [31] K. Lo and B. Ferguson, “Automatic detection and tracking of a small surface watercraft in shallow water using a high-frequency active sonar,” IEEE Transactions on Aerospace and Electronic Systems, vol. 40, no. 4, pp. 1377–1388, 2004.
- [32] I. Karoui, I. Quidu, and M. Legris, “Automatic sea-surface obstacle detection and tracking in forward-looking sonar image sequences,” IEEE Transactions on Geoscience and Remote Sensing, vol. 53, no. 8, pp. 4661–4669, 2015.
- [33] F. Wickelmaier, “An introduction to MDS,” Sound Quality Research Unit, Aalborg University, Denmark, vol. 46, no. 5, pp. 1–26, 2003.
No comments:
Post a Comment