Aerospace Electronic and Defense Systems: A high-performance universal miniature SAR/GMTI and space-borne imaging radar system

high-performance universal miniature radar system | IEEE Conference Publication | IEEE Xplore

T. Jin, H. -X. Wang and H. -W. Liu, "A high-performance universal miniature radar system," 2016 CIE International Conference on Radar (RADAR), Guangzhou, China, 2016, pp. 1-5, doi: 10.1109/RADAR.2016.8059594.

Abstract: This paper proposes the design and realization of a high-performance universal miniature radar system. It presents a well solution to the main challenges of the radar system including extremely huge data flow and calculating burden, the traditional custom-built pattern of radar system, and the strict limitations for the size, weight and power consumption of the airborne or space-borne real-time Synthetic Aperture Radar(SAR) signal processing systems. The system has showed the virtues of standardization, modularization, stability, reconstruction, good adaptability due to the combined application of the distributed parallel architecture, latest interconnection standard and processor. By the successful application cases of airborne SAR/GMTI and space-borne imaging, its high-performance universality and miniature property could be adequately proved.

Published in: 2016 CIE International Conference on Radar (RADAR)

Date of Conference: 10-13 October 2016

Date Added to IEEE Xplore: 05 October 2017

ISBN Information:

DOI: 10.1109/RADAR.2016.8059594

Conference Location: Guangzhou, China

Authors

Ting Jin, Hong-Xian Wang, Hong-Wei Liu; National Laboratory of Radar Signal Processing, Xi'an University, Xi'an, China

The senior author, Hongwei Liu (Senior Member, IEEE) received the M.S. and Ph.D. degrees in electronic engineering from Xidian University, Xi’an, China, in 1995 and 1999, respectively.,He worked with the National Laboratory of Radar Signal Processing, Xidian University.

From 2001 to 2002, he was a Visiting Scholar at the Department of Electrical and Computer Engineering, Duke University, Durham, NC, USA.

He is currently a Professor with the National Laboratory of Radar Signal Processing, Xidian University. His research interests include radar automatic target recognition, radar signal processing, and adaptive signal processing.(Based on document published on 7 December 2023).

Summary

The paper proposes the design and realization of a high-performance universal miniature radar system. The key features of the system include:

A distributed parallel processing architecture using DSP+FPGA structure to handle the extremely huge data flow and computational burden of radar signal processing.
Modularity and reconfigurability achieved through a standardized 3U VPX form factor and interconnection framework, allowing easy updating and reconstruction of the system.
Miniaturization to meet size, weight, and power constraints for airborne and space-borne applications through careful board-level design.
A multi-layer interconnection structure using high-speed serial networks (RapidIO, PCIe), synchronization timing buses, and control buses to facilitate data transfer and system coordination.
Successful application in real-time airborne SAR imaging and space-borne SAR imaging, demonstrating its high performance, universality, and miniature properties.

The paper describes the system architecture, interconnection theory, signal processing units, data flows, imaging algorithms, and resource utilization details. The proposed system offers a flexible and scalable solution for diverse radar applications with demanding performance requirements.

data flow and computational burden

The paper discusses the extremely huge data flow and computational burden associated with radar signal processing, especially for real-time applications like synthetic aperture radar (SAR) imaging. It highlights these challenges as the main motivation for developing a high-performance distributed parallel processing architecture.

Regarding the data flow, the paper mentions that in real-time systems, the speed of signal processing must be faster than the signal acquisition rate to ensure that all continuous echo data can be processed. It gives an example that in a large-scale spotlight SAR mode, each processing node finishes processing one real-time image of 32K*16K (4GB) complex points within 23.89 seconds.

As for the computational burden, the paper states that although chip technology and processing power have increased, a single chip still cannot satisfy the operation requirements of 10 GFLOPS or even 100 GFLOPS needed in real-time imaging cases. Therefore, parallel processing becomes imperative to achieve the required high performance.

The proposed system employs a DSP+FPGA architecture, with each DSP (TMS320C6678) providing:

160 GFLOPS of peak performance when all 8 cores are fully utilized.
Eight TMS320C66x DSP Core Subsystems at 1.00 GHz and 1.25GHz
320 GMAC/160 GFLOP @ 1.25GHz
32KB L1P, 32KB L1D, 512KB L2 Per Core
4MB Shared L2

In summary, the huge data volumes (multiple gigabytes) generated from radar echoes and the demand for gigaflop-scale processing capabilities, especially in real-time SAR imaging modes, necessitate the high-performance distributed parallel processing approach taken in this radar system design.

high-performance distributed parallel processing architecture

The proposed high-performance distributed parallel processing architecture for the miniature radar system is based on a combination of Digital Signal Processor (DSP) and Field Programmable Gate Array (FPGA) processors, interconnected through a multi-layer network structure.

DSP+FPGA Structure:
- The system employs a DSP+FPGA structure to handle signal processing tasks.
- FPGAs (Xilinx Virtex-6 XC6VLX240T) are used for pre-processing and relatively simple but computationally intensive operations like multiplication, accumulation, and FFT.
- DSPs (Texas Instruments TMS320C6678) handle more complex arithmetic operations and high-level processing.
- Each signal processing board contains 2 DSPs, each with 2GB DDR3 memory.
Distributed Parallel Structure:
- The system adopts a distributed parallel structure where each processing node has its own physically distributed memory.
- Multiple processing nodes are combined through a high-bandwidth, low-latency, customized communication network to form a larger processing scale.
- This structure allows for coarse-grained processing and flexible system framework, enabling easy scalability by adding or removing processors.
Virtual Single Node:
- To meet real-time requirements and handle the huge amount of echo data in high-resolution SAR modes, the system implements a virtual single node concept.
- Each signal processing board is considered a virtual single node, consisting of two DSPs connected by a high-speed Hyperlink interface (12.5 Gbps).
- The virtual nodes process data frames independently in a pipelined parallel manner, while each DSP within a node processes a portion of the data frame in parallel.
Multi-Layer Interconnection:
- The system features a multi-layer interconnection network to facilitate data transfer and coordination among various modules.
- High-speed serial networks like RapidIO (6.25 Gbps) and PCIe (5 Gbps) are used for transferring raw data, pre-processed data, result data, and control instructions.
- A synchronization timing network ensures strict synchronization among different parts of the system.
- A control bus is used for low-latency, real-time control signal transfer.

This distributed parallel processing architecture, combined with the multi-layer interconnection network, enables the system to handle the demanding computational requirements of real-time radar signal processing while maintaining flexibility and scalability.

Node Capacity

Based on the information provided in the paper, the capacity per node can be analyzed from two aspects: the processing capability and the storage capacity.

Processing Capability per Node:
- Each signal processing board, which is considered a virtual single node, consists of 2 DSP chips (TMS320C6678).
- Each TMS320C6678 DSP has 8 cores running at up to 1.25 GHz, providing a peak performance of 320 GMACS (Giga Multiply-Accumulate operations per second) or 160 GFLOPS (Giga Floating-Point Operations per Second).
- With 2 DSPs per node, the total processing capability of a single node is 640 GMACS or 320 GFLOPS.
Storage Capacity per Node:
- Each DSP on the signal processing board is equipped with 2 GB of DDR3 memory.
- With 2 DSPs per node, the total storage capacity of a single node is 4 GB.

The paper also provides examples of the processing capacity per node in different SAR imaging modes:

In the large-scale spotlight mode, each processing node finishes one real-time image of 32K*16K (4 GB) complex points within 23.89 seconds.
In the detailed strip mode, each processing node finishes a dual-polarization real-time image, with each DSP responsible for one polarization image (16K*16K complex points, 4 GB total for the dual-polarization image) within 23.76 seconds.

It's important to note that the system is designed to be scalable, allowing for the addition of more processing nodes to handle more complex algorithms and imaging modes as needed.

Scalability to Radar Requirements

To assess how well the proposed architecture meets the radar requirements and its efficiency in combining node capacity, we need to compare the system's capabilities with the computational demands of the radar signal processing tasks.

Meeting Radar Requirements:
- The paper mentions that real-time radar signal processing, especially in SAR imaging cases, requires operation capabilities of 10 GFLOPS or even 100 GFLOPS.
- A single node in the proposed system provides a peak performance of 320 GFLOPS, which is more than sufficient to meet the mentioned requirements.
- The system's ability to process large-scale spotlight mode images (32K16K complex points) and detailed strip mode dual-polarization images (16K16K complex points per polarization) within around 24 seconds demonstrates its capability to handle demanding radar signal processing tasks in real-time.
Efficiency in Combining Node Capacity:
- The distributed parallel processing architecture allows for efficient combination of node capacities through the multi-layer interconnection network.
- The high-speed serial networks (RapidIO at 6.25 Gbps and PCIe at 5 Gbps) provide sufficient bandwidth for data transfer among processing nodes and storage modules, minimizing communication bottlenecks.
- The virtual single node concept, where each node consists of two DSPs connected by a high-speed Hyperlink interface (12.5 Gbps), enables efficient parallel processing within a node.
- The pipelined parallel processing approach, where each node processes data frames independently, further enhances the overall system efficiency.
Scalability and Resource Utilization:
- The modular design and distributed architecture allow for easy scalability by adding more processing nodes to meet increasing computational demands.
- The paper mentions that the processing capability, AD/DA working speed, I/O module usage, and high-speed bus bandwidth are not fully utilized in the presented application cases, indicating that the system has the potential to handle more complex algorithms and imaging modes.

In summary, the proposed high-performance distributed parallel processing architecture efficiently combines node capacities to meet and exceed the computational requirements of radar signal processing tasks. The system's scalability and resource utilization efficiency make it adaptable to various radar applications with demanding performance needs.

SWAP

The paper does not provide explicit details about the size, weight, and power (SWaP) requirements per node or for the full radar system. However, it does mention that miniaturization is a key design consideration, especially for airborne and space-borne applications where SWaP constraints are critical.

Size and Form Factor:
- The entire system hardware boards use the 3U VPX standard, which specifies a board size of 100 mm by 160 mm.
- The use of this compact form factor contributes to the system's miniaturization goals.
- However, the exact dimensions of the complete system, including the housing and cooling components, are not specified.
Weight:
- The paper does not provide any information about the weight of the individual nodes or the complete radar system.
- However, it mentions that the system is designed to meet the strict limitations on weight for airborne and space-borne applications.
Power Requirements:
- The power subsystem is mentioned as providing stable, configurable, and multiple power supplies to the system, allowing for easy adjustment through software programming according to different application cases.
- However, the paper does not specify the actual power consumption per node or the total power requirements for the full radar system.

VPX (Virtual Path Cross-Connect), also known as VITA 46, is a set of standards for connecting components of a computer (known as a computer bus), commonly used by defense contractors. Some are ANSI standards such as ANSI/VITA 46.0–2019. VPX provides VMEbus-based systems with support for switched fabrics over a new high speed connector. Defined by the VMEbus International Trade Association (VITA) working group starting in 2003, it was first demonstrated in 2004, and became an ANSI standard in 2007. The VPX standard was updated in 2013 and 2019.^[5]Technologies in VPX include:

Both 3U and 6U formats
New 7-row high speed connector rated up to 6.25 Gbit/s
Choice of high speed serial fabrics
PMC, FMC (VITA 57), and XMC (VITA 42) mezzanines
Hybrid backplanes to accommodate VME64, VME320 VXS, and VPX boards
VPX - bus to bus bridges

The 3U VPX form factor is compact and extremely well suited for avionics, including UAVs, shipboard, satellite, and airborne radar and signal intelligence applications. 3U VPX dimensions are 100 mm height in a 5.25 in (133.35 mm) enclosure and 6U VPX dimensions are 233.35 mm height in a 10.5 in (266.70 mm) high enclosure.

SECTION I. Introduction

With the enhanced quality and widespread application of the radar system including geological mapping, marine research, military surveillance etc., higher and stricter requirements have been put forward.

High-performance radar system is urgently needed by the large scale of data flow and calculation burden, especially in the real-time signal processing cases. The selection of chips, the design of processing structure and the realization of interconnection framework would all directly influence the system performance.

To achieve the universality, one is breaking the bondage of traditional mode that the design of radar system is subject to the algorithm. The other is building the universal radar system to lower the design cost, cycle. From the software aspect, universality means providing a hardware platform, on which diverse arithmetic complexity and different data granularity could perform well. From the hardware aspect, by the way of modularized design, universality could be obtained. Namely, we could design and optimize every unit of radar system such as the signal processing part, AD/DA part, storage part, respectively. Then according to the different function characteristics and design ideas, diverse radar system could be finished by extending, reconstructing or updating these modularized units. The modularized design is benefit for system universality, extension, flexibility and reconstruction, especially for saving the cost and cycle of design substantially.

Limited by requirements on the weight, volume and power consumption of the special application platform such as the space-borne, airborne, especially the UAV(unmanned Aerial Vehicle), miniaturization is necessary. Therefore, the detailed technology at board level such as the board layout or PCB routing need to be specially designed.

Dealing with the issues mentioned above, we discuss the method of designing and realizing a kind of high-performance universal miniature radar system in this paper.

SECTION II.Theory Analysis of System Structure

A. Module of Parallel Structure

Although the chip technology and processing power have increasingly enhanced, single chip still cannot satisfy the operation requirements of 10GFLOPS or even 100GFLOPS in the real-time imaging cases. Thus the parallel processing would be imperative for the sake of the high-performance. The parallel processing structure, which is mainly embodied in the chip-level and system-level parallelism, directly decides the performance of the system. The most common two kinds of the parallel processing are shown as followed. (P(processor), M(Memory))

Fig. 1.Shared bus sturcture&distrubuted bus sturcture

Tuesday, April 9, 2024

A high-performance universal miniature SAR/GMTI and space-borne imaging radar system

Authors

Summary

data flow and computational burden

high-performance distributed parallel processing architecture

Node Capacity

Scalability to Radar Requirements

SWAP

SECTION I. Introduction

SECTION II.Theory Analysis of System Structure

A. Module of Parallel Structure

B. Interconnection Structure

SECTION III. System Structure

A. AD/DA High Frenquency Module

B. Storage Subsystem

C. Signal Processing Unit

D. Multi-Layer Interconnectin

E. Power Subsystem

F. Display and Console Software

SECTION IV. Other Design

A. Heat Dissipation Design

B. Assistant Debugging Software

SECTION V. Aplication

A. Imaging Alogithm

B. Data Flow

C. Resources Utilization

SECTION VI.Conclusion

ACKNOWLEDGMENT

Introduction

Theory Analysis of System Structure

A. Module of Parallel Structure

B. Interconnection Structure

System Structure

A. AD/DA High Frenquency Module

B. Storage Subsystem

C. Signal Processing Unit

D. Multi-Layer Interconnectin

E. Power Subsystem

F. Display and Console Software

Other Design

A. Heat Dissipation Design

B. Assistant Debugging Software

Apllication

A. Imaging Alogithm

B. Data Flow

C. Rescoures Utilization

Conclusion

ACKNOWLEDGMENT

No comments:

Post a Comment

Satellites Get Smarter at Spotting Ground Movement—With a Little Help From AI