Accelerating Edge AI Inference with Vitis AI NPU on iWave’s Versal AI Edge Boards

Edge Artificial Intelligence (AI) is rapidly transforming the way embedded systems process and respond to real-time data. To meet this growing demand for intelligence at the edge, iWave Global successfully integrated Vitis AI NPU (Neural Processing Unit) on the Versal™ AI Edge VE2302 System on Module (SoM) and its evaluation platform.

This integration demonstrates iWave’s commitment to empowering developers with high-performance, power-efficient, and ultra-low latency AI inference solutions. Built on the iW-RainboW-G57D SoM, the platform accelerates complex AI workloads such as real-time object detection delivering edge intelligence closer to where data is generated.

Vitis AI: Bringing FPGA Acceleration to AI Workloads

Vitis AI is AMD’s unified AI inference software stack designed for FPGA-based hardware platforms, including the Versal Adaptive SoCs. It enables seamless migration of AI models trained for GPUs to FPGA architectures without major rework.

Supporting popular deep learning frameworks such as TensorFlow, PyTorch, ONNX, and Caffe, Vitis AI allows developers to deploy trained neural networks efficiently on hardware optimized for parallelism and low power.

The result is an AI acceleration pipeline that merges FPGA flexibility with the performance of specialized neural accelerators—ideal for latency-critical edge applications.

System Overview and Demo Setup

The live demonstration of the Vitis AI NPU running on iWave’s VE2302 Versal AI Edge Evaluation Kit showcases real-time object detection capabilities.

Demo Components:

  • iWave VE2302 Versal AI Edge Development Kit
  • HDMI Display
  • USB Webcam for live video input
  • 12V/5A Power Supply and debug cable

Dataflow Process for AI Inference

The AI inference workflow on iW-RainboW-G57D, leverages the VART X API modules and structured for real-time processing. A script is created to run the NPU application on the SoM. This script captures video using the USB camera and converts it to NV12 format. The converted video is then processed by NPU and detected objects are highlighted on the HDMI display.

Dataflow process is as follows:

Live Demo in Action

Experience real-time object detection on VE2302 Versal AI Edge SoM! Watch the demo as iWave experts showcase AI inference using NPU IP for precise and efficient object detection.

Why Versal AI Edge for Edge AI?

The Versal AI Edge family from AMD combines programmable logic, AI Engines, and a heterogeneous processing system in a single chip. This architecture enables sensor fusion, vision analytics, and AI inference on one platform while maintaining deterministic real-time control.

Key highlights include:

  • AI Engines & DSP Engines for vision, radar, and LiDAR workloads
  • Native MIPI support for up to 8MP resolution
  • Single and half-precision floating-point support for diverse AI and signal processing tasks

The combination of AI compute with programmable logic provides a scalable foundation for a wide range of edge AI use cases—from robotics to industrial automation.

Features of iWave’s Versal AI Edge System on Module

  • Compatible with VE2302 / VE2202 / VE2102 / VE2002 devices
  • Dual-core Arm Cortex-A72 and Cortex-R5F processors
  • Up to 328K logic cells and 150K LUTs
  • 8 GTYP transceivers at 32 Gbps
  • Up to 8GB LPDDR4 RAM and 128GB eMMC storage
  • Dual 240-pin high-speed connectors for expansion
  • Connectivity: PCIe Gen4, Ethernet, USB 3.0

Measuring compact yet robust, the SoM supports 40G Ethernet, MIPI camera interfaces, and 122 configurable I/O, ensuring seamless integration into edge AI systems that demand high-speed data movement and real-time inference.

Real-World Applications

The Vitis AI NPU on iWave’s Versal AI Edge SoM unlocks new opportunities across a broad spectrum of industries:

  • Smart Surveillance: Real-time object and facial recognition for intelligent monitoring
  • Automotive & ADAS: High-speed detection for traffic signs and pedestrians
  • Industrial Automation: On-device analytics for defect detection and predictive maintenance
  • Healthcare: AI-driven diagnostic imaging and patient monitoring
  • Smart Retail: Automated checkout and customer analytics
  • Smart Cities: Adaptive traffic management using live video analytics

The solution merges high-throughput AI processing with power efficiency delivering precise, real-time performance in environments where milliseconds matter.

Empowering Developers with Edge AI Tools

iWave provides a full suite of software tools, libraries, and board support packages (BSPs) to simplify the AI development cycle on Versal platforms. With support for Vitis AI, OpenCV, and Linux-based development, engineers can deploy, test, and optimize AI workloads faster.

Backed by comprehensive documentation, long-term availability (10+ years), and ODM design services, iWave ensures that customers can scale from prototype to production with confidence.

iWave Global is a trusted engineering solutions provider specializing in FPGA-based System on Modules (SoMs) and ODM design services for industrial, automotive, medical, and defense markets. Leveraging decades of embedded expertise, iWave enables innovation at the edge through reliable, scalable, and high-performance hardware platforms.

To explore how the Versal AI Edge SoM can power your next AI innovation, visit www.iwave-global.com or contact mktg@iwave-global.com

작성자 정보

Image of Tawfeeq Ahmad

Tawfeeq Ahmad는 iWave Systems Technologies Pvt. Ltd에서 제품 마케팅 부서를 이끌고 있습니다. 전자 부품에 대한 열정과 마케팅 및 영업에 대한 관심을 품고 있는 Tawfeeq는 iWave의 다양한 내장 전문 지식을 통해 전 세계 조직이 제품 개발에서 개발 주기와 효율성을 향상시킬 수 있도록 지원하는 것을 목표로 합니다. 전자 및 통신 분야에서 학사 학위를 받고 마케팅 분야에서 MBA를 취득한 Tawfeeq는 iWave Systems가 제품 엔지니어링 조직으로서 글로벌 리더 반열에 오르는 것을 목표로 합니다.

More posts by Tawfeeq Ahmad
 TechForum

Have questions or comments? Continue the conversation on TechForum, Digi-Key's online community and technical resource.

Visit TechForum