Artificial Intelligence

Ceva enhances Edge AI with TinyML NPUs for AIoT devices

25th June 2024

CEVA Inc

Sheryl Miles

Ceva, the pioneering licensor of silicon and software IP that enables Smart Edge devices to connect, sense, and infer data more reliably and efficiently, announces that it has extended its Ceva-NeuPro family of Edge AI NPUs with the introduction of Ceva-NeuPro-Nano NPUs.

These highly efficient, self-sufficient NPUs deliver the power, performance, and cost efficiencies that semiconductor companies and OEMs need to integrate TinyML models into their SoCs for consumer, industrial, and general-purpose AIoT products.

TinyML refers to the deployment of machine learning models on low-power, resource-constrained devices to bring the power of AI to the Internet of Things (IoT). Driven by the increasing demand for efficient and specialised AI solutions in IoT devices, the market for TinyML is growing rapidly. According to research firm ABI Research, by 2030 over 40% of TinyML shipments will be powered by dedicated TinyML hardware rather than all-purpose MCUs. By addressing the specific performance challenges of TinyML, the Ceva-NeuPro-Nano NPUs aim to make AI ubiquitous, economical, and practical for a wide range of use cases, spanning voice, vision, predictive maintenance, and health sensing in consumer and industrial IoT applications.

The new Ceva-NeuPro-Nano Embedded AI NPU architecture is fully programmable and efficiently executes neural networks, feature extraction, control code, and DSP code. It supports the most advanced machine learning data types and operators, including native transformer computation, sparsity acceleration, and fast quantisation. According to Ceva, this optimised, self-sufficient architecture enables Ceva-NeuPro-Nano NPUs to deliver better power efficiency, a smaller silicon footprint, and higher performance than the existing processor solutions used for TinyML workloads, which combine a CPU or DSP with an AI accelerator. Furthermore, Ceva-NetSqueeze AI compression technology processes compressed model weights directly, without an intermediate decompression stage. This enables the Ceva-NeuPro-Nano NPUs to achieve up to 80% memory footprint reduction, addressing a key bottleneck that inhibits the broad adoption of AIoT processors today.
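To put the "up to 80%" figure in concrete terms, the short Python sketch below works through the arithmetic for a hypothetical model. It assumes the reduction applies to weight storage and that the best case is reached; the 500 KiB model size and the function name are illustrative only and do not come from Ceva. It says nothing about how Ceva-NetSqueeze works internally.

```python
def compressed_footprint_bytes(weight_bytes: int, reduction_percent: int = 80) -> int:
    """Return the weight storage remaining after a percentage footprint reduction.

    Integer arithmetic is used so the result is exact for whole-byte inputs.
    """
    if not 0 <= reduction_percent < 100:
        raise ValueError("reduction_percent must be in the range 0..99")
    return weight_bytes * (100 - reduction_percent) // 100


# Hypothetical int8 model with 500 KiB of weights (size chosen for illustration).
model_bytes = 500 * 1024
best_case = compressed_footprint_bytes(model_bytes)
print(f"{model_bytes} bytes of weights -> {best_case} bytes at the quoted best case")
```

Because the quoted figure is an upper bound, the saving for a given model may be smaller; passing a lower `reduction_percent` shows how sensitive the memory budget is to that assumption.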

"Ceva-NeuPro-Nano opens exciting opportunities for companies to integrate TinyML applications into low-power IoT SoCs and MCUs and builds on our strategy to empower smart edge devices with advanced connectivity, sensing, and inference capabilities. The Ceva-NeuPro-Nano family of NPUs enables more companies to bring AI to the very edge, resulting in intelligent IoT devices with advanced feature sets that capture more value for our customers," said Chad Lucien, Vice President and General Manager of the Sensors and Audio Business Unit at Ceva. "By leveraging our industry-leading position in wireless IoT connectivity and strong expertise in audio and vision sensing, we are uniquely positioned to help our customers unlock the potential of TinyML to enable innovative solutions that enhance user experiences, improve efficiencies, and contribute to a smarter, more connected world."

According to Paul Schell, Industry Analyst at ABI Research: "Ceva-NeuPro-Nano is a compelling solution for on-device AI in smart edge IoT devices. It addresses the power, performance, and cost requirements to enable always-on use-cases on battery-operated devices integrating voice, vision, and sensing use cases across a wide array of end markets. From TWS earbuds, headsets, wearables, and smart speakers to industrial sensors, smart appliances, home automation devices, cameras, and more, Ceva-NeuPro-Nano enables TinyML in energy constrained AIoT devices."

The Ceva-NeuPro-Nano NPU is available in two configurations – the Ceva-NPN32 with 32 int8 MACs, and the Ceva-NPN64 with 64 int8 MACs, both of which benefit from Ceva-NetSqueeze for direct processing of compressed model weights. The Ceva-NPN32 is highly optimised for most TinyML workloads targeting voice, audio, object detection, and anomaly detection use cases. The Ceva-NPN64 provides 2x performance acceleration using weight sparsity, greater memory bandwidth, more MACs, and support for four-bit weights to deliver enhanced performance for more complex on-device AI use cases such as object classification, face detection, speech recognition, health monitoring, and others.
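As a rough illustration of what the two MAC counts mean, the following Python sketch estimates the ideal cycle count and latency for a layer with a given number of multiply-accumulate operations. The MACs-per-cycle values come from the article; the layer size and the 100 MHz clock are hypothetical, and the model assumes perfect MAC utilisation with no memory stalls or sparsity gains, so real figures will differ.

```python
import math

# MACs per cycle for each configuration, as stated in the article.
MACS_PER_CYCLE = {"Ceva-NPN32": 32, "Ceva-NPN64": 64}


def ideal_cycles(total_macs: int, config: str) -> int:
    """Lower bound on cycles, assuming every MAC unit is busy on every cycle."""
    return math.ceil(total_macs / MACS_PER_CYCLE[config])


def ideal_latency_ms(total_macs: int, config: str, clock_hz: float) -> float:
    """Convert the ideal cycle count to milliseconds at a given clock rate."""
    return ideal_cycles(total_macs, config) / clock_hz * 1e3


# Hypothetical workload and clock, chosen only to make the arithmetic concrete.
layer_macs = 1_000_000
clock_hz = 100_000_000

for name in MACS_PER_CYCLE:
    cycles = ideal_cycles(layer_macs, name)
    latency = ideal_latency_ms(layer_macs, name, clock_hz)
    print(f"{name}: {cycles} cycles, {latency:.4f} ms")
```

Under these assumptions, doubling the MAC count halves the ideal cycle count; the additional gains the article attributes to weight sparsity and four-bit weights on the Ceva-NPN64 are not modelled here.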

The NPUs are delivered with a complete AI SDK – Ceva-NeuPro Studio – which is a unified AI stack that delivers a common set of tools across the entire Ceva-NeuPro NPU family, supporting open AI frameworks including TensorFlow Lite for Microcontrollers (TFLM) and microTVM (µTVM).

Ceva-NeuPro-Nano key features

Flexible and scalable NPU architecture

Fully programmable to efficiently execute neural networks, feature extraction, control code, and DSP code
Scalable performance by design to meet a wide range of use cases
MAC configurations with up to 64 int8 MACs per cycle
Future-proof architecture that supports the most advanced ML data types and operators: four-bit to 32-bit integer support and native transformer computation
High ML performance across use cases through advanced mechanisms: sparsity acceleration, acceleration of non-linear activation types, and fast quantisation
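For readers unfamiliar with two of the mechanisms listed above, the Python sketch below shows the general ideas behind int8 quantisation and weight-sparsity skipping. It is a textbook illustration (symmetric per-tensor quantisation and a dot product that skips zero weights), not Ceva's implementation; the function names and sample values are hypothetical.

```python
def quantize_int8(values):
    """Symmetric per-tensor quantisation: map floats to integer codes in [-127, 127].

    Returns the list of codes and the scale needed to recover approximate floats.
    """
    max_abs = max((abs(v) for v in values), default=0.0)
    scale = max_abs / 127 if max_abs > 0 else 1.0
    codes = [max(-127, min(127, round(v / scale))) for v in values]
    return codes, scale


def sparse_dot(weights_q, activations_q):
    """Integer dot product that skips zero weights.

    Returns the accumulated sum and the number of MACs actually executed.
    """
    acc = 0
    macs_executed = 0
    for w, a in zip(weights_q, activations_q):
        if w == 0:
            continue  # a zero weight contributes nothing, so the MAC is skipped
        acc += w * a
        macs_executed += 1
    return acc, macs_executed


# Illustrative values only: one of the four weights is zero.
weights, scale = quantize_int8([0.0, 1.0, -1.0, 0.25])
total, macs = sparse_dot(weights, [1, 2, 3, 4])
print(weights, total, macs)
```

The skipped multiply-accumulates are where the compute and energy savings of sparsity come from in principle: the more zero weights a model has, the fewer operations remain to execute.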

Edge NPU with ultra-low memory requirements

Highly efficient, single-core design for NN compute, feature extraction, control code, and DSP code eliminates the need for a companion MCU for these computationally intensive tasks
Up to 80% memory footprint reduction via Ceva-NetSqueeze, which directly processes compressed model weights without the need for an intermediate decompression stage

Ultra-low energy achieved through innovative energy optimisation techniques

Automatic on-the-fly energy tuning
Dramatic energy and bandwidth reduction by distilling computations using weight-sparsity acceleration

Complete, simple-to-use AI SDK

Ceva-NeuPro Studio provides a unified AI stack, with an easy click-and-run experience, for all Ceva-NeuPro NPUs, from the new Ceva-NeuPro-Nano to the powerful Ceva-NeuPro-M
Fast time to market by accelerating software development and deployment
Optimised to work seamlessly with leading, open AI inference frameworks including TFLM and µTVM
Model Zoo of pretrained and optimised TinyML models covering voice, vision, and sensing use cases
Flexible to adapt to new models, applications, and market needs
Comprehensive portfolio of optimised runtime libraries and off-the-shelf application-specific software

Availability

Ceva-NeuPro-Nano NPUs are now available for licensing.
