DSP, Micros & Memory


Scaling ML processing with external AI accelerators

23 November 2022 DSP, Micros & Memory AI & ML

At NXP, we craft our i.MX applications processors and our i.MXX RT crossover microcontroller (MCU) portfolios and families of devices to cover a very wide spectrum of market needs. From verticals needing very low-power dissipation to other end products needing a complex heterogeneous compute platform with multiple CPUs, 2D-3D GPUs, DSPs and NPU machine learning accelerators, our product teams must solve a multi-dimensional optimisation problem. The ultimate goal is to deliver all the compute power and connectivity your application and products need, with minimal unused features so that the area footprint, power dissipation, and cost requirements of the devices are also met.

When talking about machine learning, there are some end uses such as autonomous vehicles and natural language processing that push the compute requirements for edge devices to limits of tens and hundreds of tera operations per second (TOPs). With more emphasis on developing efficient ML models specifically for the edge, and using techniques such as quantisation and pruning, many of the edge machine learning applications fit in the

Giga-Ops to low single digit TOPs range of ML compute performance. NXP solutions natively cover this large portion of AI processing needs.

The software investment dominates the hardware choice these days, especially when considering roadmaps and multiple generation of products. There is a clear benefit to choosing the same or similar applications processor when developing a range of end products, so that the development effort translates to higher maturity and quality, with reuse. Different market segments require different performance on some portion of the application, but the fundamentals remain unchanged between versions and tiers of products. Choosing a scalable processor family such as i.MX applications processors gives the developer flexibility in advanced features and performance range, while simultaneously providing a common base compute architecture and feature set to leverage across both the portfolio and common software enablement elements. NXP provides GStreamer and NNStreamer frameworks to simplify deployment of vision applications with ML.X GStreamer is used as a framework for creating streaming media applications, abstracting the hardware layer to allow the use of any i.MX SoC, without having to change the underlying vision pipeline software.

Application requirements and markets evolve even after a product is launched in the market. So, what does one do when more is still needed from the selected applications processor? Going back to the selection process and looking for a higher performance processor is usually not a preferred option. Adding another device to provide additional acceleration when needed is a possible path, especially with high-speed high bandwidth, low latency chip-to-chip connectivity options such as PCIe. This is where NXP’s ecosystem partners with dedicated ML accelerator chips can help.

Kinara is such an NXP ecosystem partner that develops the Ara-1 Edge AI processors for dedicated ML acceleration. The Gstreamer- and NNStreamer-based vision pipeline support offered by NXP, and the set of Kinara-developed Gstreamer compatible plugins, make it seamless to integrate Ara-1X into NXP inference pipelines and to also easily migrate the design to different i.MXX applications processors if the feature requirements change.

Combining the native MLX processing capabilities of NXP’sX i.MXX applications processors with dedicated MLX accelerators from NXP ecosystem partners such as Kinara, creates an even larger scalability range than that offered by NXP alone while still maintaining software reuse.


Credit(s)



Share this article:
Share via emailShare via LinkedInPrint this page

Further reading:

The end of ‘entry-level’: STMicroelectronics’ STM32C5 sets a new baseline for embedded systems
DSP, Micros & Memory
[Sponsored] Instead of incrementally improving legacy Cortex-M0+ architectures, STM32C5 introduces a Cortex-M33-based platform into the entry-level category. This changes not only performance expectations, but also how engineers approach system architecture, consolidation, and long-term scalability.

Read more...
GigaDevice expands GD25UF Series density
NuVision Electronics DSP, Micros & Memory
GigaDevice has announced the expanded density range of its GD25UF series 1,2 V ultra-low power SPI NOR Flash, now spanning from 8 Mb to 256 Mb.

Read more...
ARINC 429 line driver evaluation board
ASIC Design Services DSP, Micros & Memory
Holt Integrated Circuits have announced the release of the ADK-85104 Evaluation Board, a compact, ready-to-use platform designed to help engineers rapidly evaluate and characterise Holt’s HI-85104.

Read more...
Highly integrated 24-channel mixed signal IC
EBV Electrolink DSP, Micros & Memory
Microchip Technology has announced the LX4580, a 24-channel mixed-signal IC designed to replace multiple discrete components with a single device that supports synchronised data acquisition, fault monitoring, and motor control.

Read more...
Lower-power Thread and BLE connectivity
iCorp Technologies DSP, Micros & Memory
Espressif has released the ESP32-H21, a low-power wireless SoC aimed at Thread, Matter, Zigbee, and Bluetooth LE device designs.

Read more...
Touch-enabled 32-bit MCU
EBV Electrolink DSP, Micros & Memory
Microchip’s PIC32CM PL10 microcontroller family expands the company’s Arm Cortex-M0+ portfolio, delivering a compact, low-power 32-bit platform designed for cost-sensitive embedded applications.

Read more...
Compact 6 A automotive buck converter
Altron Arrow Power Electronics / Power Management
Delivering up to 6 A of continuous output current, the DCP0606Y from STMicroelectronics enables efficient regulation of low-voltage rails commonly used in modern vehicle electronics and industrial systems.

Read more...
Build smarter with UNO Q
Electrocomp Express DSP, Micros & Memory
The Arduino UNO Q’s hybrid design combines a Linux Debian-capable microprocessor with a real-time STM32U585 microcontroller making it the perfect dual-brain platform for the next innovation.

Read more...
Next-gen modules for rugged edge AI systems
Altron Arrow Computer/Embedded Technology
Designed for real-world edge deployments where systems do not sit still, SolidRun’s P100 COMx6 series targets mobile platforms as well as harsh, mission-critical environments.

Read more...
Compact AI modules for imaging
Otto Wireless Solutions DSP, Micros & Memory
SIMCom has introduced two compact smart AI modules, the SIM8666 and SIM8668, designed to simplify the development of imaging-based IoT applications.

Read more...









While every effort has been made to ensure the accuracy of the information contained herein, the publisher and its agents cannot be held responsible for any errors contained, or any loss incurred as a result. Articles published do not necessarily reflect the views of the publishers. The editor reserves the right to alter or cut copy. Articles submitted are deemed to have been cleared for publication. Advertisements and company contact details are published as provided by the advertiser. Technews Publishing (Pty) Ltd cannot be held responsible for the accuracy or veracity of supplied material.




© Technews Publishing (Pty) Ltd | All Rights Reserved