
AI co-processor boosts NPU performance and efficiency
Cadence has announced the Tensilica NeuroEdge 130 AI co-processor, designed to complement any neural processing unit (NPU) and enable end-to-end execution of the latest agentic and physical AI networks on advanced automotive, consumer, industrial, and mobile SoCs.
Based on the proven architecture of the highly successful Tensilica Vision DSP family, the NeuroEdge 130 AI co-processor delivers more than 30% area savings and over 20% savings in dynamic power and energy compared with Tensilica Vision DSPs, without impacting performance. It also reuses the same software, AI compilers, libraries and frameworks, shortening time to market.
“With the rapid proliferation of AI processing in physical AI applications such as autonomous vehicles, robotics, drones, industrial automation, and healthcare, NPUs are assuming a more critical role,” said Karl Freund, founder and principal analyst of Cambrian AI Research. “Today, NPUs handle the bulk of the computationally intensive AI/ML workloads, but a large number of non-MAC layers include pre- and post-processing tasks that are better offloaded to specialised processors. However, current CPU, GPU, and DSP solutions involve trade-offs, and the industry needs a low-power, high-performance solution that is optimised for co-processing and allows future-proofing for rapidly evolving AI processing needs.”
Featuring an extensible design that enables seamless compatibility with in-house NPUs, Cadence Neo™ NPUs and third-party NPU IP, the Tensilica NeuroEdge 130 AI co-processor performs offloaded tasks with high performance and better efficiency than its application-specific predecessors. Taking the inherent power, performance and area (PPA) advantages of Tensilica DSPs to new levels, the NeuroEdge 130 delivers over 30% area savings and a more than 20% reduction in dynamic power and energy with comparable performance to Tensilica Vision DSPs on AI networks and operators.
The Tensilica NeuroEdge 130 AI co-processor features a VLIW-based SIMD architecture with configurable options that enable high performance and low power consumption. Acting as a control processor, it issues instructions and commands to the NPU. Its optimised ISA and instructions run tasks that are not well suited to the NPU, such as ReLU, sigmoid, and tanh. The Tensilica NeuroEdge 130 provides programmability, flexibility, and future readiness to the AI subsystem, facilitating end-to-end execution of unseen and future AI workloads.
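To make the division of labour concrete, the following is a minimal Python sketch of how a network's layers might be split between an NPU and a co-processor. It is purely illustrative and is not Cadence code or the NeuroWeave SDK API: the names `MAC_OPS`, `assign_target` and `partition` are hypothetical, and the choice of `conv2d` and `matmul` as the MAC-heavy operators is an assumption. Only ReLU, sigmoid and tanh are named in the announcement as examples of non-MAC work.

```python
import math

# Assumption: these operators stand in for MAC-heavy layers that stay on the NPU.
MAC_OPS = {"conv2d", "matmul"}


def relu(xs):
    """Element-wise ReLU: negative values are clamped to zero."""
    return [max(0.0, x) for x in xs]


def _sigmoid_scalar(x):
    """Numerically stable logistic function for a single value."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def sigmoid(xs):
    """Element-wise sigmoid."""
    return [_sigmoid_scalar(x) for x in xs]


def tanh(xs):
    """Element-wise hyperbolic tangent."""
    return [math.tanh(x) for x in xs]


def assign_target(op_name):
    """Hypothetical rule: MAC-heavy ops go to the NPU, everything else is offloaded."""
    return "npu" if op_name in MAC_OPS else "coprocessor"


def partition(layers):
    """Pair each layer name with the unit it would run on in this toy model."""
    return [(name, assign_target(name)) for name in layers]


if __name__ == "__main__":
    for name, target in partition(["conv2d", "relu", "matmul", "sigmoid", "tanh"]):
        print(f"{name:8s} -> {target}")
```

In a real design the partitioning decision would be made by the compiler toolchain rather than by a fixed lookup set; the sketch only shows why activation functions, which involve no multiply-accumulate, are natural candidates for offloading.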
The Tensilica NeuroEdge 130 AI co-processor is supported by the Cadence NeuroWeave™ Software Development Kit (SDK), which is used across all of Cadence’s AI IP. Leveraging the Tensor Virtual Machine (TVM) stack, the NeuroWeave SDK is easy to use and enables architects to tune, optimise, and deploy their AI models for Cadence’s AI IP. The Tensilica NeuroEdge 130 also features a lightweight standalone AI library, allowing customers to program AI layers directly on the new processor and bypass potential overheads associated with some compiler frameworks.
“Cadence has proven AI co-processor use cases with our Tensilica DSPs. With AI workloads transforming and becoming less domain-specific, our AI SoC and systems customers have been seeking a small and efficient AI-focused co-processor for better PPA and future-proofing,” said Boyd Phelps, senior vice president and general manager of the Silicon Solutions Group at Cadence. “Continuing our track record of IP innovations, we’ve introduced a purpose-built new class of processors. Designed as an NPU companion, the Tensilica NeuroEdge 130 AI co-processor raises the bar for performance efficiency to address our customers’ most demanding AI applications.”
