ARM launches two machine learning processors
The processors are the ARM ML and the ARM OD with OD standing for object detection. These devices will enable trillions of operations per second and are intended to be used on edge devices such as within mobile phones. Indeed, ARM claims the ML processor is the “most efficient solution to run neural networks.”
The ARM ML comprises fixed-function engines and programmable-layer engines for selected primitive operations while allowing for innovation and future algorithms. A network control unit manages the overall execution of the neural network and a DMA unit moves data in and out of main memory. Onboard memory allows central storage for weights and feature maps, thus reducing traffic to the external memory and, therefore, power.
Not much detail yet in ARM’s description of its ML processor. Source: ARM
The performance is said to be greater than 4.6TOPS in mobile environments with an efficiency of 3TOPS per watt. It is also said to provide a “massive efficiency uplift from CPUs, GPUs, DSPs and accelerators.”
However, while the design is said to be tuned for advanced geometry implementation, ARM does not indicate whether the IP has already been licensed to lead partners. Also ARM does not indicate more precisely the nature of the programmable and fixed-function engines or what data types are supported.
Next: OD
The ML processor can be used stand alone but also can be used with the OD processor, which is described as ARM’s second generation of Object Detection processor. This is designed to work with 2D fields and with visual fields in particular.
The OD processor scans each frame at 60fps and provides a list of detected objects, along with their location within the scene. The devices detects human forms, faces, heads and shoulders, and can even determine the direction each person is facing. Object sizes detected can be as small as 50 by 60 pixels.
ARM claims the OD processor offers 80x the performance of a traditional DSP, and a significant improvement in detection quality relative to previous Arm technologies.
The OD processor is intended to be used as a pre-processor to detect regions of interest – and particularly people of interest – and it can be used with ARM Cortex CPUs, Mali GPUs and the ML processor.
The ML and OD processors can be deployed together or separately but they also can make use of ARM NN software and the ARM Compute Library (see ARM’s soft launch for machine learning library).
Roadmap for ARM NN software as it bridges from TensorFlow, Caffe etc. to various processors. Source: ARM
ARM NN software, when used alongside the ARM Compute Library and CMSIS-NN is optimized for NNs and bridges the gap between NN frameworks such as TensorFlow, Caffe, and Android NN and the full range of Cortex CPUs, Mali GPUs, and ML processors.
Jem Davies, general manager of machine learning business at ARM, said that the advent of machine learning represents “the biggest inflection point in computing for more than a generation.” He added that it will be done at the edge rather than in data centers wherever possible, for reasons of energy efficiency, latency, safety-criticality, economics and privacy.
ARM added that future ML products will enable developers to pick their point on a performance curve from sensors and smart speakers, to mobile, home entertainment, and beyond.
ARM stated that the ARM machine learning IP suite will be available for general availability in mid-2018.
Related links and articles:
Machine Learning Developer Portal
News articles:
Thoughts on Jem Davies leading ARM’s machine learning group
ARM has R&D interest in neural network cores
ARM’s soft launch for machine learning library
ARM acquires ChaoLogix for security reasons