NXP Semiconductors has launched software and training tools for local voice control on its ARM Cortex-M microcontrollers.
The software is based on voice technology developed by Retune DSP in Copenhagen, Denmark, which NXP acquired in 2021.
NXP says the Voice Intelligent Technology (VIT) is the industry’s first fully comprehensive, local voice control software package with online training tools free for customers, regardless of end application production volumes. VIT is a ready-to-use library that provides a far-field audio front end supporting up to three microphones, an always-on wake word engine, and a voice command engine. NXP’s free online tools enable customers to define a wake word and voice commands themselves, using simple text entry and without the need for voice recordings.
Developing reliable on-device voice control is complicated by the speech processing software it requires. Developers frequently have to manage an audio front-end beamformer as well as separate wake-word and voice command engines, often from different software vendors. VIT streamlines development by providing a comprehensive, flexible software solution that can be customized for specific applications and incorporates all the elements needed to create a highly accurate, on-device voice control solution.
“Voice is the interface of choice for many smart technologies, including those in smart homes, smart cities and smart factories,” said Joe Yu, Vice President and General Manager of the IoT Edge Processing Product Line at NXP. “By reducing the complexity of voice application development, we’ve made it easier and faster to bring new, on-device voice control to market.”
NXP already has a range of reference designs for voice control. The EdgeReady MCU-based solution for Amazon’s Alexa Voice Service (AVS) integration for AWS IoT Core uses the i.MX RT106A crossover MCU, enabling developers to add Alexa voice assistant capabilities to products. The EdgeReady MCU-based solution for local voice control enables developers to add support for far-field, offline local voice control, leveraging an optimized speech recognition engine.
The VIT library is delivered ready to use in the MCUXpresso SDK, and the online training tool is available at vit.nxp.com.