Edge Impulse Unveils Ability to Create Synthetic Data for the Edge Using Leading Generative AI Tools

Generating Synthetic Images in Edge Impulse with GPT-4 (DALL-E) (Graphic: Business Wire)

SAN JOSE, Calif.--(BUSINESS WIRE)--Edge Impulse, the leading platform for building, refining and deploying machine learning models to edge devices, has launched new capabilities that leverage generative AI to create and manage synthetic data on the edge, from images, to speech, to audio data.

The Synthetic Data integration offers a new and efficient way to use Edge Impulse for LLM-based data creation, enabling DALL-E for image generation, Whisper for creating speech elements for keyword spotting and ElevenLabs for audible events. Enterprise customers also have the option to add custom LLM sources such as other data providers or self-hosted LLMs. Other LLM toolkits will be added in the coming months. These new features are in addition to Edge Impulse's existing direct integration with NVIDIA Omniverse Replicator, a framework for developing custom synthetic data generation pipelines to generate highly realistic, physically based datasets tailored to train computer vision models.

Within the new Synthetic Data integration, users can add and refine their prompts quickly. The output, including images and audio fragments, is then displayed, allowing users to evaluate the results and adjust their prompts until they get the desired data set:

  • DALL-E Image Generation Block: Generate image datasets using the DALL-E model.
  • Whisper Keyword Spotting Generation Block: Generate keyword-spotting datasets using the Whisper model. Ideal for keyword spotting and speech recognition applications.
  • ElevenLabs Synthetic Audio Block: Generate audible events, such as glass breaking or alarm sounds, using the ElevenLabs Sound Effects model.
  • Custom LLM Sources: Connect to other LLM data providers or self-hosted LLMs using transformation blocks, including Edge Impulse's existing integration with GPT-4o for labeling image data.

This iterative workflow will make it easier to determine the right prompts for generating data. Additionally, any data that is not deleted will automatically be added to the project, ensuring seamless data management.
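The generate, review and keep loop described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: `Sample`, `fake_generate`, `refine_dataset` and the `keep` callback are hypothetical names, not part of any Edge Impulse, OpenAI or ElevenLabs API, and the generator is a stub that returns placeholder strings instead of calling a real model.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List


@dataclass
class Sample:
    prompt: str   # the prompt that produced this sample
    payload: str  # placeholder for image or audio bytes


def fake_generate(prompt: str, count: int) -> List[Sample]:
    """Hypothetical stand-in for a generative model call."""
    return [Sample(prompt, f"{prompt} #{i}") for i in range(count)]


def refine_dataset(
    prompts: Iterable[str],
    generate: Callable[[str, int], List[Sample]],
    keep: Callable[[Sample], bool],
    per_prompt: int = 3,
) -> List[Sample]:
    """Generate samples per prompt and retain only those the reviewer keeps.

    Mirrors the described behavior: anything not deleted during review
    ends up in the project's dataset.
    """
    dataset: List[Sample] = []
    for prompt in prompts:
        batch = generate(prompt, per_prompt)
        dataset.extend(sample for sample in batch if keep(sample))
    return dataset


# Example review rule (assumption): discard the first sample of each batch.
dataset = refine_dataset(
    ["glass breaking", "alarm sound"],
    fake_generate,
    keep=lambda sample: not sample.payload.endswith("#0"),
)
print(len(dataset), "samples kept")
```

In practice the `keep` decision is made by a person reviewing the displayed output, and the prompt list changes between rounds; the callback simply marks where that judgment enters the loop.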

This new toolset greatly streamlines the process of generating and refining prompts to create the desired data set. It provides an efficient workflow for building models using synthetic data and makes it easier for developers to create high-quality data sets by leveraging generative AI.

Available today for Enterprise-tier users, with availability coming soon for Professional Plan users, this new feature is located as a top tab in the “data acquisition” section of Edge Impulse, alongside Dataset, Data explorer, and Data sources options. Learn more in Edge Impulse’s documentation about the Synthetic data integration and how to generate audio event datasets. Video tutorials are also available showcasing how to use Generative AI to synthetically create audio datasets using ElevenLabs.io, auto-labeling satellite imagery with ChatGPT-4o, and labeling image data using GPT-4o.

About Edge Impulse

Edge Impulse streamlines the creation of AI and machine learning models for edge hardware, allowing devices to make decisions and offer insight where data is gathered. Edge Impulse’s technology empowers developers to bring more AI products to market, and helps enterprise teams rapidly develop production-ready solutions in weeks instead of years. Powerful automations make it easier to build valuable datasets and develop advanced AI for edge devices from MCUs to CPUs to GPUs. Used by health and wearable organizations like Know Labs and Neurable, industrial organizations like TKE and Lexmark, as well as top silicon vendors and over 100,000 developers, Edge Impulse has become the trusted ML platform for enterprises and developers alike. To learn more, visit edgeimpulse.com.

Contacts

Marie Williams
Coderella
(415) 707-2793
press@edgeimpulse.com