How a Tier 1 Automotive Software Provider Creates Smarter, More Natural In-Car Infotainment Systems

By Appen. May 21, 2019

The Company

A leading provider of vehicle electronics software approached Appen to collect audio and linguistic data to help develop automatic speech recognition (ASR) capabilities for its in-car infotainment system.

The Situation

For an in-car infotainment system, or any ASR system, to recognize and correctly process voice commands, it must be trained on speech data that covers a broad range of inputs and the many ways in which people speak. A driver might use countless different verbal commands to adjust the climate control, radio, navigation, phone, and other settings in an automobile. Training these systems to understand multiple dialects and speaker categories poses an even bigger challenge, requiring many thousands of utterances in each targeted language.

The Solution

Appen collects natural language data and text data covering the scenarios and variation that the system might encounter in the real world. Working with in-market, on-demand crowds of native speakers, we can rapidly expand ASR capabilities to new locations and languages for any given scenario. Because the company has strict standards for audio recording quality, Appen replicates the same advanced recording procedures across locations and studios, and supervises them for compliance with quality standards across a range of languages used in the automotive industry. Services included:

Spontaneous, unscripted speech data collection, in which native speakers are given a set of scenarios (e.g., How would you ask to lower the temperature? Put your favorite music on? Change the radio station?) and must generate varied responses
Text data collection using scenarios similar to those for speech, but aimed at obtaining larger volumes of data and a broader variety of speakers
Scripted speech data collection for short, fixed utterances
Test driving simulation to mimic the cognitive load of driving, so speakers come up with more natural, real-world responses
Country-specific studio data collection with specialist equipment to ensure that different studios are calibrated for precision and comply with strict audio standards

The Outcome

Working with Appen for more than six years, the company has created a smarter, more connected, and more natural in-car experience, with systems that can recognize natural, spontaneous responses. With our data collection and annotation services, the company has rapidly expanded the system to over 20 new languages. And because our linguists have deep expertise in both creating and localising scenarios that mimic real-world driving conditions, the Tier 1 provider knows that it is receiving the high-quality speech and language data it needs to train its ASR systems.

Benefits

Spontaneous speech data that fits users’ natural behaviour
Rapid deployment in new languages and locations
Strict audio quality compliance across a range of over 40 languages
