NVIDIA Enables Era of Interactive Conversational AI with New Inference Software

By Tiera Oliver

Assistant Managing Editor

Embedded Computing Design

January 02, 2020

News

NVIDIA Enables Era of Interactive Conversational AI with New Inference Software

NVIDIA TensorRT 7?s Compiler Delivers Real-Time Inference for Smarter Human-to-AI Interactions.

NVIDIA, a technology company that designs graphics processing units for gaming and professional markets, and system on a chip units for the mobile computing and automotive market, introduced inference software that developers can use to deliver conversational AI applications, inference latency, and interactive engagement.

NVIDIA TensorRT 7, according to the company, opens the door to smarter human-to-AI interactions, enabling real-time engagement with applications such as voice agents, chatbots and recommendation engines.

It is also estimated that there are 3.25 billion digital voice assistants being used in devices around the world, according to Juniper Research. By 2023, that number is expected to reach 8 billion, more than the world’s total population.

TensorRT 7 features a new deep learning compiler designed to optimize and accelerate the recurrent and transformer-based neural networks needed for AI speech applications. According to the company, this speeds the components of conversational AI by more than 10x compared to when run on CPUs, driving latency below the 300-millisecond threshold considered necessary for real-time interactions.

Some companies are already taking advantage of NVIDIA’s conversational AI acceleration capabilities. Among these is Sogou, which provides search services to WeChat, a frequently used application on mobile phones.

Rising Importance of Recurrent Neural Networks
With TensorRT’s new deep learning compiler, developers everywhere now have the ability to automatically optimize these networks, such as bespoke automatic speech recognition networks, and WaveRNN and Tacotron 2 for text-to-speech, and to deliver performance and low latencies.

The new compiler also optimizes transformer-based models like BERT for natural language processing.

Accelerating Inference from Edge to Cloud
According to NVIDIA, TensorRT 7 can optimize, validate and deploy a trained neural network for inference by hyperscale data centers, embedded or automotive GPU platforms.

NVIDIA’s inference platform, which includes TensorRT, as well as several NVIDIA CUDA-X AI libraries and NVIDIA GPUs, delivers low-latency, high-throughput inference for applications beyond conversational AI, including image classification, fraud detection, segmentation, object detection and recommendation engines. Its capabilities are used by some of the world’s leading enterprise and consumer technology companies, including Alibaba, American Express, Baidu, PayPal, Pinterest, Snap, Tencent and Twitter.

Availability
TensorRT 7 will be available in the coming days for development and deployment, without charge to members of the NVIDIA Developer program from the TensorRT webpage. The latest versions of plug-ins, parsers and samples are also available as open source from the TensorRT GitHub repository.

For more information, please visit: https://www.nvidia.com/en-us/#source=pr

Tiera Oliver, Assistant Managing Editor for Embedded Computing Design, is responsible for web content edits, product news, and constructing stories. She develops content and constructs ECD podcasts, such as Embedded Insiders. Before working at ECD, Tiera graduated from Northern Arizona University, where she received her B.S. in journalism and political science and worked as a news reporter for the university’s student-led newspaper, The Lumberjack.

Topic Tags

#NVIDIA GTC

Embedded Computing Design

NVIDIA Enables Era of Interactive Conversational AI with New Inference Software

By Tiera Oliver

NVIDIA TensorRT 7?s Compiler Delivers Real-Time Inference for Smarter Human-to-AI Interactions.

Categories

Processing - Chips & SoCs

Processing - Compute Modules

Topic Tags

Trending Articles

Avalue ECM-ASL3 Industrial Board Offers Intel Next-Gen Compatibility for Edge AI and Automation

Vigit International Helps Businesses Deploy Smart Signage, Powered by Intel

The Open AD Kit Blueprint is Accelerating SDV Development Across the Industry

Secure Boot and the Manufacturing Chain: Implementation and Impact

albatron.ai Launches TALO-25000 Embedded AI System Powered by NVIDIA Jetson AGX Orin

Consumer

TDK Adds SmartMotion for Smart Glasses to its Custom Sensing Solutions for AI Glasses and Augmented Reality

Open Source

Embedded Executive: RISC-V Works Great At Low Power Levels, Too | Upbeat Technology

Security

New RunSafe Security Report: Engineering Leaders Brace for Rising Cyber Risks in Embedded AI

HPC/Datacenters

A Comprehensive Digital Twin Environment and Semiconductor Lifecycle Management Can Ensure Reliable Data-Center Operations