Mipsology Zebra on Xilinx FPGA Beats GPUs, ASICs for ML Inference Efficiency

By Tiera Oliver

Associate Editor

Embedded Computing Design

November 16, 2020


Results Highlight “Zero Effort” Architecture Benefits for Computing Neural Networks.

Mipsology announced that its Zebra AI inference accelerator achieved the highest efficiency in the latest MLPerf inference benchmarking. Per the company, Zebra running on a Xilinx Alveo U250 accelerator card achieved more than 2x higher peak performance efficiency than all other commercial accelerators.

Efficiency of Computation (source: MLPerf and Internet data)

Peak TOPS has been the standard measure of potential compute performance, and many assume that more TOPS means higher performance. However, this ignores the real efficiency of the architecture and the fact that, at some point, returns diminish. This effect, similar to “dark silicon” for power, occurs when circuitry cannot be used because of existing limitations. Zebra has proven to scale along with TOPS, maintaining the same high efficiency as peak TOPS grows.

With a peak of 38.3 TOPS, as announced by Xilinx, the Zebra-powered Alveo U250 accelerator card outperformed competitors in throughput per TOPS and ranks among the best accelerators available today. Based on the MLPerf v0.7 inference results, it delivers performance similar to an NVIDIA T4 while having 3.5x fewer TOPS. In other words, given the same number of TOPS as the GPU, Zebra would deliver 3.5x more throughput than the GPU, or 6.5x more than a TPU v3.
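The throughput-per-TOPS comparison can be made concrete with a little arithmetic. The sketch below is illustrative only: it uses the two figures quoted in the article (38.3 peak TOPS for the Alveo U250 and the 3.5x TOPS ratio) and assumes, as a simplification, that "similar performance" means equal throughput, normalized to 1.0. The function and variable names are hypothetical, and no measured MLPerf throughput numbers are used.

```python
def throughput_per_tops(throughput: float, peak_tops: float) -> float:
    """Efficiency metric: delivered throughput divided by peak TOPS."""
    if peak_tops <= 0:
        raise ValueError("peak_tops must be positive")
    return throughput / peak_tops


# Figures quoted in the article.
ZEBRA_U250_PEAK_TOPS = 38.3      # peak TOPS announced by Xilinx
GPU_TO_ZEBRA_TOPS_RATIO = 3.5    # the GPU has 3.5x the peak TOPS of Zebra

# Assumption: "similar performance" is treated as equal throughput,
# expressed in normalized units rather than real samples per second.
RELATIVE_THROUGHPUT = 1.0

# Peak TOPS of the GPU implied by the article's ratio (not a vendor spec).
implied_gpu_peak_tops = ZEBRA_U250_PEAK_TOPS * GPU_TO_ZEBRA_TOPS_RATIO

zebra_efficiency = throughput_per_tops(RELATIVE_THROUGHPUT, ZEBRA_U250_PEAK_TOPS)
gpu_efficiency = throughput_per_tops(RELATIVE_THROUGHPUT, implied_gpu_peak_tops)

# With equal throughput, the efficiency advantage equals the TOPS ratio.
efficiency_advantage = zebra_efficiency / gpu_efficiency

print(f"Implied GPU peak TOPS: {implied_gpu_peak_tops:.2f}")
print(f"Zebra throughput-per-TOPS advantage: {efficiency_advantage:.2f}x")
```

Under these assumptions, equal throughput on 3.5x fewer TOPS is the same statement as 3.5x more throughput per TOPS, which is the basis of the efficiency claim.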

This performance does not come at the cost of changing the neural network. Zebra was accepted in the closed category of MLPerf, which requires no neural network changes, high accuracy, and no pruning or other methods requiring user intervention. Zebra achieves this efficiency while maintaining TensorFlow and PyTorch framework programmability.

MLPerf has been the industry benchmark for comparing the training performance of ML hardware, software and services since 2018, and inference performance since 2019.

For more information, visit: www.mipsology.com

Tiera Oliver, Associate Editor for Embedded Computing Design, is responsible for web content edits, product news, and constructing stories. She also assists with newsletter updates and contributes to and edits content for ECD podcasts and the ECD YouTube channel. Before working at ECD, Tiera graduated from Northern Arizona University, where she received her B.S. in journalism and political science and worked as a news reporter for the university’s student-led newspaper, The Lumberjack.
