Sagence AI Emerges from Stealth Tackling Economic Viability of Inference Hardware for Generative AI

Sagence AI™ today emerged from stealth unveiling a groundbreaking advanced analog in-memory compute architecture that directly addresses the untenable power/performance/price and environmental susta...

Business Wire

High performance AI inference compute at astounding 100X lower MAC power and 20X lower cost

SANTA CLARA, Calif.: Sagence AI™ today emerged from stealth unveiling a groundbreaking advanced analog in-memory compute architecture that directly addresses the untenable power/performance/price and environmental sustainability conundrum facing AI inferencing. Driven by its industry-first architectural innovations using analog technology, Sagence AI makes possible multiple orders of magnitude improvement in energy efficiency and cost reductions, while sustaining performance equivalent to high performance GPU/CPU based systems.

Compared to industry’s leading volume GPU processing the Llama2-70B large language model with performance normalized to 666K tokens/sec, Sagence technology performs with 10X lower power, 20X lower price, and 20X smaller rack space. Using a modular chiplet architecture for maximum integration, Sagence technology makes possible a highly efficient inference machine that scales from data center generative AI to edge computer visions applications across multiple industries. This previously unimaginable balance of high performance and low power at affordable cost addresses the growing ROI problem for generative AI applications at scale, as AI compute in the data center shifts from training models to deployment of models to inference tasks.

“A fundamental advancement in AI inference hardware is vital to the future of AI. Use of large language models (LLMs) and Generative AI drives demand for rapid and massive change at the nucleus of computing, requiring an unprecedented combination of highest performance at lowest power and economics that match costs to the value created,” said Vishal Sarin, CEO & Founder, Sagence AI. “The legacy computing devices today that are capable of extreme high performance AI inference cost too much to be economically viable and consume too much energy to be environmentally sustainable. Our mission is to break those performance and economic limitations in an environmentally responsible way.”

“The demands of the new generation of AI models have resulted in accelerators with massive on-package memory and consequently extremely high-power consumption. Between 2018 and today, the most powerful GPUs have gone from 300W to 1200W, while top-tier server CPUs have caught up to the power consumption levels of NVIDIA’s A100 GPU from 2020,” said Alexander Harrowell, Principal Analyst, Advanced Computing, Omdia. “This has knock-on effects for data center cooling, electrical distribution, AI applications’ unit economics, and much else. One way out of the bind is to rediscover analog computing, which offers much lower power consumption, very low latency, and permits working with mature process nodes.”

On the Frontier of Analog In-memory Compute

Sagence AI leads the industry on the frontier of in-memory compute innovation. Sagence technology is the first to do deep subthreshold compute inside multi-level memory cells, an unprecedented combination that opens doors to the orders of magnitude improvements necessary to deliver inference at scale. As digital technology reaches limits in ability to scale power and cost, Sagence innovated a new analog path forward leveraging the inherent benefits of analog in energy efficiency and costs to make possible mass adoption of AI that is both economically viable and environmentally sustainable.

In-memory Computing Aligned to AI Inference

In-memory computing aligns closely with the essential elements of efficiency in AI inference applications. Merging storage and compute inside memory cells eliminates single-purpose memory storage and complex scheduled multiply-accumulate circuits that run the vector-matrix multiplication integral to AI computing. The resulting chips and systems are much simpler, lower cost, lower power and with vastly more compute capability.

Sagence views the AI inference challenge not as a general-purpose computing problem, but a mathematically intensive data processing problem. Managing the massive amount of arithmetic processing needed to “run” a neural network on CPU/GPU digital machines requires extremely complicated hardware reuse and hardware scheduling. The natural hardware solution is not a general-purpose computing machine, rather an architecture that more closely mirrors how biological neural networks operate.

Reduced Software Complexity

The statically scheduled deep subthreshold in-memory compute architecture employed by Sagence chips is much simpler and eliminates the variabilities and complexities of the dynamic scheduling required of CPUs and GPUs. Dynamic scheduling places extreme demands on the SDK to generate the runtime code and contributes to cost and power inefficiencies. The Sagence AI design flow imports a trained neural network using standards-based interfaces like PyTorch, ONNX and TensorFlow, and automatically converts it into Sagence format. The Sagence system receives the neural network long after GPU software created it, negating further need of the GPU software.

About Sagence AI

Sagence AI, formerly known as Analog Inference, innovates fundamental semiconductor technology to fully unlock the untapped potential of Analog in-memory compute methods to meet the unprecedented performance, power and cost requirements of pervasive AI computing. The Sagence technology team, comprised of multiple 30-year career experts in compute, memory, and AI technologies, developed the final frontier of true in-memory compute to solve the economic viability and sustainability issues facing AI inference deployments. Sagence AI has US $58M in funding from strategic and venture investors. The company was seeded by marque silicon valley investors – Vinod Khosla (Khosla Ventures), Andy Bechtolsheim (founder SUN Microsystems), and Atiq Raza (former President COO, AMD). Series A and B investors include TDK Ventures, P7 Ventures (Aramco Ventures), Alumni Ventures, Cambium Capital Partners, and New Science Ventures. Sagence is headquartered in Santa Clara, CA.
www.sagence-AI.com

Copyright 2024 © Sagence AI. Sagence AI, the Sagence AI logo, and other designated brands included herein are trademarks or registered trademarks of Sagence AI Incorporated in the United States and other countries. All other brand and product names are trademarks or service marks of their respective owners.

Fonte: Business Wire

Last News

RSA at Cybertech Europe 2024

Alaa Abdul Nabi, Vice President, Sales International at RSA presents the innovations the vendor brings to Cybertech as part of a passwordless vision for…

Italian Security Awards 2024: G11 Media honours the best of Italian cybersecurity

G11 Media's SecurityOpenLab magazine rewards excellence in cybersecurity: the best vendors based on user votes

How Austria is making its AI ecosystem grow

Always keeping an European perspective, Austria has developed a thriving AI ecosystem that now can attract talents and companies from other countries

Sparkle and Telsy test Quantum Key Distribution in practice

Successfully completing a Proof of Concept implementation in Athens, the two Italian companies prove that QKD can be easily implemented also in pre-existing…

G11 Media Networks

InnovationOpenLab is a channel of BitCity, a newspaper registered at the court of Como ,
n. 21/2007 del 11/10/2007- Registration ROC n. 15698

G11 MEDIA S.R.L. Registered office Via NUOVA VALASSINA, 4 22046 MERONE (CO) - P.IVA/C.F.03062910132 Como business register n. 03062910132 - REA n. 293834 CAPITALE SOCIALE Euro 30.000 i.v.

M-Files Recognized in 2024 Gartner® Magic Quadrant™ for Document Management and Gartner® Critical Capabilities for Document Management

Nintendo Download: Season’s Greetings!

Digital Wave Technology Named a “Vendor to Watch” in the IDC MarketScape: Worldwide Retail Promotions Management, 2024-2025 Vendor Assessment

WiSA Technologies Inks Definitive Agreement to Acquire CompuSystems, Inc.

Pitch in to Provide: Donate to the Holiday Food Drive hosted by Peraton and Partners

Middle East & Africa Data Center Colocation Market Industry Outlook & Forecast 2024-2029: New Entrants Include Agility, Compass Datacenters, Cloudoon, EDGNEX Data Centres, Kasi Cloud, NED and Nxtra - ResearchAndMarkets.com

i-ESG Accelerates Expansion into the Middle East with Abu Dhabi Entity Establishment

Axcelead DDP Enters into Drug Discovery Service Agreement with Astellas for Targeted Protein Degraders

Sagence AI Emerges from Stealth Tackling Economic Viability of Inference Hardware for Generative AI

Related news

Last News

RSA at Cybertech Europe 2024

Italian Security Awards 2024: G11 Media honours the best of Italian cybersecurity

How Austria is making its AI ecosystem grow

Sparkle and Telsy test Quantum Key Distribution in practice

Most read

Mutual of Omaha and Workday to Help Companies Enhance Employee Benefits…

Sei Labs Releases New “Giga” Roadmap That Will Bring 50x Improvement to…

Swoop Celebrates Triple Recognition in PM360’s 13th Annual Innovations…

Mastercard Finalizes Acquisition of Recorded Future

G11 Media Networks

Sagence AI Emerges from Stealth Tackling Economic Viability of Inference Hardware for Generative AI

Related news

Last News

Most read

Newsletter signup

G11 Media Networks