
Meta presents its new custom AI chip

Meta, hell-bent on catching up to its rivals in the generative AI space, is spending billions of dollars in the effort. Part of those billions goes to recruiting AI researchers. But an even bigger chunk is being spent on hardware development, specifically chips to run and train Meta's AI models.

Meta unveiled the latest fruit of its chip development efforts shortly after Intel announced its own latest AI accelerator hardware. Called the "next-generation" Meta Training and Inference Accelerator (MTIA), the successor to last year's MTIA v1, the chip runs models for ranking and recommending display ads on Meta properties (e.g., Facebook).

The next-generation MTIA is built on a 5nm process, in contrast to MTIA v1, which was made on a 7nm process. (In chip manufacturing, "process" refers to the size, in nanometers, of the smallest component that can be built on the chip.) The next-generation MTIA is physically larger and features more processing cores than its predecessor. And while it consumes more power (90W vs. 25W), it also has more internal memory (128MB vs. 64MB) and runs at a higher average clock speed (1.35GHz vs. 800MHz).

The next-generation MTIA, according to Meta, is currently live in 16 data center regions and delivers up to three times the overall performance of MTIA v1. If that "3x" claim sounds vague, you're not wrong: Meta says the figure came from testing the performance of "four key models" on both chips.

“We can achieve greater efficiency compared to commercially available GPUs because we control the entire stack,” Meta writes in a blog post.

First, Meta notes in the blog post that it is not currently using the next-generation MTIA for generative AI training workloads, though the company says it has "several programs in place" to explore this. Second, Meta concedes that the next-generation MTIA will complement rather than replace GPUs for running or training models.

Meta is moving slowly, perhaps more slowly than it would like.

Meta's AI teams are likely under pressure to drive down costs. By the end of 2024, the company is set to invest around $18 billion in GPUs to train and run generative AI models. With training costs for cutting-edge generative models running into the tens of millions of dollars, in-house hardware is an attractive option.

And as Meta's hardware efforts lag, competitors are pulling ahead, which no doubt worries Meta's leadership.

This week, Google introduced TPU v5p, its fifth-generation custom chip for training AI models, and Axion, its first dedicated chip for running models. Amazon has several custom AI chip families of its own. And last year, Microsoft debuted the Azure Maia AI Accelerator and the Azure Cobalt 100 CPU.

Meta claims in the blog post that going from "first silicon to production models" of the next-generation MTIA took less than nine months, which, to be fair, is shorter than the typical window between Google TPUs. But Meta has plenty of catching up to do if it hopes to achieve a measure of independence from third-party GPUs and keep pace with its fierce competition.
