Arcmira media summary

Mixture of Experts (MoE), explained: podcasts, interviews & video clips

Explore podcasts, interviews & explainers on Mixture of Experts (MoE) — 5 indexed from Lex Fridman & Dwarkesh Patel, updated Apr 2026.

Representative appearances

The math behind how LLMs are trained and served – Reiner Pope
A model architecture style that uses sparse activation to save compute.
State of AI in 2026: LLMs, Coding, Scaling Laws, China, Agents, GPUs, AGI | Lex Fridman Podcast #490
A sparse neural network architecture that uses a router to select specific 'experts' for processing.
⚡ Open Model Pretraining Masterclass — Elie Bakouch, HuggingFace SmolLM 3, FineWeb, FinePDF
A model architecture that uses sparse activation to improve efficiency.
Everything We Know About GPT-5 So Far
Rumored architecture for GPT-5 to lower inference costs.
DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters | Lex Fridman Podcast #459
Architectural technique used to reduce training and inference costs by activating only a subset of parameters.

Organizations

Products

Channels

What does Arcmira know about Mixture of Experts (MoE)?

Arcmira tracks 5 indexed media appearances or mentions for Mixture of Experts (MoE), tied to source videos, channels, and transcript-derived context.

Where does Arcmira's data about Mixture of Experts (MoE) come from?

Arcmira uses indexed YouTube videos and transcripts. Representative source evidence on this page includes "The math behind how LLMs are trained and served – Reiner Pope" with transcript-derived context and links when available.

What is Mixture of Experts (MoE) connected to?

Mixture of Experts (MoE) is connected to Google, OpenAI, DeepSeek in Arcmira's media graph.

Topic

Mixture of Experts (MoE)

Mentions

2.2M

Views

Narrative Tracking

Track Mixture of Experts (MoE) Mentions

Get alerts when "Mixture of Experts (MoE)" is mentioned on YouTube.

Expert Network

Find Topic Experts

Discover the key voices and thought leaders discussing Mixture of Experts (MoE).

Mixture of Experts (MoE) mentions on podcasts & videos

The math behind how LLMs are trained and served – Reiner Pope

@ 31:59

Dwarkesh PatelBrief•4/29/2026

The math behind how LLMs are trained and served – Reiner Pope

“A model architecture style that uses sparse activation to save compute.”

State of AI in 2026: LLMs, Coding, Scaling Laws, China, Agents, GPUs, AGI | Lex Fridman Podcast #490

@ 37:55

Lex FridmanBrief•1/31/2026

State of AI in 2026: LLMs, Coding, Scaling Laws, China, Agents, GPUs, AGI | Lex Fridman Podcast #490

“A sparse neural network architecture that uses a router to select specific 'experts' for processing.”

⚡ Open Model Pretraining Masterclass — Elie Bakouch, HuggingFace SmolLM 3, FineWeb, FinePDF

@ 21:16

Latent SpaceBrief•10/20/2025

⚡ Open Model Pretraining Masterclass — Elie Bakouch, HuggingFace SmolLM 3, FineWeb, FinePDF

“A model architecture that uses sparse activation to improve efficiency.”

@ 04:57

The AI Daily Brief: Artificial Intelligence NewsBrief•7/9/2025

Everything We Know About GPT-5 So Far

“Rumored architecture for GPT-5 to lower inference costs.”

DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters | Lex Fridman Podcast #459

@ 25:18

Lex FridmanBrief•2/3/2025

DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters | Lex Fridman Podcast #459

“Architectural technique used to reduce training and inference costs by activating only a subset of parameters.”

Mixture Of Experts Moe

Indexing

Sifting through hundreds of thousands of hours of indexed videos

Mixture Of Experts Moe

Mixture of Experts (MoE) mentions on podcasts & videos

@ 31:59

Dwarkesh PatelBrief•4/29/2026

The math behind how LLMs are trained and served – Reiner Pope

“A model architecture style that uses sparse activation to save compute.”

@ 37:55

Lex FridmanBrief•1/31/2026

State of AI in 2026: LLMs, Coding, Scaling Laws, China, Agents, GPUs, AGI | Lex Fridman Podcast #490

“A sparse neural network architecture that uses a router to select specific 'experts' for processing.”

@ 21:16

Latent SpaceBrief•10/20/2025

⚡ Open Model Pretraining Masterclass — Elie Bakouch, HuggingFace SmolLM 3, FineWeb, FinePDF

“A model architecture that uses sparse activation to improve efficiency.”

@ 04:57

The AI Daily Brief: Artificial Intelligence NewsBrief•7/9/2025

Everything We Know About GPT-5 So Far

“Rumored architecture for GPT-5 to lower inference costs.”

@ 25:18

Lex FridmanBrief•2/3/2025

DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters | Lex Fridman Podcast #459

“Architectural technique used to reduce training and inference costs by activating only a subset of parameters.”

Mixture of Experts (MoE), explained: podcasts, interviews & video clips

Representative appearances

Organizations

Products

Channels

Related topics

What does Arcmira know about Mixture of Experts (MoE)?

Where does Arcmira's data about Mixture of Experts (MoE) come from?

What is Mixture of Experts (MoE) connected to?

Mixture of Experts (MoE)

Mixture of Experts (MoE) mentions on podcasts & videos

The math behind how LLMs are trained and served – Reiner Pope

State of AI in 2026: LLMs, Coding, Scaling Laws, China, Agents, GPUs, AGI | Lex Fridman Podcast #490

⚡ Open Model Pretraining Masterclass — Elie Bakouch, HuggingFace SmolLM 3, FineWeb, FinePDF

Everything We Know About GPT-5 So Far

DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters | Lex Fridman Podcast #459

Mixture Of Experts Moe

Mixture of Experts (MoE), explained: podcasts, interviews & video clips

Representative appearances

Organizations

Products

Channels

Related topics

What does Arcmira know about Mixture of Experts (MoE)?

Where does Arcmira's data about Mixture of Experts (MoE) come from?

What is Mixture of Experts (MoE) connected to?

Mixture of Experts (MoE)

Mixture of Experts (MoE) mentions on podcasts & videos

The math behind how LLMs are trained and served – Reiner Pope

State of AI in 2026: LLMs, Coding, Scaling Laws, China, Agents, GPUs, AGI | Lex Fridman Podcast #490

⚡ Open Model Pretraining Masterclass — Elie Bakouch, HuggingFace SmolLM 3, FineWeb, FinePDF

Everything We Know About GPT-5 So Far

DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters | Lex Fridman Podcast #459