MiniCPM
2 mentions, 2.0K views

“A specific model mentioned as a successful validation of μP at scale.”

“A Chinese model mentioned for its successful upcycling from 2.4B to 13.4B parameters.”
Arcmira media summary
Arcmira tracks where MiniCPM is discussed across indexed YouTube videos, transcripts, channels, and related entities.
Arcmira tracks 2 indexed media appearances or mentions for MiniCPM, tied to source videos, channels, and transcript-derived context.
Arcmira uses indexed YouTube videos and transcripts. Representative source evidence on this page includes "Stanford CS336 Language Modeling from Scratch | Spring 2026 | Lecture 11: Scaling Laws" with transcript-derived context and links when available.
MiniCPM is connected to Muon, load balancing, and WSD learning rate in Arcmira's media graph.