Reward Function
Sifting through hundreds of thousands of hours of indexed videos
Reward Function
Sifting through hundreds of thousands of hours of indexed videos
Reward Function
3
Mentions
113.0K
Views

“The criteria defining what an AI considers a 'reward' during training.”

“That's always been the hard challenge with reinforcement learning has been, in domains that are more messy or real-world-like, how do you specify the reward function or the objective function that you...”

“We just have a reward function and you just have a next state which is a new prompt which is really not related in the history.”
Arcmira media summary
Arcmira tracks where Reward function is discussed across indexed YouTube videos, transcripts, channels, and related entities.
The criteria defining what an AI considers a 'reward' during training.
That's always been the hard challenge with reinforcement learning has been, in domains that are more messy or real-world-like, how do you specify the reward function or the objective function that you're trying to optimize?
We just have a reward function and you just have a next state which is a new prompt which is really not related in the history.
Arcmira tracks 3 indexed media appearances or mentions for Reward function, tied to source videos, channels, and transcript-derived context.
Arcmira uses indexed YouTube videos and transcripts. Representative source evidence on this page includes "Could LLMs Be The Route To Superintelligence? — With Mustafa Suleyman" with transcript-derived context and links when available.
Reward function is connected to DeepMind, OpenAI, Amazon in Arcmira's media graph.