Rich Sutton, The OaK Architecture: A Vision of SuperIntelligence from Experience - RLC 2025
“It's the Ubox. The viewbox. I want you to notice because this is a is a um kind of reinforcement learning where we don't assume that the state is available to the agent.”