Robo R1 - Search News

OS-R1: Agentic Operating System Kernel Tuning with Reinforcement Learning

OS-R1 is an agentic Linux kernel tuning framework that leverages reinforcement learning (RL) and large language models (LLMs) for efficient kernel configuration. It introduces a rule-based RL approach ...

GitHub

Pioneering Perception Policy with Reinforcement Learning

We present Perception-R1, a scalable RL framework using Group Relative Policy Optimization (GRPO) during MLLM post-training. Key innovations: 🎯 Perceptual Perplexity Analysis: We introduce a novel ...
