Auriel Wright, working on RL at Gemini, highlights common issues with RL environments, including not reading trajectories and lacking domain experts. She emphasizes the importance of improving data quality.
Rl Environments
Your weight: normal
- 26.
Your weight: normal
Auriel Wright, working on RL at Gemini, highlights common issues with RL environments, including not reading trajectories and lacking domain experts. She emphasizes the importance of improving data quality.