home posts email
  • How scaling pretraining affects RL sample efficiency ↗ October 20, 2025
  • How to Vibe Code Effectively September 3, 2025
  • Proposal: Self-Refined RL (SRRL) ↗ August 4, 2025
  • Real Work ↗ July 15, 2025
  • Fast RL using off-policy sampling ↗ July 13, 2025