Reinforcement Learning | Jehyeok Yeon

Nov 23, 2025	Literature Review: Angles Don't Lie: Unlocking Training-Efficient RL Through the Model's Own Signals
Nov 22, 2025	Literature Review: Reasoning Models Don't Always Say What They Think
Oct 13, 2025	Literature Review: Tricks or Traps? A Deep Dive into RL for LLM Reasoning
Oct 13, 2025	Literature Review: R-Zero: Self-Evolving Reasoning LLM from Zero Data
Jul 05, 2025	Literature Review: On-Policy RL with Optimal Reward Baseline
Jun 28, 2025	Literature Review: Reason2Attack: Jailbreaking Text-to-Image Models via LLM Reasoning
Jun 14, 2025	Literature Review: Beyond the 80/20 Rule – High-Entropy Minority Tokens Drive Effective RL for LLM Reasoning
Jun 09, 2025	Literature Review: Thinkless: LLM Learns When to Think