Nov 23, 2025 Literature Review: Angles Don't Lie: Unlocking Training-Efficient RL Through the Model's Own Signals Nov 22, 2025 Literature Review: Reasoning Models Don't Always Say What They Think Oct 13, 2025 Literature Review: Tricks or Traps? A Deep Dive into RL for LLM Reasoning Oct 13, 2025 Literature Review: R-Zero: Self-Evolving Reasoning LLM from Zero Data Jul 05, 2025 Literature Review: On-Policy RL with Optimal Reward Baseline Jun 28, 2025 Literature Review: Reason2Attack: Jailbreaking Text-to-Image Models via LLM Reasoning Jun 14, 2025 Literature Review: Beyond the 80/20 Rule – High-Entropy Minority Tokens Drive Effective RL for LLM Reasoning Jun 09, 2025 Literature Review: Thinkless: LLM Learns When to Think