- reflection
- research
- opinion
- creative
•
•
•
-
Literature Review: Beyond the 80/20 Rule – High-Entropy Minority Tokens Drive Effective RL for LLM Reasoning
-
Literature Review: Auto-Patching: Enhancing Multi-Hop Reasoning in Language Models
-
Literature Review: Thinkless: LLM Learns When to Think
-
Literature Review: DASH: Input-Aware Dynamic Layer Skipping for Efficient LLM Inference with Markov Decision Policies
-
Literature Review: Gaming Tool Preferences in Agentic LLMs