Sep 30, 2025 Literature Review: Agentic Misalignment – How LLMs Could Be Insider Threats Sep 30, 2025 Literature Review: Effective Red-Teaming of Policy-Adherent Agents Sep 25, 2025 Literature Review: Agent A/B — Automated and Scalable Web A/B Testing with Interactive LLM Agents Jul 19, 2025 Literature Review: AI Research Agents for Machine Learning: Search, Exploration, and Generalization in MLE-bench Jun 25, 2025 Literature Review: RedCode: Risky Code Execution and Generation Benchmark for Code Agents Jun 21, 2025 Literature Review: A Practical Memory Injection Attack against LLM Agents Jun 09, 2025 Literature Review: Gaming Tool Preferences in Agentic LLMs May 12, 2025 Literature Review: PolicyEvol-Agent: Evolving Policy via Environment Perception and Self-Awareness with Theory of Mind May 11, 2025 Agentic AI: The New 'Groundbreaking Technology' of 2025 Apr 29, 2025 Literature Review: Learning to Contextualize Web Pages for Enhanced Decision Making by LLM Agents Apr 29, 2025 Literature Review: API Agents vs. GUI Agents: Divergence and Convergence