Scam2Prompt: A Scalable Framework for Auditing Malicious Scam Endpoints in Production LLMs Paper • 2509.02372 • Published Oct 2, 2025 • 1
FlashSyn: Flash Loan Attack Synthesis via Counter Example Driven Approximation Paper • 2206.10708 • Published Jan 12, 2024
Demystifying Invariant Effectiveness for Securing Smart Contracts Paper • 2404.14580 • Published Jul 14, 2024
OpenTracer: A Dynamic Transaction Trace Analyzer for Smart Contract Invariant Generation and Beyond Paper • 2407.10039 • Published Jul 14, 2024
Beyond Semantic Similarity: Rethinking Retrieval for Agentic Search via Direct Corpus Interaction Paper • 2605.05242 • Published 14 days ago • 106
LLM Safety From Within: Detecting Harmful Content with Internal Representations Paper • 2604.18519 • Published 27 days ago • 26
LLM Safety From Within: Detecting Harmful Content with Internal Representations Paper • 2604.18519 • Published 27 days ago • 26
RSVP: Reasoning Segmentation via Visual Prompting and Multi-modal Chain-of-Thought Paper • 2506.04277 • Published Jun 4, 2025
VEU-Bench: Towards Comprehensive Understanding of Video Editing Paper • 2504.17828 • Published Apr 24, 2025
SWE-QA-Pro: A Representative Benchmark and Scalable Training Recipe for Repository-Level Code Understanding Paper • 2603.16124 • Published Mar 17 • 3
ThinkTwice: Jointly Optimizing Large Language Models for Reasoning and Self-Refinement Paper • 2604.01591 • Published Apr 2 • 42
SEAM: Semantically Equivalent Across Modalities Benchmark for Vision-Language Models Paper • 2508.18179 • Published Aug 25, 2025 • 9
ThinkTwice: Jointly Optimizing Large Language Models for Reasoning and Self-Refinement Paper • 2604.01591 • Published Apr 2 • 42
MosaicMem: Hybrid Spatial Memory for Controllable Video World Models Paper • 2603.17117 • Published Mar 17 • 87
Dr. Bench: A Multidimensional Evaluation for Deep Research Agents, from Answers to Reports Paper • 2510.02190 • Published Jan 29 • 20
VideoScore2: Think before You Score in Generative Video Evaluation Paper • 2509.22799 • Published Sep 26, 2025 • 26
CAD-Tokenizer: Towards Text-based CAD Prototyping via Modality-Specific Tokenization Paper • 2509.21150 • Published Sep 25, 2025 • 5