DeepSlide: From Artifacts to Presentation Delivery
Original reporting by arXiv (cs.AI)

Presentations remain a primary medium for scholarly communication, yet most AI slide generators fall short in one critical area. These tools produce visually plausible decks, but they often under-optimize the *delivery process*: pacing, narrative flow, and preparation. The presenter is left with a static artifact and little help in turning it into an engaging talk.
Rethinking presentation AI
A new multi-agent system, DeepSlide, targets the full presentation pipeline rather than the deck alone. It takes a human-in-the-loop approach to each stage: requirement elicitation, time-budgeted narrative planning, evidence-grounded slide-script generation, audience attention augmentation, and rehearsal support. Its architecture combines a controllable logical-chain planner, a content-tree retriever for grounding, and sequential rendering.
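To make "time-budgeted narrative planning" concrete, here is a minimal sketch of one way a talk's time could be split across sections and converted into a script length limit. It is an illustration only, not DeepSlide's planner: the `Section` class, the `weight` field, the `allocate_time` function, and the 130 words-per-minute default speaking rate are all assumptions made for this example.

```python
from dataclasses import dataclass


@dataclass
class Section:
    """One narrative section of a talk (hypothetical structure)."""
    title: str
    weight: float  # relative importance, chosen by the presenter


def allocate_time(sections, total_seconds, words_per_minute=130):
    """Split a time budget across sections in proportion to their weights.

    Returns one dict per section with its share of the time and a word
    budget for the spoken script at the assumed speaking rate.
    """
    total_weight = sum(s.weight for s in sections)
    if total_weight <= 0:
        raise ValueError("section weights must sum to a positive number")

    plan = []
    for s in sections:
        seconds = total_seconds * s.weight / total_weight
        plan.append({
            "title": s.title,
            "seconds": round(seconds, 1),
            # Script words that fit in this section's time slot.
            "max_words": round(seconds / 60 * words_per_minute),
        })
    return plan
```

For a 10-minute talk with sections weighted 1, 2, and 1, this gives the middle section half of the time. A script that exceeds a section's `max_words` is a signal that the pacing is off before rehearsal starts.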
To evaluate the system, the researchers introduced a dual-scoreboard benchmark that scores static artifact quality separately from dynamic delivery quality. Tested across 20 diverse domains and audience profiles, DeepSlide matched strong baselines on conventional slide quality and consistently outperformed them on delivery metrics, with reported gains in narrative flow, pacing precision, and the alignment between slides and presenter script.
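The idea behind a dual scoreboard can be sketched in a few lines: artifact and delivery scores are aggregated separately and never blended into one number, so a polished deck cannot mask weak delivery. This is a hypothetical illustration, not the paper's benchmark. The delivery metric names echo the dimensions mentioned above, while the artifact metric names, the function name, and the simple averaging are assumptions.

```python
from statistics import mean

# Hypothetical metric groups; the paper's actual metrics may differ.
ARTIFACT_METRICS = ("layout", "readability")
DELIVERY_METRICS = ("narrative_flow", "pacing", "slide_script_alignment")


def dual_scoreboard(scores):
    """Aggregate per-metric scores into two separate boards.

    `scores` maps metric name to a numeric score. The two averages are
    reported side by side and deliberately never combined.
    """
    return {
        "artifact": mean(scores[m] for m in ARTIFACT_METRICS),
        "delivery": mean(scores[m] for m in DELIVERY_METRICS),
    }
```

Two systems with the same `artifact` score can then be compared on `delivery` alone, which is the kind of comparison the benchmark is described as enabling.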
DeepSlide shifts the focus of presentation AI from the look of the deck to the *delivery* itself. Its human-in-the-loop, multi-agent design addresses narrative planning, pacing, and slide-script alignment, which traditional AI generators often overlook, and it supports the presenter from initial requirements through rehearsal. That positions the tool less as a content generator and more as a collaborator in preparing the talk. The reported benchmark results, with substantial gains in delivery while maintaining artifact quality, support that approach.