Workflow
Paper2Poster
icon
Search documents
不愁了!开源智能体Paper2Poster「一键生成」学术海报
机器之心· 2025-06-06 09:12
做海报有多痛苦? 大家做学术应该都懂那种感觉:临近 DDL 前熬夜赶制海报,得把上万字的论文浓缩进一页 PPT,还要图文并茂兼顾美观。一不小心排版崩了、字体太小或 者信息太多,导师改起来也是花样百出,直呼「再精简!」……可以说,做学术海报是科研工作中让人头秃的环节之一。 要是有个工具能替我们自动把论文变成海报就好了? 还别说,真的有科研团队朝这个方向努力了!2025 年 5 月,来自滑铁卢大学、新加坡国立大学和牛津大学的研究者发布了一个有趣的系统—— Paper2Poster。顾名思义,它试图用大型语言模型(LLM)当助手,把长篇论文内容自动生成一张精美的学术海报。 论文标题: Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers 这个工作开创了学术海报自动生成的新领域:一方面,它提出了首个从论文生成海报的完整框架,能够智能提炼论文并排版;另一方面,作者还搭建了配套 的评测基准和指标体系,来量化评估 AI 生成海报的效果。换句话说,不仅要让 AI 会「画」海报,还要知道它画得好不好,这可是前所未有的尝试。 插图 1: ...
论文秒变海报!开源框架PosterAgent一键生成顶会级学术Poster
量子位· 2025-06-03 07:59
Core Viewpoint - The article introduces PosterAgent, a tool designed to convert academic papers into visually appealing posters, highlighting its efficiency and effectiveness compared to existing methods like GPT-4o [2][18]. Group 1: PosterAgent Overview - PosterAgent can transform a 22-page paper into an editable ".pptx" poster for only $0.0045, significantly reducing token usage by 87% compared to GPT-4o [2][36]. - The tool is built upon the Paper2Poster framework, which establishes the first academic poster evaluation standard, addressing gaps in long-context and multi-modal compression assessments [4][18]. Group 2: Evaluation Metrics - Paper2Poster includes 100 pairs of AI-related papers and their corresponding posters, covering various subfields like computer vision (19%), natural language processing (17%), and reinforcement learning (10%) [20]. - The evaluation metrics focus on four dimensions: visual quality, text coherence, overall assessment, and PaperQuiz, which simulates communication between authors and readers [22][23]. Group 3: PosterAgent Components - The PosterAgent framework consists of three key components: a parser for extracting key content, a planner for organizing text and visuals, and a painter-commenter for generating and refining the poster layout [28][29]. - The system employs a top-down design approach to ensure coherence and alignment of content [25]. Group 4: Performance Comparison - In comparative tests, PosterAgent achieved the highest graphic relevance and visual similarity to human-designed posters, scoring an average of 3.72 when evaluated by a visual language model (VLM) [31][32]. - While GPT-4o-image had the highest visual similarity, it recorded the lowest coherence, indicating that its outputs may appear attractive but lack textual clarity [30][31]. Group 5: Cost Efficiency - PosterAgent demonstrated significant cost efficiency, requiring only 101.1K and 47.6K tokens for different variants, translating to a cost of $0.55 (based on GPT-4o) or $0.0045 (based on Qwen) per poster [36].