36氪获悉,国内大厂首个开源龙虾类产品LobsterAI (网易有道龙虾)近日宣布上线图片生成与视频生成能力,并一次性接入包括Seedream、Seedance、HappyHorse、MiniMax-Hailuo在内的模型。
AI 点评 · 多模型矩阵整合,开源策略降低使用门槛,推动AI创作生态。
共 16 条相关资讯 · 来自历史归档
36氪获悉,国内大厂首个开源龙虾类产品LobsterAI (网易有道龙虾)近日宣布上线图片生成与视频生成能力,并一次性接入包括Seedream、Seedance、HappyHorse、MiniMax-Hailuo在内的模型。
AI 点评 · 多模型矩阵整合,开源策略降低使用门槛,推动AI创作生态。
Autoregressive (AR) video diffusion enables variable-length synthesis, but long-horizon generation often suffers from accumulated errors and identity drift. For efficiency, existing methods commonly a…
AI 点评 · 提出通用检索增强框架,解决长视频生成的累积误差与身份漂移,兼顾效率与质量。
The recent "Reasoning with Video" paradigm utilizes Video Generation Models (VGMs) to generate temporally coherent visual trajectories to complete reasoning tasks. Although state-of-the-art VGMs excel…
AI 点评 · 自适应测试时优化让视觉语言模型成为视频推理的“好老师”,突破传统方法局限。
Official Implementation of LongLive-RAG: A general retrieval-augmented framework for long video generation.
AI 点评 · 开源长视频RAG框架,突破生成时长限制,为AI视频创作提供新路径。
Text-to-video (T2V) generation faces challenging questions when generating videos with long horizons containing multiple events. Inspired by the intrinsics of the diffusion process, we probe video dif…
AI 点评 · 无需额外训练,即可精准控制多事件视频生成,大幅降低算力门槛,推动视频创作民主化。
Recent advances in video generative models have promoted rapid progress in controllable world models. However, maintaining fine-grained spatio-temporal consistency under long-horizon reasoning remains…
As video diffusion models (VDMs) advance toward world models, a key question arises: do they truly understand causality, or merely overfit to statistical temporal patterns? Existing benchmarks mostly…
AI 点评 · 从因果视角评估视频生成模型,揭示其是否真正理解物理世界规律,而非仅拟合统计模式。
Recent video diffusion foundation models have achieved remarkable progress in high-quality video generation, yet turning them into real-time interactive video world models remains challenging. Interac…
AI 点评 · 开源全栈框架实现实时交互视频世界模型,突破现有视频生成技术瓶颈。
Autoregressive video diffusion models generate streaming video by producing frames sequentially, conditioning each chunk on previously generated content. These models are structurally anchored to the…
AI 点评 · 自进化锚点机制突破流式视频生成瓶颈,实现更连贯的无限长视频输出。
Joint audio-video generation aims to synthesize temporally synchronized and semantically coherent visual-acoustic content. However, existing open-source methods mainly rely on either dual-tower design…
AI 点评 · 多模态对齐技术突破,让音视频生成更同步自然,推动AI内容创作进入新阶段。
Runway Unlimited Pro Gen-3 with unlimited credits, premium models, motion brush, lip sync, upscale, and the complete creative suite—top subscription tier fully…
The narrative quality of a video fundamentally determines its perceptual value. Although existing video generation methods can produce visually appealing content, they predominantly rely on sparse con…
AI 点评 · 智能导演技术突破,首次实现关键帧驱动的叙事节奏控制,让AI视频生成更懂剧情。
Spatial intelligence requires visual representations that capture both semantic objects and geometric structure in the physical world. To support this, two major pre-training schemes are now widely us…
AI 点评 · 对比视觉语言与视频生成模型,揭示哪种预训练范式更利于空间智能发展。
Real-time streaming joint audio-video generation for character animation requires a generator to speak the requested transcript, maintain visual identity across chunks, and run within a strict playbac…
AI 点评 · 解耦式编排实现长时流式音视频生成,突破实时角色动画的连贯性与延迟瓶颈。
Recent advances have substantially improved real-time interactive video generation in the autoregressive regime. However, most existing few-step autoregressive video generation methods, often distille…
AI 点评 · 提出单步自回归视频生成新范式,有望突破实时交互瓶颈,显著提升生成稳定性。
Diffusion models have demonstrated strong results on image synthesis in past years. Now the research community has started working on a harder task—using it for video generation. T…
AI 点评 · 视频生成新突破,扩散模型从图像迈向动态世界。