bytedance

https://github.com/bytedance

bytedance / seedance-v1.5/image-to-video

Transform static images into dynamic videos with synchronized audio. Supports text-guided animations and start/end frame keying.

bytedance / seedance-v1.5/text-to-video

Generate high-quality videos with synchronized audio directly from text prompts using Seedance 1.5.

bytedance / seedream-v5/edit

Edit and seamlessly compose images using text prompts and multiple reference images with the fast, high-quality Seedream 5.0 Lite model.

bytedance / seedream-v5/text-to-image

Generate high-quality, intelligent images from text prompts using the fast Lite version of Seedream 5.0.

bytedance / seedream-v4.5/edit

Advanced image editing model by ByteDance that uses text prompts and up to 10 reference images to stylize, transform, and seamlessly composite visuals.

bytedance / seedream-v4.5/text-to-image

A next-generation text-to-image model by ByteDance, capable of high-fidelity generation, precise text rendering, and complex stylistic control for highly detailed visual compositions.

bytedance / seedream-v4.5/text-to-image (deprecated)

Generate ultra-high-resolution, photorealistic images from text prompts with customizable sizes and batch generation capabilities.

bytedance / seedance-v1-pro

Seedance 1.0 generates 1080p videos with smooth motion, rich detail, and diverse styles; the Pro version adds multi-shot narrative and advanced instruction following for cinematic results.

bytedance / seedream-v4

Seedream 4.0 is a next-generation image creation model that unifies generation and editing in a single architecture, enabling advanced multimodal reasoning and reference consistency while delivering stunning 4K images with significantly faster inference.

bytedance / sdxl-lightning-4step

SDXL-Lightning is a fast text-to-image model, distilled from Stable Diffusion XL, that produces high-quality 1024px images in just four steps.