All articles

Article

What Is Seedance 2.0?

Seedance 2.0 is ByteDance's latest multimodal AI video generation model, available through Jimeng (即梦) and integrated natively on HiArt. Unlike text-only video models, Seedance 2.0 accepts images, reference videos, and audio alongside your prompt — then generates a short clip with matching sound effects or background music.
If you have heard about "omni-reference" or the @ syntax in AI video circles, that is Seedance 2.0's signature workflow: you upload assets, assign each one a role in natural language, and the model stitches motion, style, and rhythm together in a single 4–15 second render.

What makes it different

  • Multimodal inputs — combine up to 12 reference files (images, videos, audio) in one generation
  • Built-in audio — outputs include synced sound effects or background music by default
  • Director-level control — replicate camera moves, choreography, visual effects, and beat-matched motion from reference media
  • Multiple aspect ratios — 16:9, 9:16, 1:1, 4:3, 3:4, 21:9, or adaptive for cross-platform exports
  • Canvas-native on HiArt — iterate on the same node without leaving your project

Three generation modes

Seedance 2.0 supports three distinct workflows. Pick the one that matches your starting assets.

Text-to-video

Describe the scene, action, and camera in plain language. No uploads required — ideal for quick concept tests and mood boards.

First / last frame

Upload one image as the opening frame, or two images as the start and end points. The model generates the motion between them — useful for product reveals, logo animations, and controlled transitions.

Omni-reference

The most powerful mode. Upload reference images for character look or scene style, reference videos for camera movement or choreography, and reference audio for rhythm or mood. Use @Image1, @Video1, @Audio1 syntax in your prompt to tell the model what each file is for.

Input limits (omni-reference)

Input
Limit
Format
Max size
Images
≤ 9
jpeg, png, webp, bmp, tiff, gif
30 MB each
Videos
≤ 3
mp4, mov
50 MB each, 2–15s total
Audio
≤ 3
mp3, wav
15 MB each, ≤ 15s total
Total
≤ 12 files

Output specs

  • Duration: 4–15 seconds (selectable)
  • Resolution: 480p or 720p
  • Audio: auto-generated sound effects / background music
  • Aspect ratio: 16:9, 9:16, 4:3, 3:4, 1:1, 21:9, or adaptive

Seedance 2.0 vs 2.0 Fast

HiArt offers two variants of the same model family:
  • Seedance 2.0 — full-quality renders for final output. Use when you are happy with the prompt and ready to ship.
  • Seedance 2.0 Fast — same multimodal controls with faster turnaround. Ideal for motion previews and prompt iteration before committing to a final render.
A typical workflow: draft with 2.0 Fast, refine your prompt and references, then switch to 2.0 for the final clip.

Who is it for?

  • E-commerce & ads — product loops, hero shots, and 6-second social placements
  • Short drama & storytelling — emotional close-ups, dialogue-driven scenes, beat-synced cuts
  • Social content — dance replication, trend remixes, vertical 9:16 exports
  • Creative prototyping — one-take tracking shots, VFX replication, style transfer from reference video

Things to know before you start

  • Uploaded images and videos cannot contain realistic human faces — the platform blocks such content for compliance.
  • Reference audio cannot be used alone; pair it with at least one reference image or video.
  • First/last frame mode and omni-reference mode cannot be mixed in a single generation.
  • Results depend heavily on prompt quality — especially how clearly you assign roles to each @ reference.

Ready to generate? Open a video node on HiArt canvas, select Seedance 2.0, and start with a simple text-to-video prompt. When you are ready to go deeper, read our Seedance 2.0 prompt writing guide for the @ reference system, camera language, and copy-paste templates.