Models & training

ControlNet

ControlNet is an add-on that conditions a diffusion model on a reference structure - such as a pose skeleton, edge map or depth map - so you control composition precisely, not just with words.

Prompts are great at describing content but weak at pinning down exact layout. ControlNet solves that. It is a neural network that plugs into a diffusion model and feeds it an extra structural hint - derived from a reference image - so the output follows that structure while your prompt fills in the style and content.

Common control types

Pose (OpenPose): copy a person's exact body pose into a new character or scene.
Canny / line art: follow the edges of a reference so the composition matches precisely.
Depth: preserve the 3D layout and spatial arrangement of a scene.
Scribble: turn a rough doodle into a finished image that respects your lines.

Why it matters

ControlNet gives you reproducible composition control that prompting alone cannot. It is essential when you need a specific pose, a consistent layout across a series, or a faithful sketch-to-render workflow. It pairs naturally with image-to-image and inpainting for fine-grained editing.

Try it in the generator

Put controlnet to work right now - free daily generations, commercial license included.

Start creating free

Related terms

Back to the glossary

ControlNet

Common control types

Why it matters

Related terms

Ready to get started?