recommend_training

Pick a Scenario training base architecture: image LoRA family + variant, or voice clone, from a user training intent.

When to use

Call this tool when the user asks "which model should I train" / "what LoRA base fits my dataset" / "voice clone for X". It returns the exact type string to pass to model_create. For picking models that generate outputs, use recommend instead.

Decision tree

The picker filters by dataset shape, scores by style + subject, then picks a variant inside the winning family by priority.

Image families

Family	Kind	Best for	Status
Flux 2	standard	New projects training a single-image LoRA — characters, products, environments, or distinctive styles.	stable
Flux 2 Edit	edit	When your dataset is image pairs with text instructions describing the change.	stable
Qwen Image	standard	High-volume use cases where budget matters and you want strong prompt adherence.	stable
Qwen Edit	edit	Edit recipes that should keep the surrounding context intact.	stable
Z-Image	standard	Stylized work — character art, illustration, concept art — or anywhere the highest training fidelity matters.	stable
Flux 1 Dev	standard	Existing projects with Flux 1 LoRAs already in production. New projects should use Flux 2.	legacy
Flux Kontext	edit	Existing edit recipes already using Flux Kontext. Consider Flux 2 Edit or Qwen Edit for new projects.	legacy

Voice options

Instant Voice Cloning (IVC) (stable) — Quick voice clone from a short audio sample. Ready in seconds.
Professional Voice Clone (soon) — High-fidelity clone trained on longer, cleaner audio — better expressiveness and realism.

Required inputs

At minimum, pass prompt describing the training goal — the LLM extracts everything else. Hosts that already know the modality + dataset shape can pass them as structured params and skip LLM 1.

Output shape

{
  family: "Flux 2",
  recommended: {
    type: "flux.2-dev-lora",
    label: "Dev (32B)",
    why: "..."
  },
  alternatives: [
    { type, when_better, tradeoff }
  ],
  dataset_requirements: {
    shape: "single_images",
    recommended_size: "5-15 images",
    max_size: 50
  },
  family_notes?: string,
  resolution_notes: string,
  _degraded?: "llm_unavailable" | "llm_timeout"
}

Worked examples

Image — character LoRA

Prompt: "I have 30 photos of my dog. Train a model."

family: Flux 2
recommended: flux.2-dev-lora
dataset: 5–15 single images, max 50

Voice — quick clone

Prompt: "15-second voice memo, want a quick clone."

family: Instant Voice Cloning (IVC)
recommended: voice-clone-ivc
dataset: short audio sample, <30s

Reference

Public training docs: docs.scenario.com/get-started/training/training-models