recommend_training

Pick a Scenario training base architecture: image LoRA family + variant, or voice clone, from a user training intent.

When to use

Call this tool when the user asks "which model should I train" / "what LoRA base fits my dataset" / "voice clone for X". It returns the exact type string to pass to manage_models create. For picking models that generate outputs, use recommend instead.

Decision tree

The picker filters by dataset shape, scores by style + subject, then picks a variant inside the winning family by priority.

Image families

FamilyKindBest forStatus
Flux 2standardNew projects training a single-image LoRA — characters, products, environments, or distinctive styles.stable
Flux 2 EditeditWhen your dataset is image pairs with text instructions describing the change.stable
Qwen ImagestandardHigh-volume use cases where budget matters and you want strong prompt adherence.stable
Qwen EditeditEdit recipes that should keep the surrounding context intact.stable
Z-ImagestandardStylized work — character art, illustration, concept art — or anywhere the highest training fidelity matters.stable
Flux 1 DevstandardExisting projects with Flux 1 LoRAs already in production. New projects should use Flux 2.legacy
Flux KontexteditExisting edit recipes already using Flux Kontext. Consider Flux 2 Edit or Qwen Edit for new projects.legacy

Voice options

  • Instant Voice Cloning (IVC) (stable) — Quick voice clone from a short audio sample. Ready in seconds.
  • Professional Voice Clone (soon) — High-fidelity clone trained on longer, cleaner audio — better expressiveness and realism.

Required inputs

At minimum, pass prompt describing the training goal — the LLM extracts everything else. Hosts that already know the modality + dataset shape can pass them as structured params and skip LLM 1.

Output shape

{
  family: "Flux 2",
  recommended: {
    type: "flux.2-dev-lora",
    label: "Dev (32B)",
    why: "..."
  },
  alternatives: [
    { type, when_better, tradeoff }
  ],
  dataset_requirements: {
    shape: "single_images",
    recommended_size: "5-15 images",
    max_size: 50
  },
  family_notes?: string,
  resolution_notes: string,
  _degraded?: "llm_unavailable" | "llm_timeout"
}

Worked examples

Image — character LoRA

Prompt: "I have 30 photos of my dog. Train a model."

  • family: Flux 2
  • recommended: flux.2-dev-lora
  • dataset: 5–15 single images, max 50

Voice — quick clone

Prompt: "15-second voice memo, want a quick clone."

  • family: Instant Voice Cloning (IVC)
  • recommended: voice-clone-ivc
  • dataset: short audio sample, <30s

Reference

Public training docs: docs.scenario.com/get-started/training/training-models