Core Framework

Use this structure for consistent results: Subject + Action + Style + Context Example: “Red fox sitting in tall grass, wildlife documentary photography, misty dawn”
  • Subject: The main focus (person, object, character)
  • Action: What the subject is doing or their pose
  • Style: Artistic approach, medium, or aesthetic
  • Context: Setting, lighting, time, mood, or atmospheric conditions
Prompt Example

Example of a structured prompt

Use Structured Descriptions

Use natural language for relationships and descriptions, but direct specifications for technical and atmospheric elements.
  1. “Human explorer in futuristic gear walking through cyberpunk forest, dramatic atmospheric lighting, sci-fi fantasy art style, cinematic composition”
  2. “An astronaut with a silver spacesuit floating outside the International Space Station, cinematic photography with dramatic lighting, peaceful and awe-inspiring”
  3. “Retro game style detective in old school suit, upper body shot, colorful, futuristic design with vibrant glow”
Human explorer in forest
Astronaut in space
Futuristic detective

Word Order Matters

Front-load your most important elements. FLUX pays more attention to what comes first. Priority order: Main subject → Key action → Critical style → Essential context → Secondary details

Enhancement Layers

Build beyond the basic framework with these optional layers: Foundation: Subject + Action + Style + Context + Visual Layer: Specific lighting, color palette, composition details + Technical Layer: Camera settings, lens specs, quality markers + Atmospheric Layer: Mood, emotional tone, narrative elements Example progression:
  • Foundation: “An astronaut floating outside the space station, cinematic photography”
  • Enhanced: “An astronaut with silver spacesuit floating gracefully outside the International Space Station, cinematic photography with dramatic lighting, bathed in golden sunlight, deep blue Earth tones, shallow depth of field, 85mm lens, conveying wonder and achievement”

Optimal Prompt Length

Short (10-30 words): Quick concepts and style exploration Medium (30-80 words): Usually ideal for most projects Long (80+ words): Complex scenes requiring detailed specifications

Work Without Negatives

Instead of “no crowds,” write “peaceful solitude” Instead of “without glasses,” write “clear, unobstructed eyes” Ask: “If this thing wasn’t there, what would I see instead?”

Quick Templates

Portrait: [Subject description], [pose/expression], [style], [lighting], [background] Product: [Product details], [placement], [lighting setup], [style], [mood] Landscape: [Location/setting], [time/weather], [camera angle], [style], [atmosphere] Architecture: [Building/space], [perspective], [lighting], [style], [mood]

Text Integration

FLUX can generate readable text in images when you describe it clearly. Here’s how to get good text results:
  • Use quotation marks: “The text ‘OPEN’ appears in red neon letters above the door”
  • Specify placement: Where the text appears in relation to other elements
  • Describe style: “elegant serif typography” or “bold industrial lettering”

Common Use Case Patterns

Different types of images work better with different prompt structures. Put the most important elements first based on what you want to emphasize:

Character-focused

For portraits, character art: Start with detailed character description and then add the rest of the prompt.
Detailed character → Action → Style → Context
Building progression:
  1. Foundation: “Elderly wizard with long white beard and piercing blue eyes”
  2. + Action: “Elderly wizard with long white beard and piercing blue eyes, casting a spell”
  3. + Style: “Elderly wizard with long white beard and piercing blue eyes, casting a spell, fantasy art style”
  4. + Context: “Elderly wizard with long white beard and piercing blue eyes, casting a spell, fantasy art style, in a magical forest clearing”

Context-focused

For landscapes, architectural shots: Lead with the setting and then add the rest of the prompt.
Setting → Atmospheric conditions → Style → Technical specs
Building progression:
  1. Foundation: “Ancient Greek temple ruins”
  2. + Atmosphere: “Ancient Greek temple ruins at sunset, golden hour lighting”
  3. + Style: “Ancient Greek temple ruins at sunset, golden hour lighting, cinematic photography style”
  4. + Details: “Ancient Greek temple ruins at sunset, golden hour lighting, cinematic photography style, with scattered marble columns”

Style-focused

For artistic interpretations: Begin with the art style or reference and then add the rest of the prompt.
Artistic reference → Subject → Context → Technical execution
Building progression:
  1. Foundation: “Van Gogh painting style with swirling brushstrokes”
  2. + Subject: “Van Gogh painting style with swirling brushstrokes, depicting a modern city street”
  3. + Context: “Van Gogh painting style with swirling brushstrokes, depicting a modern city street, vibrant blues and yellows”
  4. + Technical: “Van Gogh painting style with swirling brushstrokes, depicting a modern city street, vibrant blues and yellows, impressionist technique”

Technical-focused

For professional photography: Start with the subject and then add the rest of the prompt, finishing with camera settings.
Subject → Background → Lighting → Lens/settings
Building progression:
  1. Foundation: “Professional headshot of a business executive”
  2. + Background: “Professional headshot of a business executive, clean white background”
  3. + Lighting: “Professional headshot of a business executive, clean white background, studio lighting”
  4. + Technical: “Professional headshot of a business executive, clean white background, studio lighting, 85mm lens, f/1.4, shallow depth of field”

Professional Photography Control (Advanced)

Background blur vs. sharpness (f-numbers): Usually called “aperture”, the f-number controls how blurry vs. sharp your background is. Small numbers (f/1.4) blur the background; big numbers (f/8) keep everything sharp. Scene width & zoom (mm numbers): Usually called “focal length”, the mm number controls how much of the scene you see and how “zoomed in” it looks. Small numbers (24mm) show wide scenes; big numbers (85mm) zoom in closer. Lighting: Allow you to control the lighting style in the image. For instance, "Rembrandt lighting" for dramatic portraits, "golden hour" for warm atmosphere Example: “Professional headshot, 85mm lens, f/2.8, Rembrandt lighting, corporate setting”

Quality Control Checklist

  • Does your prompt include the core framework elements?
  • Are your most important elements mentioned first?
  • Have you replaced vague terms with specific descriptors?
  • Are you describing what you want to see, not what you want to avoid?
  • Do all elements work together toward a unified vision?

Ready to Start Creating?

Learn More about Prompting