Photo Generation System — Complete Technical Report

After 43 rounds of systematic testing, we have a proven formula for portrait and half-body photo generation (85% and 76% avg). Full body remains harder (60%) but the lora_scale distance relationship is now understood. Scene photos work via direct LoRA generation — the biggest breakthrough of the project. The next frontier is environment integration (making people look like they belong in the scene, not pasted on top) and overproducing + curating (generate 3-4x what we need, pick the best).

The Winning Formula

ModelFlux LoRA (trained per subject on 8 selfies)

Guidance3.5 (scene photos) / 2.5 (studio portraits)

Steps35

LoRA Scale0.9 (close-up) → 1.1 (half body) → 1.3 (full body)

Film StockKodak Portra 400 ONLY (warm skin tones)

IdentityOHWX {age} {ethnicity} {gender} — no hair/eye colour

CandidatesGenerate 4, keep best (+3-5 pts)

Best PosesWalking + Laughing (natural movement)

2. The 10 Rules We Learned

Rules of AI Photo Generation

1Identity lives in the LoRA weights, not the prompt. Don't describe hair colour, eye colour, or facial features — the LoRA already knows. Adding them FIGHTS the model and drops scores by 5+ points.

2Distance kills identity. The further the camera, the smaller the face, the weaker the LoRA's grip. Compensate by increasing lora_scale: 0.9 (close-up) → 1.1 (half body) → 1.3 (full body).

3Never post-process. Face swap, inpainting, outpainting, Kontext edit — ALL make things worse. Generate it right the first time. Every modification degrades identity.

4Concrete visual anchors, not abstract descriptions. "Shoes visible on the ground" works. "Full body photograph" doesn't. Flux needs specific physical details, not concepts.

5Movement makes photos real. Walking and laughing poses transform stiff AI shots into natural influencer photos. Static poses look like test shots.

6Sunny beats dramatic. Natural sunlight (Amalfi Coast) produces more realistic photos than dramatic lighting (Tokyo Neon). Neon looks more "AI-generated".

7Kodak Portra 400 only. Never mix film stocks. Fuji = cooler/greener (wrong for portraits). Ektar = over-saturated skin. Portra = warm, natural skin tones.

8Overgenerate and curate. Generate 4 candidates, keep the best. Adds 3-5 points for ~$0.12 extra per style. The cheapest quality improvement available.

9Weave the environment INTO the person. Don't describe person + backdrop separately. Describe snow ON their shoulders, frost in their hair, rosy cheeks from cold. Otherwise they look photoshopped.

10LoRA quality = selfie quality. Bad selfies = bad LoRA = bad photos forever. The 8-angle selfie protocol exists for a reason. Neil 2 (39% avg) proves this.

3. The 3 Shot Types

After testing everything from selfie-distance to 15-metre environment shots, we've settled on exactly 3 shot types. Each has its own proven lora_scale, aspect ratio, and prompting approach.

Close-Up / Selfie

78-85%

Aspect: 3:4
LoRA Scale: 0.9
Face dominates frame
Best for headshots & portraits
0.7 tested = 39% (too low)

Half Body

76%

Aspect: 3:4
LoRA Scale: 1.1
Waist-up, shows outfit
Best for scene photos
Best-of-4 → 85%

Full Body

60%

Aspect: 9:16
LoRA Scale: 1.3
Head to shoes, ground visible
Needs movement language
Emily hit 85% (laughing)

4. LoRA Scale by Distance — The Key Discovery

The single most important finding from rounds 31-43: lora_scale must increase with camera distance. At full body distance, the face is a tiny fraction of the image. The LoRA's influence gets diluted across a larger canvas. Higher lora_scale forces more of the trained face through.

LoRA Scale Experiment Timeline

5. Scoring System

6. The 43-Round Journey

7. What Failed & Why

8. Scene Photos Breakthrough

10 Working Scenes (all ≥70% on portrait)

The Environment Integration Problem (R41, R43)

Framing	LoRA Scale	Avg Score	Evidence	Status
Close-up	0.9	78-85%	R1-R13 baseline. R39 tested 0.7 = 39% (failed)	CONFIRMED
Half body	1.1	76%	R28-R30 scenes. R35 full body at 1.1 = 60% (+9%)	CONFIRMED
Full body	1.3	60%	R37 at 1.3 = 61%. R43 snow at 1.3 = 55%	WORKING
Distance	1.3+	6%	R38 environment-first. Too far back, LoRA can't hold	PARKED

Round	LoRA Scale	Framing	Avg	Key Finding
R1-R13	0.9	Portrait	82%	Baseline — optimal for close-up
R31	0.9	Full body	56%	First full body — face too small
R33	0.9	Full body	52%	Verbose prompts don't help
R34	0.9	Full body	51%	Movement helps quality, not score
R35	1.1	Full body	60%	+9 pts — scale compensates for distance
R37	1.3	Full body	61%	Slight improvement over 1.1
R38	1.3	Distance	6%	Too far — LoRA can't hold at any scale
R39	0.7	Close-up	39%	FAILED — identity drifts, 0.9 is the floor
R43	0.9/1.1/1.3	All 3	52%	Snow scene drags all framings down

Level	Score Range	Action
Green	70%+	Ship — production quality
Amber	25-69%	Review — may be usable
Red	<25%	Reject — auto-regenerate

Round	Phase	Focus	Avg	Key Result
R1	Core	Baseline — default settings	68%	First generation, no tuning
R2	Core	Guidance sweep (1-5)	72%	2.5 emerged as optimal
R3	Core	Steps sweep (20-50)	74%	35 steps optimal (50 = no gain)
R4	Core	LoRA scale sweep (0.7-1.0)	76%	0.9 best for portraits
R5	Core	Film stock comparison	77%	Portra 400 wins
R6	Core	Identity — with hair/eye	73%	Hair/eye colour HURTS scores
R7	Core	Identity — without hair/eye	78%	+5pts removing hair/eye from prompt
R8	Core	Lighting variations	79%	Studio dramatic + golden hour best
R9	Core	Background variations	80%	Solid/gradient > complex
R10	Core	Clothing styles test	81%	Tailored blazer, smart casual top
R11	Core	Multi-subject validation	80%	Formula holds across all 8 subjects
R12	Core	Best-of-4 selection	83%	+3-5 pts from picking best of 4
R13	Core	Production run — all styles	85%	Peak portrait performance
R14	Style	New clothing styles	82%	6 new styles, 4 scored well
R15	Style	Female-specific styles	80%	Gala dress, cocktail — good
R16	Style	Extended style library	76%	Niche styles pull average down
R17	Outpaint	Outpainting v1	66%	Just added border
R18	Outpaint	Outpainting v2	65%	Edge artifacts
R19	Outpaint	Outpainting v3	64%	Prompt guidance ignored
R20	Outpaint	Face swap	62%	Avg -10% from originals
R21	Outpaint	Kontext Pro edit	65%	Distorts face
R22	Outpaint	Inpainting	58%	Tiny files, garbage
R23	Outpaint	Combined methods	63%	Stacking failures doesn't help
R24	Outpaint	Outpainting v4	62%	More border = worse
R25	Outpaint	Kontext Pro scene swap	19%	WORST — complete identity destruction
R26	Outpaint	Outpainting v5	64%	Declared dead end
R27	Expression	Smiling test	62%	-20% vs neutral. LoRA trained neutral
R28	Scene	LoRA direct scene gen	85%	BREAKTHROUGH — scene in prompt works
R29	Scene	Portrait scenes (all subjects)	78%	10 scenes, all ≥70%
R30	Scene	Half body scenes	76%	Three-quarter framing, 9/10 ≥70%
R31	Full Body	First full body	56%	Face too small at distance
R32	Full Body	Portrait → outpaint down	75%	Two-step works 75% of the time
R33	Full Body	Explicit framing text	52%	More words ≠ more specific
R34	Full Body	Pose/movement language	51%	Photos look natural (scores same)
R35	LoRA Scale	lora_scale 1.1	60%	+9 pts — biggest lever found
R36	LoRA Scale	lora_scale 1.3 (uniform)	61%	Marginal improvement over 1.1
R37	LoRA Scale	lora_scale 1.3 + distance	61%	Distance language doesn't help uniformly
R38	Distance	Environment-first prompt	6%	Too far back — PARKED
R39	LoRA Scale	lora_scale 0.7 close-up	39%	FAILED — 0.9 is the floor
R40	Full Body	Body proportion fix	52%	"Natural proportions" prompt added
R41	Scene	Snow scene half body	52%	People look photoshopped in scene
R43	Scene	Snow 3 shot types + Portra	52%	Snow drags all framings down ~25%

Scene	Portrait	Half Body	Best For
Tokyo Neon	85%	84%	Urban, edgy look (but more "AI" feel)
Art Gallery	83%	80%	Clean, minimal, sophisticated
Amalfi Coast	80%	78%	Warm, natural, sun-drenched
Paris Cafe	78%	80%	European elegance
Modern Office	78%	80%	Professional headshots
London Street	77%	76%	Urban, moody, British
NYC Rooftop	76%	74%	Skyline backdrop
Riviera Terrace	75%	72%	Coastal luxury
Garden Party	74%	70%	Outdoor, natural light
Mountain Lodge	72%	70%	Cozy, warm tones

Scene photos WORK in terms of face scores — but the person often looks photoshopped into the scene rather than being part of it. This is the next frontier to solve.

This improved visual integration in R43 but didn't fix it completely — snow scene is inherently harder than sunny scenes. The approach needs further testing on easier scenes (Amalfi Coast, Paris Cafe) where the integration is more subtle (warm light on skin, wind in hair).

9. Pose & Movement Language

Round 34 tested 4 movement descriptions. The scores were similar across all poses, but the visual quality of Walking and Laughing was dramatically better — photos looked like real influencer editorial shoots instead of stiff AI test shots.

10. Body Proportions & Film Stock

The Flux Body Problem

Pose	Avg Score	Why
Laughing	56%	Natural expression, face visible, dynamic energy. Best overall.
Walking	55%	Natural stride, body in motion. Use with "shoes visible on the ground".
Looking Away	48%	Face turned from camera — scorer can't match what it can't see.
Leaning	47%	Static pose, less natural. Better than standing still but worse than walking.

Flux has a systematic bias toward exaggerated hips and butt on women in full body shots. This is a model limitation, not a prompt issue. Mitigation:

Film Stock — Kodak Portra 400 ONLY

11. Subject Performance

12. Prompt Architecture

Film Stock	Character	Verdict
Kodak Portra 400	Warm, flattering skin tones, soft natural grain	USE THIS
Fuji Pro 400H	Cooler, pastel, greener tones	Too cold for portraits
Kodak Ektar 100	Vivid, saturated colours	Over-saturates skin
CineStill 800T	Blue/cyan tungsten shift	Night scenes only
Kodak Tri-X	Black and white, high contrast	Removes colour info

Subject	Gender	Portrait Avg	Full Body Avg	Best Ever	Notes
Sarah	Female	89%	66%	94%	Top performer. Ideal LoRA training data.
Mike	Male	86%	48%	92%	Consistent. Strong across all styles.
Emily	Female	85%	71%	92%	Best female full body (hit 85% laughing in R35).
Chloe 2	Female	84%	64%	89%	Second session improved over Chloe 1.
Scott	Male	82%	63%	88%	Primary test subject. Bald = distinctive LoRA.
Chloe	Female	80%	64%	86%	Weak LoRA — sometimes generates Chinese-looking face.
Emily 2	Female	80%	55%	87%	Consistent with Emily 1.
Neil 2	Male	39%	32%	52%	Poor selfie quality. Bad training = bad LoRA forever.

Every prompt is built from 6 modular layers via prompt_builder.js. No prompts are hardcoded in generation scripts.

Identity

OHWX {age} {ethnicity} {gender} — trigger word + demographic anchoring only. NO hair/eye.

Clothing

Style-specific outfit with physical details. Winter gear: "snow settling on shoulders". No black dress for women.

Movement

"Walking confidently", "captured mid-laugh". Concrete anchors: "shoes visible on the ground".

Environment

Scene description INTERACTING with subject. "Snowflakes on collar", not just "snowy background".

Lighting

Golden hour, soft winter light, warm sun. Sunny natural > dramatic neon for realism.

Camera

"Shot on Kodak Portra 400". One film stock only. Slight natural grain = good.

Example Prompts by Shot Type

CLOSE-UP (lora 0.9):

    OHWX 30 year old Caucasian man, wearing a tailored navy blazer, close-up portrait with warm golden hour light illuminating face, shallow depth of field, natural relaxed expression, shot on Kodak Portra 400
  

HALF BODY (lora 1.1):

    OHWX 30 year old Caucasian man, wearing a linen shirt, three-quarter shot walking along Amalfi Coast cliff path, warm Mediterranean sun on face, captured mid-laugh, natural body proportions, shot on Kodak Portra 400
  

FULL BODY (lora 1.3):

    OHWX 30 year old Caucasian man, wearing a casual shirt and dark jeans, walking confidently along a sun-drenched coastal path, shoes visible on the ground, full figure from head to shoes, natural body proportions, warm smile, golden hour light, shot on Kodak Portra 400
  

13. Settings Sensitivity

Guidance 2.5-3.5 1.0-5.0 Below 2: too loose. Above 4: over-saturated. 2.5 for studio, 3.5 for scenes.

Steps 35 20-50 Min 1, max 50 for Flux. 20=soft, 30=usable, 35=optimal, 50=no gain but 2x cost.

LoRA Scale 0.9-1.3 0.0-2.0 0.7=identity drift. 0.9=portrait. 1.1=half body. 1.3=full body. >1.5=waxy/over-fit.

Film Stock Portra 400 5 stocks Only warm skin tone stock. Never mix stocks. Never use Fuji for portraits.

Candidates 4 1-6 1→4: +3-5 pts. 4→6: +1 pt (not worth 50% cost increase).

Identity Age+Eth+Gen 3 variants Full description: -5 pts. Age+ethnicity+gender: best. No anchor: random drift.

Aspect Ratio 3:4 / 9:16 — 3:4 for portrait + half body. 9:16 for full body (forces vertical framing with ground).

Problem	Impact	Possible Fix
Environment integration	People look photoshopped into scenes	Weave scene INTO person description (R43 approach). Test on easy sunny scenes first.
Distance inconsistency	Same prompt = different distances per shot	Composition anchoring: "head near top of frame, feet at bottom". Generate more, filter by distance.
Full body scores	60% avg vs 85% portrait	Higher candidates (4-6), stricter curation. Or portrait → outpaint down (R32 method).
Body proportions (women)	Exaggerated hips/butt	"Natural body proportions" prompt. Community Body FLUX FIX LoRA (untested).
Snow/cold scenes	52% avg (25% below sunny scenes)	Snow is inherently harder. Focus on sunny scenes for production. Snow = nice-to-have.
Chloe 1 LoRA	Sometimes generates Chinese face	Retrain with better selfie angles. Chloe 2 session already 4% better.