AI Self-Portraits in 2 Hours: Stable Diffusion LoRA Guide

22/3/2025
8-minute read

Quick Start: Turn yourself into any character or style in 2 hours. Create AI self-portraits that actually look like you with just 15-20 photos.

🎯 What You’ll Achieve

By the end of this guide, you’ll have:

✅ Your own AI model that generates realistic self-portraits
✅ Professional results in any style (cyberpunk, Renaissance, anime, etc.)
✅ Small file size (40-150MB) for easy sharing
✅ Full control over how much you look like yourself

Time Required: 1-2 hours | Difficulty: Beginner-friendly | Cost: Free (with Google Colab)

The ability to generate photorealistic images of yourself in different styles, environments, and scenarios has become increasingly accessible through AI image generation. Low-Rank Adaptation (LoRA) offers an efficient way to personalize Stable Diffusion models with just a handful of personal photos. In this tutorial, I’ll walk you through creating your own LoRA model to generate high-quality, diverse self-portraits that actually look like you.

🤔 What is LoRA and Why Use It?

When I first tried creating AI versions of myself, the results were… well, let’s just say they looked like my distant cousin from another dimension! That’s because general models don’t know what makes you uniquely you. This is where LoRA comes in.

LoRA (Low-Rank Adaptation) is a smart way to teach AI models what you look like using just 15-30 photos, without breaking the model’s creativity.

🚀 Why LoRA Beats Other Methods:

⚡ Super Efficient: Uses 90% less computing power than traditional training (my laptop survived!)
📱 Tiny Files: Models are only 40-150MB (perfect for sharing with friends)
⏱️ Lightning Fast: Complete training in 1-2 hours on free GPUs
🎛️ Full Control: Adjust how much you look like yourself (subtle to “definitely you”)
🎨 Creative Freedom: Works with any art style, lighting, or scenario

Think of it like teaching a professional artist your face—they can then draw you in any style you want!

Preparing Your Training Dataset

The quality of your training images directly impacts the success of your LoRA model. Trust me on this one—I learned it the hard way after my first attempt gave me three eyes! Here’s how to create an optimal dataset:

Image Selection Guidelines

Quantity: 15-30 images minimum (20-25 is ideal for most uses). I used 22 in my successful model.
Diversity: Include various angles, expressions, lighting conditions. Remember that time I only used forward-facing photos? The side profile generations were… creative!
Consistency: All photos should be clearly of the same person (no group photos where the model gets confused about who you are)
Quality: High-resolution, well-lit images work best. Those dim selfies from 2 AM? Maybe leave those out.
Background: Simple or removable backgrounds are preferred. I spent an extra hour removing backgrounds, and it was worth every minute.

Image Preparation Process

Resolution: Resize images to 512×512 or 768×768 pixels (square aspect ratio). I prefer 768×768 for more detail.
Background removal: Use tools like remove.bg or Photoshop. This was a game-changer for my training quality!
Face prominence: Your face should occupy approximately 40-60% of the frame. Too small and the model won’t learn your features; too large and you’ll get weird cropping in generations.
File format: Save as PNG or high-quality JPG. I stick with PNG for training—why compress what you don’t need to?
Organization: Create a dedicated folder for your training images. Future you will thank present you for staying organized!

Remember to avoid overtraining by ensuring diversity in your dataset. Including too many similar images can lead to overfitting and reduced creativity in the results. My first model could only generate me with the exact same expression because I used too many similar photos!

Setting Up Your Training Environment

You can train a LoRA model using various platforms. Here are the most popular options—I’ve personally tried all of them and will share my experiences:

Option 1: Using Google Colab (Beginner-Friendly)

Google Colab offers free GPU access, making it ideal for those without powerful hardware. This is how I created my first successful LoRA:

Visit TheLastBen’s fast-stable-diffusion Colab
Run the setup cells (get a coffee while it installs dependencies)
Enable the training tab
Upload your prepared images (this took me about 10 minutes with my internet speed)
Configure parameters as shown in the table below (I’ll share what worked for me)

Option 2: Using Kohya SS GUI (Intermediate)

After outgrowing Colab, I moved to this option for more control:

Download Kohya SS
Install dependencies following the README (prepare for some troubleshooting if you’re on Windows like me!)
Launch the GUI and navigate to the LoRA tab
Configure parameters and point to your image folder

Option 3: Using FluxGym (Beginner to Intermediate)

FluxGym is a user-friendly web UI specifically designed for training FLUX LoRAs with support for lower VRAM GPUs (12GB/16GB/20GB). I found this option perfect when I upgraded from Colab but wasn’t ready for command-line work yet:

Clone the repository: git clone https://github.com/cocktailpeanut/fluxgym
Navigate to the folder: cd fluxgym
Clone the Kohya scripts: git clone -b sd3 https://github.com/kohya-ss/sd-scripts
Run with Docker (recommended for simple setup):
```
docker compose up -d --build
```
Open your browser and navigate to: http://localhost:7860

Using FluxGym is straightforward:

Enter your LoRA information (name, trigger word)
Upload your prepared images and caption them with your trigger word
Click “Start” and let it train

What I love about FluxGym is its automatic sample image generation—you can configure it to show progress images every N steps, which helped me catch training issues early. If you want more control, there’s also an “Advanced” tab that gives you access to all of Kohya’s training options.

FluxGym automatically downloads the models when you select them, supporting Flux1-dev, Flux1-dev2pro, and others without manual downloads. It’s been a fantastic middle ground between the simplicity of Colab and the power of command-line options!

Option 4: Command Line (Advanced)

For you fellow terminal enthusiasts (I eventually graduated to this method):

# Clone the repository
git clone https://github.com/kohya-ss/sd-scripts
cd sd-scripts

# Install dependencies
pip install -r requirements.txt

# Run training
accelerate launch --num_cpu_threads_per_process 8 train_network.py \
  --pretrained_model_name_or_path="runwayml/stable-diffusion-v1-5" \
  --train_data_dir="./your_training_images" \
  --output_dir="./output" \
  --caption_extension=".txt" \
  --max_train_epochs=15 \
  --learning_rate=1e-4 \
  --lr_scheduler="cosine_with_restarts" \
  --network_module="networks.lora" \
  --network_dim=32 \
  --network_alpha=16

Critical Training Parameters

Tuning these parameters significantly affects your results. I’ve spent countless hours experimenting with these values (so you don’t have to!):

Parameter	Recommended Value	Effect
Network Dimension	32-128	Higher values capture more details but may overfit. I use 64 for my portraits.
Learning Rate	1e-4 to 5e-4	Higher rates learn faster but may be unstable. 2e-4 has been my sweet spot.
Training Steps	1500-2500	More steps improve results until overfitting. I watch the preview images and usually stop around 2000.
Batch Size	1-4	Depends on available VRAM. My poor GTX 1660 can only handle 1!
Clip Skip	2	Improves generation quality for SD 1.5. Don’t skip this one (pun intended)!
Regularization Images	0-200	Prevents overfitting; use class images. I use about 100 “person” images.

Example Training Configuration

For most self-portrait LoRAs, these settings work well (and have been tested through my many trials and errors):

Base model: runwayml/stable-diffusion-v1-5
Network Dim: 64
Network Alpha: 32
Learning rate: 2e-4
Epochs: 15-20
Clip skip: 2
Resolution: 768×768
Caption: "photo of sanj" (replace with your chosen trigger word)

Testing and Using Your LoRA

Once training completes, you’ll have a .safetensors file containing your LoRA weights. I still remember the excitement of testing my first successful model! Here’s how you can use yours:

Install the AUTOMATIC1111 WebUI
Place your .safetensors file in the models/Lora directory
Restart the WebUI if it’s running
In the text prompt, add your trigger word and <lora:your_file_name:0.8> where 0.8 is the strength

Effective Prompting Techniques

The key to getting diverse yet accurate self-portraits lies in the prompt. After hundreds of generations, here are my tried-and-true prompting patterns:

Basic format: photo of sanj, [style description], [setting], [lighting], [camera details] <lora:sanj_lora:0.8>
Style examples: portrait photography, cinematic, film grain, cyberpunk, anime, oil painting
Lighting examples: golden hour, blue hour, studio lighting, dramatic shadows
Camera details: 85mm portrait lens, Leica, Canon, bokeh, shallow depth of field

My personal favorite prompt? photo of sanj, wearing explorer outfit, in dense jungle, cinematic lighting, National Geographic style, 85mm lens, detailed, 4k <lora:sanj_lora:0.85>

Negative prompt example: deformed, distorted, disfigured, poorly drawn, bad anatomy, wrong anatomy, extra limb, missing limb, floating limbs, disconnected limbs, mutation, mutated, ugly, disgusting, blurry, out of focus

Troubleshooting Common Issues

Been there, fixed that! Here are solutions to problems I’ve personally encountered:

Problem	Possible Solution
Inconsistent likeness	Include more diverse angles, increase training steps. I added 5 profile shots to fix this!
Overfitting (too similar to training)	Add regularization images, reduce steps. I reduced from 3000 to 2000 steps.
Distorted features	Check for inconsistent images in dataset, adjust resolution. I had one weird angle causing this.
Wrong gender features appearing	Add gender-specific terms to negative prompt. “Female” in negative prompt fixed masculine features for me.
Poor quality output	Try higher resolution, adjust VAE, use better base model. Switching to SDXL base model was game-changing!

Ethical Considerations and Best Practices

When creating AI self-portraits, I always follow these guidelines and recommend you do too:

Keep your LoRA model secure, especially if you plan to create photorealistic images
Be transparent when sharing AI-generated images of yourself (I use #AIGenerated in my posts)
Respect copyright when using your model with celebrity or character prompts
Consider watermarking AI-generated content if sharing publicly

This technology opens creative possibilities but comes with responsibilities. Use it ethically and safely. I’ve had so much fun creating alternate versions of myself—from “me as a cyberpunk hacker” to “me in Renaissance painting style”—without crossing ethical lines.