While I am calling this release 2.1, there is a fairly significant number of changes in it. For one, I've shortened the trigger tag to "source_photo" for ease of use (though I could probably shorten it to just "photo" in the future), and used Booru tags instead of sentence captions for the images. It's still capable of processing sentence-style prompts, though it does seem to prefer booru tags when specifying backgrounds. This version also doesn't need an excess of negatives like the prior versions did, and seems to function best with a CFG scale of 4-5 and the DPM++ 2M Karras sampler at around 15-17 steps. After some additional testing, I have determined that a CFG scale around 7 with the Euler A Automatic sampler at 30 steps also yields very good results, though it takes a bit longer on lower-end hardware. Depending on the character, you may need to add more negative and positive tags to enforce realism and discourage a 2D or CGI look. I also recommend putting "plastic,plastic skin,overexposure,blurry" in the negatives for most prompts unless you like the shiny skin effect that most realistic AI models seem to give people.