Natural language has a strong impact on this model as of V7. It won’t understand everything but it can get you places. And then when you can’t squeeze any more out of it, you can use e621 tags to help refine your prompt. (or you can just use tags, either way will work.)
Start your prompt with a short natural language prompt describing what you want, then pad it with e621 tags to refine specific concepts. As this is a v-prediction model, prompt interpretation can be a bit more literal.
Many flavor words and artists from SD 1.5 work again.
PolyFur is trained on MiniGPT-4 captions, so try being really flowery with your prompts and use sentences even. (check the generation info on the preview images to see how)
If an established character isn’t coming out accurately, try increasing the strength of their token and adding a few implied tags that describe their appearance. Keep in mind that characters that aren’t very popular or don’t have many images in FluffyRock’s dataset typically won’t fare that well without a LORA.
Avoid weighting camera angle keywords too strongly, especially close-up.
Resolutions between 576 and 1088 should work reasonably well as that is the range of FluffyRock.
A example for a solo focus pic:
feral pony fluttershy ( doing something / action ), equine, quadruped, pegasus, blue green eyes,
(feral:1.1) <- (this is weighted, using ( ) and :0.1-2.0 it makes a tag weaker / stronger for instance) friendship is magic, my little pony, meadow, ( a bunch of quality tags that change depending on what you want to go for such as best quality, high quality… ect. )
BREAK ( separator that can help for multicharacter / after initial style prompt )
( other character, in this case offscreen human )
For multi character interactions it might be best to stablish style / number of characters / the type / perspective of the scene first then use a BREAK between the different characters. It helps it from getting them mixed up.
To start learning prompting I would recommend to visit the furry diffusion discord and just browsing E621 and memorizing tags that go with the kind of pic you would want to make. I recommend checking these discords for tons of examples: https://discord.gg/furrydiffusion https://discord.gg/SQVcWVbqKx