Closed
Description
I am using a simple red image as input:
from stable_diffusion_pytorch import pipeline
from PIL import Image
prompts = ["a photograph of an astronaut riding a horse"]
input_images = [Image.open('red.png')]
images = pipeline.generate(prompts, input_images=input_images)
images[0].save('output.png')
But in the output the input image is shifted down by 8px,8px, which leaves an ugly brown border:
I am pretty sure it happens during the Encode pass, as it's already shifted in latent space. Here is a custom dump of the latent space to an image:
Something in the Encode pass is shifting it by one pixel in the latent space (which would match the 8px seen in the output if the VAE downsamples by 8x), and I can't figure out what.
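To put a number on the shift, a small helper along these lines might be useful. This is only a sketch in plain Python, not part of `stable_diffusion_pytorch`: `content_offset`, `is_red`, the `VAE_SCALE` factor of 8, and the brown/red test colours are all assumptions made for illustration, and the synthetic image simply imitates the symptom described above.

```python
# Assumption: the VAE downsamples by 8 per side, so 1 latent pixel = 8 image pixels.
VAE_SCALE = 8

def content_offset(pixels, is_content):
    """Return (x, y) of the top-left corner of the content's bounding box.

    `pixels` is a list of rows, each row a list of (r, g, b) tuples.
    Returns None if no pixel satisfies `is_content`.
    """
    rows = [y for y, row in enumerate(pixels) if any(is_content(p) for p in row)]
    if not rows:
        return None
    width = len(pixels[0])
    cols = [x for x in range(width) if any(is_content(row[x]) for row in pixels)]
    return cols[0], rows[0]

def is_red(p, tol=40):
    """Hypothetical test for 'close to pure red', with a per-channel tolerance."""
    r, g, b = p
    return r >= 255 - tol and g <= tol and b <= tol

# Synthetic stand-in for the output: an 8 px brown band on the top and left
# edges, red everywhere else (colours are made up for the example).
BROWN = (120, 60, 30)
RED = (255, 0, 0)
size = 32
pixels = [[RED if x >= 8 and y >= 8 else BROWN for x in range(size)]
          for y in range(size)]

dx, dy = content_offset(pixels, is_red)
print("image-space shift:", (dx, dy))
print("latent-space shift:", (dx // VAE_SCALE, dy // VAE_SCALE))
```

For a real output, `pixels` could be built from `Image.open('output.png').convert('RGB')` by reading each pixel with `getpixel((x, y))`. Running the same check on the dumped latent image would show whether the offset is already one pixel at that stage.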