RSTABLEDIFFUSION Telegram 55741
Wan2.2 I2V - 2 vs 3 Ksamplers - questions on steps & samplers

I'm currently testing different WFs between 2 and 3 Ksamplers for Wan2.2 ITV and wanted to ask for different experiences and share my own + settings!

3 Ksamplers (HN without Lightning, then HN/LN with Lightning Strength 1) seems to give me the best output quality, BUT for me it seems to change the likeness of the subject from the input image a lot over the course of the video (often even immediately after the first frame).

On 3KS I am using 12 total steps, 4 Steps on HN1, 4 on HN2 and 4 on LN, Euler Simple worked best for me there. Maybe more LN steps would be better? Not tested yet!

2 Ksamplers (HN/LN both with Lightning Strength 1) faster generation at generally slightly worse quality than 3 Ksamplers, but the likeness of the input image stays MUCH more consistent for me. For that though outputs can be hit or miss depending on the input (f.e. weird colors, unnatural stains on human skin, slight deformations etc.).

On 2 KS I am using 10 total steps, 4 on HN and 6 on LN. LCM + sgm_uniform worked best for me here, more steps with other samplers (like Euler simple/beta) often resulted in generally the better video, but then screwing up some anatomical detail which made it weird :D

Happy about any Step&Sampler combination you can recommend for me to try. I mostly work with human subjects, both SFW and non, so skin detail is important to me. Subjects are my own creations (SDXL, Flux Kontext etc.), so using a character lora to get rid of the likeness issue in the 3KS option is not ideal (except if I wanted to create a Lora for each of my characters which.. I'm not there yet :D ).

I wanted to try to work without lightning because I heard it impacts quality a lot, but I could not find a proper setting either on 2 or 3KS and the long generation times are rough to do proper testing for me. Between 20 and 30 steps still giving blurry/hazy videos, maybe I need way more? I wouldn't mind the long generation time for videos that are important for me.

Also wanting to try the WanMoE Ksampler as I heard a lot of great things, but did not get around to build a WF for it yet. Maybe that's my solution?

I generally let it generate in 720x1280 and most input images I also scaled to 720x1280 before. If using bigger images as input, I sometimes had WAY better outputs in terms of details (skin details especially), but sometimes worse. So not sure if it really factors in? Maybe some of you have experiences with this.

Generating in 480p and then upscaling did not work great for me. Especially in terms of skin detail I feel like 480p leaves out a lot and upscaling does not really bring it back (did not test SeedVR yet, but wanting to).

https://redd.it/1o3w5r0
@rStableDiffusion



tgoop.com/rStableDiffusion/55741
Create:
Last Update:

Wan2.2 I2V - 2 vs 3 Ksamplers - questions on steps & samplers

I'm currently testing different WFs between 2 and 3 Ksamplers for Wan2.2 ITV and wanted to ask for different experiences and share my own + settings!

3 Ksamplers (HN without Lightning, then HN/LN with Lightning Strength 1) seems to give me the best output quality, BUT for me it seems to change the likeness of the subject from the input image a lot over the course of the video (often even immediately after the first frame).

On 3KS I am using 12 total steps, 4 Steps on HN1, 4 on HN2 and 4 on LN, Euler Simple worked best for me there. Maybe more LN steps would be better? Not tested yet!

2 Ksamplers (HN/LN both with Lightning Strength 1) faster generation at generally slightly worse quality than 3 Ksamplers, but the likeness of the input image stays MUCH more consistent for me. For that though outputs can be hit or miss depending on the input (f.e. weird colors, unnatural stains on human skin, slight deformations etc.).

On 2 KS I am using 10 total steps, 4 on HN and 6 on LN. LCM + sgm_uniform worked best for me here, more steps with other samplers (like Euler simple/beta) often resulted in generally the better video, but then screwing up some anatomical detail which made it weird :D

Happy about any Step&Sampler combination you can recommend for me to try. I mostly work with human subjects, both SFW and non, so skin detail is important to me. Subjects are my own creations (SDXL, Flux Kontext etc.), so using a character lora to get rid of the likeness issue in the 3KS option is not ideal (except if I wanted to create a Lora for each of my characters which.. I'm not there yet :D ).

I wanted to try to work without lightning because I heard it impacts quality a lot, but I could not find a proper setting either on 2 or 3KS and the long generation times are rough to do proper testing for me. Between 20 and 30 steps still giving blurry/hazy videos, maybe I need way more? I wouldn't mind the long generation time for videos that are important for me.

Also wanting to try the WanMoE Ksampler as I heard a lot of great things, but did not get around to build a WF for it yet. Maybe that's my solution?

I generally let it generate in 720x1280 and most input images I also scaled to 720x1280 before. If using bigger images as input, I sometimes had WAY better outputs in terms of details (skin details especially), but sometimes worse. So not sure if it really factors in? Maybe some of you have experiences with this.

Generating in 480p and then upscaling did not work great for me. Especially in terms of skin detail I feel like 480p leaves out a lot and upscaling does not really bring it back (did not test SeedVR yet, but wanting to).

https://redd.it/1o3w5r0
@rStableDiffusion

BY r/StableDiffusion




Share with your friend now:
tgoop.com/rStableDiffusion/55741

View MORE
Open in Telegram


Telegram News

Date: |

Deputy District Judge Peter Hui sentenced computer technician Ng Man-ho on Thursday, a month after the 27-year-old, who ran a Telegram group called SUCK Channel, was found guilty of seven charges of conspiring to incite others to commit illegal acts during the 2019 extradition bill protests and subsequent months. Telegram channels fall into two types: Don’t publish new content at nighttime. Since not all users disable notifications for the night, you risk inadvertently disturbing them. The public channel had more than 109,000 subscribers, Judge Hui said. Ng had the power to remove or amend the messages in the channel, but he “allowed them to exist.” Find your optimal posting schedule and stick to it. The peak posting times include 8 am, 6 pm, and 8 pm on social media. Try to publish serious stuff in the morning and leave less demanding content later in the day.
from us


Telegram r/StableDiffusion
FROM American