RSTABLEDIFFUSION Telegram 55851
Compile fp8 on RTX 30xx in triton-windows 3.5

I've merged the patch to let torch.compile work with fp8 on Ampere GPUs and let's see how it rolls out: https://github.com/woct0rdho/triton-windows/pull/140

I hoped this could be superseded by GGUF + better torch.compile or Nunchaku, but as of PyTorch 2.9 I realized that fp8 + the block swap in ComfyUI-WanVideoWrapper (or ComfyUI-wanBlockswap for native workflows) runs faster and causes fewer recompilations than GGUF + the block swap in ComfyUI-GGUF on my machine.

This is the first feature in the 'core' part (rather than the Windows support code) that's deliberately different from the official Triton. It should also work on Linux but I'm not sure what's the best way to publish Linux wheels.

I'm not an expert on PTX. Welcome help in optimizing those PTX code.

triton-windows 3.2.0.post21 is also released, which supports fp8 on RTX 20xx.

https://redd.it/1o75zgt
@rStableDiffusion



tgoop.com/rStableDiffusion/55851
Create:
Last Update:

Compile fp8 on RTX 30xx in triton-windows 3.5

I've merged the patch to let torch.compile work with fp8 on Ampere GPUs and let's see how it rolls out: https://github.com/woct0rdho/triton-windows/pull/140

I hoped this could be superseded by GGUF + better torch.compile or Nunchaku, but as of PyTorch 2.9 I realized that fp8 + the block swap in ComfyUI-WanVideoWrapper (or ComfyUI-wanBlockswap for native workflows) runs faster and causes fewer recompilations than GGUF + the block swap in ComfyUI-GGUF on my machine.

This is the first feature in the 'core' part (rather than the Windows support code) that's deliberately different from the official Triton. It should also work on Linux but I'm not sure what's the best way to publish Linux wheels.

I'm not an expert on PTX. Welcome help in optimizing those PTX code.

triton-windows 3.2.0.post21 is also released, which supports fp8 on RTX 20xx.

https://redd.it/1o75zgt
@rStableDiffusion

BY r/StableDiffusion


Share with your friend now:
tgoop.com/rStableDiffusion/55851

View MORE
Open in Telegram


Telegram News

Date: |

To edit your name or bio, click the Menu icon and select “Manage Channel.” To view your bio, click the Menu icon and select “View channel info.” SUCK Channel Telegram Earlier, crypto enthusiasts had created a self-described “meme app” dubbed “gm” app wherein users would greet each other with “gm” or “good morning” messages. However, in September 2021, the gm app was down after a hacker reportedly gained access to the user data. Hashtags are a fast way to find the correct information on social media. To put your content out there, be sure to add hashtags to each post. We have two intelligent tips to give you:
from us


Telegram r/StableDiffusion
FROM American