Even though I'm running the quantized version with diffusers, shouldn't it be a bit faster? A single Img2Img task is taking 6 minutes on an NVIDIA H100 80GB.
The slow speed is intended.
Why?