You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In the PiGDM paper (Sec A.1, Algorithm 1) it says that we need to scale the guidance term by $\sqrt{\alpha_t}$.
In the code we scale by $\sqrt{\alpha_t} \cdot \sqrt{\alpha_{t-1}}$:
If we only scale by $\sqrt{\alpha_t}$ we get NaN during inference due to large guidance.
From were this additional scaling by $\sqrt{\alpha_{t-1}}$ comes from?
The text was updated successfully, but these errors were encountered:
In the PiGDM paper (Sec A.1, Algorithm 1) it says that we need to scale the guidance term by$\sqrt{\alpha_t}$ .$\sqrt{\alpha_t} \cdot \sqrt{\alpha_{t-1}}$ :
In the code we scale by
If we only scale by$\sqrt{\alpha_t}$ we get $\sqrt{\alpha_{t-1}}$ comes from?
NaN
during inference due to large guidance.From were this additional scaling by
The text was updated successfully, but these errors were encountered: