You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
right now I have to wrap shampoo in optax.flatten or the tracing time on init is crazy long (284s in one case) and I don't know the compile time because I didn't wait long enough, but it's at least 10 minutes.
but wrapping in optax.flatten will reshard the entire parameter array every step, which is often going to be quite slow.
The text was updated successfully, but these errors were encountered:
right now I have to wrap shampoo in
optax.flatten
or the tracing time on init is crazy long (284s in one case) and I don't know the compile time because I didn't wait long enough, but it's at least 10 minutes.but wrapping in
optax.flatten
will reshard the entire parameter array every step, which is often going to be quite slow.The text was updated successfully, but these errors were encountered: