Face Distortion #52
Comments
Face distortion happens when the face is relatively small in the image. This problem will be mitigated in Sana-1.5 with DC-AE 1.5 later this year. |
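To make the "relatively small face" point concrete, here is a rough arithmetic sketch of how many latent cells a face occupies after the autoencoder downsamples the image. The 32x factor is the one mentioned later in this thread; the 8x factor is included only for comparison with a conventional latent-diffusion VAE. The image and face sizes and the helper name `latent_footprint` are illustrative assumptions, not values from the Sana codebase.

```python
def latent_footprint(image_px: int, face_px: int, compression: int):
    """Return (latent grid side, face side in latent cells).

    image_px and face_px are side lengths in pixels (hypothetical values);
    compression is the spatial downsampling factor of the autoencoder.
    """
    grid = image_px // compression      # latent cells along one image side
    face_cells = face_px / compression  # latent cells along one face side
    return grid, face_cells


# A 64-pixel face in a 1024-pixel image, at 8x and 32x compression.
for factor in (8, 32):
    grid, cells = latent_footprint(1024, 64, factor)
    print(f"{factor}x: latent grid {grid}x{grid}, "
          f"face spans about {cells:.0f}x{cells:.0f} cells")
```

Under these assumed sizes the face covers about 8x8 latent cells at 8x compression but only about 2x2 at 32x, which leaves very little capacity for facial detail. Doubling the output resolution with the face at the same relative size (a 128-pixel face in a 2048-pixel image) would give about 4x4 cells at 32x.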
Thanks for your response! Could it be addressed by training on a facial dataset at 2K resolution? |
It will help if you try to fine-tune on some facial datasets with smaller faces. |
Facial datasets usually have large faces |
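Since most facial datasets are dominated by large faces, one way to act on the suggestion above is to keep only the samples where the face takes up a small share of the image. The following is a minimal sketch under stated assumptions: it presumes each sample already comes with a face bounding box (from annotations or a detector of your choice), and the `Sample` type, the function names, and the 2% threshold are all hypothetical, not part of Sana.

```python
from typing import Iterable, List, NamedTuple, Tuple


class Sample(NamedTuple):
    path: str
    image_size: Tuple[int, int]          # (width, height) in pixels
    face_box: Tuple[int, int, int, int]  # (left, top, right, bottom), assumed given


def face_area_fraction(sample: Sample) -> float:
    """Fraction of the image area covered by the face bounding box."""
    width, height = sample.image_size
    left, top, right, bottom = sample.face_box
    face_area = max(0, right - left) * max(0, bottom - top)
    return face_area / (width * height)


def select_small_faces(samples: Iterable[Sample],
                       max_fraction: float = 0.02) -> List[Sample]:
    """Keep samples whose face covers at most max_fraction of the image.

    The 0.02 default is an arbitrary illustrative threshold.
    """
    return [s for s in samples if face_area_fraction(s) <= max_fraction]
```

For example, a 100x100 face box in a 1024x1024 image covers just under 1% of the area and would be kept, while a 512x512 box covers 25% and would be dropped.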
wow 200k OMG
I tried fine-tuning on a 200k+ facial dataset at 1.5K resolution, which gave a slight improvement, but it's still not a perfect solution. Looking forward to SANA-1.5 and the new DC-AE!
|
@lawrence-cj will DC-AE remain an AE or switch to a VAE? |
Impossible to convert to VAE
|
why? |
I think it's because the feature dimensions are different |
If you think about it, this is interesting. They went to a lot of trouble (and published a paper) to switch from a VAE to an AE and reach 32x compression. If you now asked them to go back to a VAE, they would punch the screen. Training such a model is estimated to cost 200 US dollars in rented compute. |
Hi,
When the face region is relatively small, it tends to become distorted. Is this due to the high compression ratio of DC-AE? Is there a solution to this problem? Would using a 2K image work better?
Thank you very much for your wonderful work!!