-
Notifications
You must be signed in to change notification settings - Fork 93
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Awful Image Generation #93
Comments
Try something without anatomic? We are trying to improve human body and human face in our next release. |
@lawrence-cj I cannot get anything to look realistic not just humans. See below for prompt "large oak tree". I thought your goals were to beat Flux for realism with 4x speed, but there isn't a single prompt that looks realistic to me, so I must be missing something? |
@lawrence-cj I've noticed that lowering CFG to 2.0-2.5 makes a massive difference with realism, under 2 then images don't look right, over 3 and they are as above kind of over exposed with lots of highlights on the edges of the detail etc. Below is CFG 2.5, but even on this can you see on the ball of yarn the highlights on the top are blown out? |
Oh, if you want the content to be more realistic, then the prompt needs to be refined. Line 64 in bcad148
BTW, we can't guarantee that Sana's smaller models will beat the FLUX just yet. Sana's larger model sizes and better VAEs are also in development and we have got pretty cool results. What we can guarantee is that we will keep improving and keep everything efficient. Maybe we still need time. At last, thank you for your time to play with Sana. We will keep working hard. |
Besides, rewriting the prompt to a longer one will make the quality better in our experiment. |
@lawrence-cj I am used to writing very complex prompts with Flux with layering, background, middle ground, foreground etc. I was just testing as even in flux with a few words you get very good realistic results still. Using longer prompts there is generally more beneficial for finer control of the environment, subject, focus and such. Using a few of your keywords above I don't get any better results. CFG does hugely effect the image though, I don't know if this is expected behavior or something odd going on with Comfy or schedulers etc? Don't worry, I know this is a WIP and the speed is honestly outstanding, hopefully you can get "women lying on grass" on par with Flux! CFG 4 CFG 2.5 |
@shaun-ba Thanks for your valuable testing results. I think "CFG will affect a lot" is a normal phenomenon. Higher CFG will make the image saturated and lower will make the image style more diverse and unstable. Is there no big difference between CFGs in FLUX? What kind of CFG is better during your testing? Besides, our official Flow-DPM-Solver scheduler is not supported in ComfyUI for now. For the KSampler-Euler in ComfyUI, I didn't do a lot of experiments, I just made sure the workflow was working. |
@lawrence-cj as flux is distilled, it has no CFG |
Emmm, Flux-schnel has it. |
that is fake guidance, not real cfg. it is distilled into that input argument and it behaves nothing like true cfg; you can use it up to 20.0 and beyond without ruining the image |
I mean what is this? I've followed all guides and reported various bugs for the past few days and this is the outcome?
The text was updated successfully, but these errors were encountered: