constant.device in mog_log_prob remains on cpu #1343

ali-akhavan89 · 2024-12-31T03:12:18Z

I have a custom embedding network, and everything in my script is on CUDA. However, I haven't been able to use 'mdn' in multi-round training (I'm still on SBI 0.23.2). After adding a lot of debugging statements, I realized that in the calculations of mog_log_prob, constant is still on the CPU. It might still be an issue on my side, but I just wanted to share this. Here are the print statements for the other components of that function:

theta.device = cuda:0 //from my script
experiment_data.device = cuda:0 //from my script

Using SNPE-C with non-atomic loss

theta.device in mog_log_prob = cuda:0
logits_pp.device in mog_log_prob = cuda:0
means_pp.device in mog_log_prob = cuda:0
precisions_pp.device in mog_log_prob = cuda:0
weights.device in mog_log_prob = cuda:0
constant.device in mog_log_prob = cpu //this one
log_det.device in mog_log_prob = cuda:0
theta_minus_mean.device in mog_log_prob = cuda:0
exponent.device in mog_log_prob = cuda:0

janfb · 2025-01-02T11:43:55Z

Hi @ali-akhavan89, thanks for digging into this and reporting!

Yes, this looks like a bug: constant should be created on the current device. I am just wondering why our GPU tests don't catch this, because they seem to cover multi-round NPE_C with MDN. I will have a look and propose a fix soon.
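
For illustration, here is a minimal sketch of what "created on the current device" could look like in PyTorch. It is not the actual sbi code or the eventual fix: the helper name `gaussian_log_norm_constant` is hypothetical, and the sketch assumes that constant is the usual Gaussian normalization term -(d/2) * log(2π). The only point it shows is that a tensor built with `torch.tensor(...)` lands on the default device (normally the CPU) unless `device=` is passed, so the device can be taken from an input such as theta.

```python
import math

import torch


def gaussian_log_norm_constant(theta: torch.Tensor) -> torch.Tensor:
    """Hypothetical helper, not part of sbi.

    Returns -(d / 2) * log(2 * pi) as a tensor on the same device and with
    the same dtype as `theta`, where d is the size of theta's last dimension.
    """
    dim = theta.shape[-1]
    # Passing device= and dtype= explicitly keeps the constant next to theta.
    # Without them, torch.tensor(...) would use the default device.
    two_pi = torch.tensor(2 * math.pi, device=theta.device, dtype=theta.dtype)
    return -(dim / 2.0) * torch.log(two_pi)


# Falls back to the CPU when no GPU is available, so the sketch runs anywhere.
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
theta = torch.zeros(4, 3, device=device)

constant = gaussian_log_norm_constant(theta)
print("theta.device    =", theta.device)
print("constant.device =", constant.device)
```

With this pattern, the two printed devices match whatever device theta lives on, which is the behavior the debug output above expects for constant.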

ali-akhavan89 added the question Further information is requested label Dec 31, 2024

janfb added the bug Something isn't working label Jan 2, 2025
