You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
sess_config=tf.ConfigProto(
# If the device you specify doesn't exist, allow TF to assign the device automaticallyallow_soft_placement=True,
log_device_placement=False, # Whether to print the device assignment log
)
sess_config.gpu_options.force_gpu_compatible=Truesess_config.gpu_options.allow_growth=Truewithtf.train.MonitoredTrainingSession(master="", checkpoint_dir=self.__ckpt_dir, config=sess_config):
Willing to contribute
Yes
The text was updated successfully, but these errors were encountered:
Current behavior
In distributed mode, deeprec works fine when training on one hour of data, but hangs when training on one day or more. Log:
Nvidia-smi:
cpu:
Expected behavior
Deeprec works fine in distributed mode. Log:
System information
Code to reproduce
Willing to contribute
Yes
The text was updated successfully, but these errors were encountered: