Add --print-progress flag to show puller progress #66

scele · 2018-02-14T15:45:44Z

Add minimal prints to indicate puller progress if the --print-progress
flag is passed to the fast puller. The intention is that the bazel
container_pull workspace rules could eventually output something,
instead of bazel seemingly getting stuck for 10 minutes when
pulling an image that is many gigabytes in size.

We are intentionally not using the logging facility and the existing
--stderrthreshold argument to display these prints, because that
will produce log-formatted output that looks too detailed when inlined
with other bazel output. The intention is to show user-friendly
progress instead:

Downloading from gcr.io/tensorflow/tensorflow:latest (1/12)
Downloading from gcr.io/tensorflow/tensorflow:latest (2/12)
Downloading from gcr.io/tensorflow/tensorflow:latest (3/12)
Downloading from gcr.io/tensorflow/tensorflow:latest (4/12)
Downloading from gcr.io/tensorflow/tensorflow:latest (5/12)
...
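To make the contrast above concrete, here is a minimal sketch of the same message written as a bare line and routed through a logging handler. The helper names (`format_progress`, `print_progress_line`, `log_progress_line`), the logger name, and the log format string are illustrative assumptions; they are not taken from this PR or from the puller's actual logging configuration.

```python
import io
import logging
import sys


def format_progress(image_name, index, total):
    """Build one plain progress line (hypothetical helper)."""
    return 'Downloading from {} ({}/{})'.format(image_name, index, total)


def print_progress_line(message, stream=None):
    """Write a bare line to stderr (or the given stream), bypassing logging."""
    stream = sys.stderr if stream is None else stream
    stream.write(message + '\n')
    stream.flush()


def log_progress_line(message, stream):
    """For contrast: send the same message through a logging handler."""
    logger = logging.getLogger('puller_sketch')
    handler = logging.StreamHandler(stream)
    # Assumed format, only to show that log output carries extra prefixes.
    handler.setFormatter(
        logging.Formatter('%(levelname)s %(name)s: %(message)s'))
    logger.addHandler(handler)
    logger.setLevel(logging.INFO)
    logger.propagate = False
    try:
        logger.info(message)
    finally:
        logger.removeHandler(handler)


plain, logged = io.StringIO(), io.StringIO()
msg = format_progress('gcr.io/tensorflow/tensorflow:latest', 1, 12)
print_progress_line(msg, plain)
log_progress_line(msg, logged)
```

In this sketch `plain` holds only the message, while `logged` holds the message with a level and logger-name prefix, which is the kind of extra detail the description wants to avoid inlining with other bazel output.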

Currently this depends on a forked version of puller.par. Pending pull request to upstream puller.par is here: google/containerregistry#66

Add minimal prints to indicate puller progress if container_pull has print_progress=True attribute. The intention is that the pull operation would provide some feedback to the user, instead of bazel getting seemingly stuck for 10 minutes when pulling an image that is many gigabytes in size. Pending pull request to upstream puller.par is here: google/containerregistry#66

xingao267 · 2018-10-05T14:59:42Z

client/v2_2/save_.py

@@ -168,7 +172,8 @@ def write_file(name, accessor,
      future_to_params[f] = digest_name

      layer_name = os.path.join(directory, '%03d.tar.gz' % idx)
-      f = executor.submit(write_file, layer_name, image.blob, blob)
+      message = 'Downloading from {} ({}/{})'.format(image.name(), idx+1, num_layers)


Downloading from gcr.io/tensorflow/tensorflow:latest (1/12 layers)
Downloading from gcr.io/tensorflow/tensorflow:latest (2/12 layers)
...

Maybe this would be clearer?

scele · 2018-10-06

Good idea! I went with a slightly modified version:

Downloading from gcr.io/tensorflow/tensorflow:latest (layer 1/12)
Downloading from gcr.io/tensorflow/tensorflow:latest (layer 2/12)
...

Is that ok?

xingao267 · 2018-10-07

Looks great!
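The agreed "(layer i/n)" format can be sketched as follows. This is not the code from save_.py: `write_layer` and `pull_layers` are hypothetical stand-ins that only mimic the shape of the diff above (one task per layer submitted to an executor, with the progress message built from `idx + 1` and the layer count).

```python
import concurrent.futures
import io
import sys
import threading

_print_lock = threading.Lock()


def write_layer(layer_name, message, stream):
    """Stand-in for a layer download: emit the progress line, return the name."""
    if message is not None:
        # Serialize writes so lines from different worker threads never interleave.
        with _print_lock:
            stream.write(message + '\n')
    return layer_name


def pull_layers(image_name, num_layers, threads=1, print_progress=False,
                stream=None):
    """Submit one task per layer; optionally attach a progress message."""
    stream = sys.stderr if stream is None else stream
    with concurrent.futures.ThreadPoolExecutor(max_workers=threads) as executor:
        futures = []
        for idx in range(num_layers):
            layer_name = '%03d.tar.gz' % idx
            message = None
            if print_progress:
                message = 'Downloading from {} (layer {}/{})'.format(
                    image_name, idx + 1, num_layers)
            futures.append(
                executor.submit(write_layer, layer_name, message, stream))
        return [f.result() for f in futures]


out = io.StringIO()
names = pull_layers('gcr.io/tensorflow/tensorflow:latest', 3,
                    threads=1, print_progress=True, stream=out)
```

With more than one worker thread the lines may appear out of order, since each message is printed when its task starts running rather than when it is submitted.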

nlopezgi · 2018-10-05T16:51:20Z

I do think it makes sense to print some form of progress message for these long running actions. My one concern is wrt how to print out these messages (i.e., use of sys vs something else for logging that can help control these messages at a coarse grain).
@KaylaNguyen could you comment on this PR? Do you have any advice as to how to print out the messages?

scele · 2018-10-16T15:21:59Z

@KaylaNguyen ping?

scele · 2018-10-25T10:58:31Z

@KaylaNguyen could you please provide feedback on this PR? I opened it back in February and have still received no response.

KaylaNguyen · 2018-11-06T17:03:54Z

Hi @scele, currently we don't have any plan to maintain this repo. But I can take a look at it in my personal time. Response time will be slow to very slow. I'll see what I can do and get back to you by the end of next week. Thanks for your understanding :)

KaylaNguyen · 2018-11-21T20:31:45Z

Please wait until the next release to merge with new changes. Ping me when you're ready and I'll export this PR from internal. Thanks!

scele · 2019-01-03T19:22:19Z

@KaylaNguyen Sorry, I missed your previous comment. :( I have rebased now; could you take another look so we can get this merged? Thanks!

KaylaNguyen · 2019-01-10T18:26:34Z

Will slowly get back to you by end of next week :) Thank you for your patience.

KaylaNguyen · 2019-01-17T20:32:59Z

client/v2_2/save_.py

@@ -22,6 +22,7 @@
 import io
 import json
 import os
+import sys


Use logging instead, you can check fast_flatten or fast_pusher for reference.

scele · 2019-01-21

See my commit message for an explanation of why I don't want to use logging.
Also, I would like the print to go to stderr, not stdout, because other bazel build status messages also go to stderr:

Loading:
Loading: 0 packages loaded
currently loading: foo
Analyzing: target //foo:bar (45 packages loaded)
...

I think the pull progress messages should appear alongside these messages on stderr, like so:

Loading:
Loading: 0 packages loaded
currently loading: foo
Analyzing: target //foo:bar (45 packages loaded)
Downloading from gcr.io/tensorflow/tensorflow:latest (1/12)
Downloading from gcr.io/tensorflow/tensorflow:latest (2/12)
Downloading from gcr.io/tensorflow/tensorflow:latest (3/12)
...

If the output goes to stdout, then these status messages would clutter run_log.txt in the following invocation:

bazel run //foo:bar 2>build_log.txt >run_log.txt
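The stdout/stderr argument can be illustrated with a small sketch. Here `run_target` is a hypothetical stand-in for a program that emits progress lines on stderr and its real output on stdout; the two `StringIO` buffers play the role of run_log.txt and build_log.txt from the invocation above.

```python
import contextlib
import io
import sys


def run_target():
    """Hypothetical program: progress goes to stderr, real output to stdout."""
    for i in range(1, 3):
        sys.stderr.write(
            'Downloading from gcr.io/tensorflow/tensorflow:latest '
            '({}/2)\n'.format(i))
    print('output of //foo:bar')


# Capture the two streams separately, like `2>build_log.txt >run_log.txt`.
run_log, build_log = io.StringIO(), io.StringIO()
with contextlib.redirect_stdout(run_log), contextlib.redirect_stderr(build_log):
    run_target()
```

Because the progress lines are written to stderr, the captured stdout contains only the program's own output.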

KaylaNguyen · 2019-01-17T20:33:32Z

client/v2_2/save_.py

@@ -144,7 +145,8 @@ def tarball(name, image,
 def fast(image,
         directory,
         threads = 1,
-         cache_directory = None):
+         cache_directory = None,
+         print_progress = False):


Update the Args section below.
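As a rough illustration of what this request amounts to, the sketch below shows a function with the same parameter list as the diff and an Args section that documents the new parameter. `fast_sketch` is a hypothetical stand-in, and every line of the docstring wording is an assumption rather than the text in client/v2_2/save_.py.

```python
import inspect


def fast_sketch(image, directory, threads=1, cache_directory=None,
                print_progress=False):
    """Hypothetical stand-in for fast(); the docstring wording is assumed.

    Args:
      image: the image to save.
      directory: an existing directory under which to write the layers.
      threads: number of threads to use for the download.
      cache_directory: optional directory used as a layer cache.
      print_progress: if True, write one progress line per layer to stderr.
    """
    return print_progress


# The new keyword defaults to False, so existing callers are unaffected.
default = inspect.signature(fast_sketch).parameters['print_progress'].default
```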

Add minimal prints to indicate puller progress if --print-progress flag is passed to fast puller. The intention is that the bazel container_pull workspace rules could eventually output something, instead of bazel getting seemingly stuck for 10 minutes when pulling an image that is many gigabytes in size.

We are intentionally not using the logging facility and the existing --stderrthreshold argument to display these prints, because that will produce log-formatted output that looks too detailed when inlined with other bazel output. The intention is to show user-friendly progress instead:

Downloading from gcr.io/tensorflow/tensorflow:latest (layer 1/12)
Downloading from gcr.io/tensorflow/tensorflow:latest (layer 2/12)
Downloading from gcr.io/tensorflow/tensorflow:latest (layer 3/12)
Downloading from gcr.io/tensorflow/tensorflow:latest (layer 4/12)
Downloading from gcr.io/tensorflow/tensorflow:latest (layer 5/12)
...

scele force-pushed the verbose_print branch from 2da8577 to 830df69 on February 19, 2018 11:39

scele changed the title from "Add --verbose flag to show puller progress" to "Add --print-progress flag to show puller progress" on Feb 19, 2018

scele added a commit to scele/rules_docker that referenced this pull request Feb 20, 2018

Add support for print_progress to container_pull

36a790f

Currently this depends on a forked version of puller.par. Pending pull request to upstream puller.par is here: google/containerregistry#66

scele mentioned this pull request Oct 4, 2018

Add support for print_progress to container_pull bazelbuild/rules_docker#535

Closed

xingao267 reviewed Oct 5, 2018

scele force-pushed the verbose_print branch from 830df69 to 895c594 on October 6, 2018 19:12

scele force-pushed the verbose_print branch from 895c594 to f6530b1 on January 3, 2019 18:59

KaylaNguyen reviewed Jan 17, 2019

scele force-pushed the verbose_print branch from f6530b1 to b962cd8 on January 21, 2019 09:39
