[BC] Generate trajectories #388

Merged

tvmarino merged 19 commits into google:main from tvmarino:bc_generate_trajectories

Nov 13, 2024

Collaborator

tvmarino commented Nov 1, 2024

Adds gen_trajectories(), the function that loads the module corpus, creates the worker manager used with ModuleWorker, collects the results, and writes them to file.

tvmarino added 13 commits

October 23, 2024 21:06


          Initial ModuleWorker commit. Adds the action and distribution wrapper functions and constructor.

08f8718


          Merge branch 'main' of https://github.com/tvmarino/ml-compiler-opt into bc_trajectories_module_worker

578bc62


          Merge conflicts fix.

8d9827e


          Initial ModuleWorker commit.

d39bf22


          ModuleWorker class which distributes the ExplorationWorkers over multiple modules.

d1dfd52


          pylint fix

dc74f7a


          Addressing @boomanaiden154's comments.

6edcb84


          Addressing @boomanaiden154 comments.

c4de9c0


          Addressing @mtrofin comments.

efc6f9d


          Addressing @mtrofin comments.

87986a6


          Changed ExploreModule to ModuleExplorer

a93271f


          Merge branch 'google:main' into bc_generate_trajectories

fc9d748


          Commit gen_trajectories() which is the function that loads the modules corpus, creates the worker manager to be used with ModuleWorker, collects the results and writes the results to file.

1018e41

tvmarino requested review from mtrofin and boomanaiden154

November 1, 2024 16:47

tvmarino added 2 commits

November 1, 2024 18:25


          Fixing flags issue.

4b36668


          Trying to fix flags.

ffc6a86

mtrofin reviewed

compiler_opt/rl/generate_bc_trajectories.py Outdated

+                  num_workers: Optional[int] = None,
+                  num_output_files: int = 1,
+                  profiling_file_path: Optional[str] = None,
+                  worker_wait: int = 10,

Collaborator

mtrofin Nov 1, 2024

nit: rename to worker_wait_sec, so the name also documents the unit

Collaborator Author

tvmarino Nov 4, 2024

Done.

compiler_opt/rl/generate_bc_trajectories.py Outdated

+                total_work = len(corpus_elements)
+                total_failed_examples = 0
+                total_write_files = num_output_files
+                total_profiles_max: List[Optional[Dict[str, Union[str, float, int]]]] = []

Collaborator

mtrofin Nov 1, 2024

there is a lot of nesting. A trick to help comprehension is to alias the type somewhere (as a module-level def), e.g.: (I'm making the names up)

ExperimentValueType = Union[str, float, int]
ExperimentResultType = Dict[str, ExperimentValueType]

Collaborator Author

tvmarino Nov 4, 2024

I set ProfilingDictValueType = Dict[str, Union[str, float, int]].
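
The aliasing suggestion can be sketched roughly as follows. The alias names are the ones mentioned in the comments above; the helper `count_present` and the sample values are hypothetical and only there to make the snippet runnable.

```python
from typing import Dict, List, Optional, Union

# Module-level aliases, named as in the review comments above.
ExperimentValueType = Union[str, float, int]
ProfilingDictValueType = Dict[str, ExperimentValueType]

# The nested annotation from the diff becomes much shorter to read.
total_profiles_max: List[Optional[ProfilingDictValueType]] = []


def count_present(profiles: List[Optional[ProfilingDictValueType]]) -> int:
  """Hypothetical helper: counts entries that are not None."""
  return sum(1 for p in profiles if p is not None)


# Made-up sample data, not taken from the PR.
total_profiles_max.append({'module_name': 'a.o', 'size': 128, 'time': 0.5})
total_profiles_max.append(None)
```

The aliases are plain assignments, so they cost nothing at runtime and can be reused in every signature that handles profiling dictionaries.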

compiler_opt/rl/generate_bc_trajectories.py Outdated

+                total_successful_examples = 0
+                total_work = len(corpus_elements)
+                total_failed_examples = 0
+                total_write_files = num_output_files

Collaborator

mtrofin Nov 1, 2024

why not leave this as num_output_files? you're never mutating it

Collaborator Author

tvmarino Nov 1, 2024

That's true. I will fix this.

compiler_opt/rl/generate_bc_trajectories.py Outdated

+                                                       Dict[str, Union[str, float, int]]],
+                                                 tf.train.SequenceExample]]] = []
+                  for written_files in range(total_write_files):

Collaborator

mtrofin Nov 1, 2024

s/written_files/written_file_index?

Collaborator Author

tvmarino Nov 4, 2024

Changed it to written_files_idx.

compiler_opt/rl/generate_bc_trajectories.py

+                              logging.INFO,
+                              ('%d success, %d failed out of %d, modules processed'
+                               ' %d\n timing compiler: %f'),
+                              10,

Collaborator

mtrofin Nov 1, 2024

what's 10?

Collaborator Author

tvmarino Nov 1, 2024

From the log_every_n_seconds docstring, it looks like it's the interval, in seconds, between log messages.

compiler_opt/rl/generate_bc_trajectories.py Outdated

+                              modules_processed,
+                              time_compiler_calls,
+                          )
+                          if len(succeeded) == 0:

Collaborator

mtrofin Nov 1, 2024

doesn't if not succeeded work?
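
A minimal sketch of the equivalence behind this suggestion, assuming `succeeded` is a built-in sequence such as a list (the function name is hypothetical):

```python
from typing import List


def has_no_successes(succeeded: List[object]) -> bool:
  """Returns True for an empty sequence, relying on its truthiness."""
  return not succeeded


# For built-in sequences both spellings agree.
empty_agrees = has_no_successes([]) == (len([]) == 0)
non_empty_agrees = has_no_successes(['m.o']) == (len(['m.o']) == 0)
```

The shortcut does not carry over to every container type; NumPy arrays with more than one element, for instance, raise an error when evaluated as a boolean, so the explicit length check is still needed there.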

compiler_opt/rl/generate_bc_trajectories.py

+                max_profiles_path = ''
+                pol_profiles_path = ''
+                if profiling_file_path:

Collaborator

mtrofin Nov 1, 2024

so if profiling_file_path isn't given, then what happens with the open below?

Collaborator Author

tvmarino Nov 1, 2024

The context is set to contextlib.nullcontext(), so I think nothing happens.

Collaborator

mtrofin Nov 1, 2024

would it be cleaner to check here and do the open stuff only when profiling_file_path is set?

Collaborator Author

tvmarino Nov 4, 2024

Yes, I am addressing this.
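
One way the suggested restructuring could look is sketched below: the files are opened only when `profiling_file_path` is set, so no `contextlib.nullcontext()` placeholder is needed. The function name, the file suffixes, and the JSON output format are assumptions for illustration, not the code that was merged.

```python
import json
import os
import tempfile
from typing import Dict, List, Optional, Union

ProfilingDictValueType = Dict[str, Union[str, float, int]]


def write_profiles(profiling_file_path: Optional[str],
                   profiles: List[ProfilingDictValueType]) -> List[str]:
  """Writes profiles to two files, but only when a path prefix is given.

  Returns the paths written (an empty list when profiling is disabled).
  """
  if not profiling_file_path:
    return []  # Early exit: nothing is opened at all.
  # Hypothetical naming scheme for the two output files.
  max_profiles_path = profiling_file_path + '_max.json'
  pol_profiles_path = profiling_file_path + '_pol.json'
  with open(max_profiles_path, 'w', encoding='utf-8') as max_f, \
       open(pol_profiles_path, 'w', encoding='utf-8') as pol_f:
    json.dump(profiles, max_f)
    json.dump(profiles, pol_f)
  return [max_profiles_path, pol_profiles_path]


# Usage with a temporary directory and made-up data.
with tempfile.TemporaryDirectory() as tmp_dir:
  written = write_profiles(os.path.join(tmp_dir, 'profile'), [{'size': 1}])
  disabled = write_profiles(None, [{'size': 1}])
```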

tvmarino added 4 commits

November 4, 2024 16:27


          Addressing @mtrofin comments.

2f4533b


          yapf

1c1cb6b


          Added a test for gen_trajectories, which involved refactoring ModuleWorker and ModuleExplorer so that the classes and callables previously passed as a gin.config are passed directly to gen_trajectories. This is because gin.config classes and callables cannot be pickled for multiprocessing purposes.

956e9aa


          Addressed a comment from @mtrofin

c141f91
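
The pickling constraint mentioned in commit 956e9aa reflects a general property of Python multiprocessing: objects sent to worker processes are serialized with pickle. The sketch below does not use gin at all and makes no claim about why gin-configured objects fail to pickle; it only shows, with standard-library objects, the difference between a callable pickle can serialize and one it cannot. The helper name `is_picklable` is hypothetical.

```python
import functools
import pickle


def is_picklable(obj: object) -> bool:
  """Returns True if pickle can serialize obj."""
  try:
    pickle.dumps(obj)
    return True
  except (pickle.PicklingError, AttributeError, TypeError):
    return False


# A lambda has no importable qualified name, so pickle rejects it.
lambda_ok = is_picklable(lambda x: x + 1)
# A partial over an importable function with plain-data arguments works.
partial_ok = is_picklable(functools.partial(pow, 2))
```

Passing importable classes and functions directly as arguments, as the commit describes, keeps everything that crosses the process boundary in the second category.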

mtrofin approved these changes

tvmarino merged commit ad31887 into google:main

15 checks passed

tvmarino deleted the bc_generate_trajectories branch

November 13, 2024 20:11
