Manager YARN capacity scheduler file #84
Unanswered
rpignolet
asked this question in
New features
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Context
Currently, YARN ResourceManager capacity scheduler file is manage by the
yarn_resourcemanager_config.yml
playbook (which callconfig.yml
from roletosit.tdp.yarn.resourcemanager
). The playbookyarn_resourcemanager_capacityscheduler.yml
modify the capacity scheduler file too but this playbook is not in the DAG so it must be call manually.Problem
We don't have a process for updating YARN's capacity scheduler. Currently, if we modify the
capacity_scheduler
variable intdp_vars/yarn/yarn.yml
and run atdp reconfigure
, it will reconfigure several elements that do not need to be reconfigured likeyarn_ranger_config
,yarn_client_config
,yarn_resourcemanager_restart.yml
,yarn_nodemanager_restart.yml
.Analysis
The
yarn rmadmin -refreshQueues
command allows to read thecapacity-scheduler.xml
file and to apply the configuration of the queues except in the case of deletion of a queue, a restart of ResourceManager is necessary.Question
capacity_scheduler
variable in thetdp_vars
? This implies that after modifying this variable, atdp reconfigure
must be launched. In other words, do we keep the configuration of the capacity scheduler in the DAG? In the case where you have to do areconfigure
for the management of the YARN queues, this means that as long as a deployment is in progress, you cannot change the configuration of the YARN queues.capacity_scheduler
variable to an abstraction of queue management to facilitate queue deletion detection code?References
Beta Was this translation helpful? Give feedback.
All reactions