Multi-armed bandit

Applying reinforcement learning to the "multi-armed bandit" problem

Research steps

Build a class for a multi-armed bandit model with N arms
Create an arm selection function that follows the problem's current probability distribution
Create a random arm selection function
Add a reward for the selected move
Add a function that updates the probability distribution once the reward for the turn is known
For each turn, introduce a cost that is paid from the resources available to the player
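
The steps above can be sketched as a single class. This is a minimal illustration, not the repository's implementation: the name BanditSketch, its members, the 0/1 (Bernoulli) rewards, and the linear reward-inaction update rule are all assumptions chosen to keep the example short.

```cpp
#include <cassert>
#include <cstddef>
#include <random>
#include <utility>
#include <vector>

// Illustrative sketch only: the class name, members, and the update rule
// (linear reward-inaction) are assumptions, not the repository's code.
class BanditSketch {
public:
    // armMeans[i]: hidden probability that arm i pays a reward of 1.
    BanditSketch(std::vector<double> armMeans, double budget, double turnCost,
                 unsigned seed = 42)
        : armMeans_(std::move(armMeans)),
          probs_(armMeans_.size(), 1.0 / static_cast<double>(armMeans_.size())),
          budget_(budget), turnCost_(turnCost), rng_(seed) {
        assert(!armMeans_.empty());
    }

    // The player can move only while the remaining resources cover the cost.
    bool canPlay() const { return budget_ >= turnCost_; }

    // Choose an arm according to the current (learned) distribution.
    std::size_t selectByDistribution() {
        std::discrete_distribution<std::size_t> pick(probs_.begin(), probs_.end());
        return pick(rng_);
    }

    // Choose an arm uniformly at random, ignoring what was learned.
    std::size_t selectRandom() {
        std::uniform_int_distribution<std::size_t> pick(0, probs_.size() - 1);
        return pick(rng_);
    }

    // Pay the turn cost, then draw a 0/1 reward from the chosen arm.
    double pull(std::size_t arm) {
        assert(arm < armMeans_.size() && canPlay());
        budget_ -= turnCost_;
        std::bernoulli_distribution pays(armMeans_[arm]);
        return pays(rng_) ? 1.0 : 0.0;
    }

    // Shift probability mass toward `arm` in proportion to the reward;
    // a zero reward leaves the distribution unchanged.
    void update(std::size_t arm, double reward, double learningRate = 0.1) {
        const double step = learningRate * reward;
        for (double& p : probs_) p *= (1.0 - step);
        probs_[arm] += step;
    }

    const std::vector<double>& probabilities() const { return probs_; }
    double budget() const { return budget_; }

private:
    std::vector<double> armMeans_;  // hidden reward probabilities
    std::vector<double> probs_;     // current arm selection distribution
    double budget_;                 // resources left to the player
    double turnCost_;               // resources spent per turn
    std::mt19937 rng_;
};
```

A typical loop would call selectByDistribution(), pull(), and update() while canPlay() holds. Because update() scales every probability by (1 - step) and then adds step to the chosen arm, the distribution keeps summing to 1 without a separate normalization pass.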

Purpose of the research

Demonstrate how the probability of choosing the i-th arm changes as the turn number increases
Demonstrate how the probability of choosing the "best" arm depends on the turn number
Demonstrate the change in the agent's payoff over time
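
One way to collect the data for these three demonstrations is to record a snapshot on every turn. The sketch below is a hedged example rather than the project's code: RunHistory, simulate, the Bernoulli rewards, and the linear reward-inaction update are assumptions made for illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <random>
#include <vector>

// Per-turn metrics; the struct and function names are hypothetical.
struct RunHistory {
    std::size_t bestArm = 0;                    // index of the arm with the highest mean
    std::vector<std::vector<double>> armProbs;  // selection distribution before each turn
    std::vector<double> cumulativePayoff;       // total reward after each turn
};

// Runs `turns` rounds of a Bernoulli bandit with a linear reward-inaction
// update (an assumed rule) and records the metrics for later plotting.
inline RunHistory simulate(const std::vector<double>& armMeans, std::size_t turns,
                           double learningRate = 0.1, unsigned seed = 42) {
    assert(learningRate >= 0.0 && learningRate <= 1.0);
    RunHistory history;
    if (armMeans.empty()) return history;

    const std::size_t n = armMeans.size();
    for (std::size_t i = 1; i < n; ++i) {
        if (armMeans[i] > armMeans[history.bestArm]) history.bestArm = i;
    }

    std::vector<double> probs(n, 1.0 / static_cast<double>(n));  // start uniform
    std::mt19937 rng(seed);
    double payoff = 0.0;

    for (std::size_t t = 0; t < turns; ++t) {
        history.armProbs.push_back(probs);  // snapshot for goals 1 and 2

        std::discrete_distribution<std::size_t> pick(probs.begin(), probs.end());
        const std::size_t arm = pick(rng);
        std::bernoulli_distribution pays(armMeans[arm]);
        const double reward = pays(rng) ? 1.0 : 0.0;

        payoff += reward;
        history.cumulativePayoff.push_back(payoff);  // data for goal 3

        // Move probability mass toward the rewarded arm; no change on zero reward.
        const double step = learningRate * reward;
        for (double& p : probs) p *= (1.0 - step);
        probs[arm] += step;
    }
    return history;
}
```

Plotting armProbs[t][i] against t shows how the choice of the i-th arm evolves, armProbs[t][bestArm] gives the curve for the "best" arm, and cumulativePayoff gives the agent's payoff over time.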


Materials used

Progress

The work process is described in a separate file

Clang-format

To apply .clang-format to all C++ files, use your IDE tools or run the following command on Linux:

find ./src -iname '*.h' -o -iname '*.cpp' | xargs clang-format -i
