Implementation of the MDP Order Dispatch Policy

This repository contains the implementation of the paper Large-Scale Order Dispatch in On-Demand Ride-Hailing Platforms: A Learning and Planning Approach in Python. Specifically, it creates a synthetic environment to simulate the ridesharing marketplace according to Section 6.1 of the paper and applies the MDP order dispatch policy developed in the paper to this example. Please refer to Demonstration.ipynb for the detailed implementation.

Summary of the Algorithm

The algorithm consists of two steps:

Policy Evaluation: Apply temporal difference learning to the historical data to learn the value function
Order Dispatch: Implement the order dispatch policy by maximizing the value function

Illustration of the policy evaluation step:

Pseudocode:

The order dispatch step:

Simulation results and comparison against other baseline policies:

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
graph		graph
Demonstration.ipynb		Demonstration.ipynb
README.md		README.md
RL for ridesharing.pdf		RL for ridesharing.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Implementation of the MDP Order Dispatch Policy

Summary of the Algorithm

About

Releases

Packages

Languages

callmespring/MDPOD

Folders and files

Latest commit

History

Repository files navigation

Implementation of the MDP Order Dispatch Policy

Summary of the Algorithm

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages