Repository navigation

#

markov-decision-processes

MDPs and POMDPs in Julia - An interface for defining, solving, and simulating fully and partially observable Markov decision processes on discrete and continuous spaces.

Julia
705
7 天前
C++
657
1 个月前

Extensible Combinatorial Optimization Learning Environments

C++
339
2 年前
Julia
324
4 天前

A framework to build and solve POMDP problems. Documentation: https://h2r.github.io/pomdp-py/

Python
244
6 个月前

A QoE-Oriented Computation Offloading Algorithm based on Deep Reinforcement Learning (DRL) for Mobile Edge Computing (MEC) | This algorithm captures the dynamics of the MEC environment by integrating the Dueling Double Deep Q-Network (D3QN) model with Long Short-Term Memory (LSTM) networks.

Python
213
15 天前

Coding Demos from the School of AI's Move37 Course

Python
184
6 年前
Python
179
9 天前

🌲 Stanford CS 228 - Probabilistic Graphical Models

Python
129
8 个月前

A research platform to develop automated security policies using quantitative methods, e.g., optimal control, computational game theory, reinforcement learning, optimization, evolutionary methods, and causal inference.

Python
124
1 个月前

Implementation of value iteration algorithm for calculating an optimal MDP policy

Python
97
7 年前

WrightEagle Base Code for RoboCup Soccer Simulation 2D

C++
94
3 年前

Online algorithms for solving large-scale dynamic vehicle routing problems with stochastic requests

Makefile
80
3 年前

Framework for the simulation and estimation of some finite-horizon discrete choice dynamic programming models.

Python
77
20 天前
JavaScript
76
8 年前

Monte Carlo Tree Search (MCTS) is a method for finding optimal decisions in a given domain by taking random samples in the decision space and building a search tree accordingly. It has already had a profound impact on Artificial Intelligence (AI) approaches for domains that can be represented as trees of sequential decisions, particularly games and planning problems. In this project I used a board game called "HEX" as a platform to test different simulation strategies in MCTS field.

Python
68
1 年前

🐍 AI that learns to play Snake using Q-Learning (Reinforcement Learning)

JavaScript
67
6 年前