Repository navigation

#

reinforcement-learning

Website
Wikipedia

Developer-Y / cs-video-courses

List of Computer Science courses with video lectures.

计算机科学算法 systems 数据库机器学习 web-development 安全 computer-architecture Bioinformatics Robotics embedded-systems database-systems 机器视觉 Quantum Computing computational-biology 深度学习 reinforcement-learning

68505

9265

3 天前

labmlai/annotated_deep_learning_paper_implementations

labmlai / annotated_deep_learning_paper_implementations

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

深度学习 PyTorch Generative Adversarial Network transformers reinforcement-learning optimizers neural-networks transformer 机器学习 attention literate-programming lora

Python

60091

6076

8 个月前

ray-project / ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python

36624

6228

5 小时前

eugeneyan/applied-ml

eugeneyan / applied-ml

📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.

applied-machine-learning production applied-data-science 机器学习数据科学 reinforcement-learning data-engineering recsys search 深度学习 data-quality data-discovery 机器视觉自然语言处理

27888

3745

9 个月前

d2l-ai / d2l-en

Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.

深度学习机器学习 book notebook 机器视觉自然语言处理 Python kaggle 数据科学 mxnet PyTorch Tensorflow Keras gaussian-processes hyperparameter-optimization recommender-system reinforcement-learning jax

Python

25597

4599

8 个月前

Unity-Technologies/ml-agents

Unity-Technologies / ml-agents

The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.

reinforcement-learning Unity 深度学习 deep-reinforcement-learning neural-networks 机器学习

C#

17975

4256

3 个月前

ddbourgin / numpy-ml

Machine learning, in numpy

机器学习 neural-networks topic-modeling gaussian-mixture-models hidden-markov-models gradient-boosting bayesian-inference wavenet vae resnet lstm attention reinforcement-learning knn gaussian-processes word2vec

Python

16059

3799

1 年前

tensorflow / tensor2tensor

Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

机器学习 machine-translation 深度学习 reinforcement-learning tpu

Python

16058

3582

2 年前

AI4Finance-Foundation / FinGPT

FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.

ChatGPT finance fintech large-language-models 机器学习自然语言处理 prompt-engineering PyTorch reinforcement-learning robo-advisor sentiment-analysis technical-analysis fingpt

Jupyter Notebook

15849

2228

4 个月前

datawhalechina / leedl-tutorial

《李宏毅深度学习教程》（李宏毅老师推荐👍，苹果书🍎），PDF下载地址：https://github.com/datawhalechina/leedl-tutorial/releases

机器学习深度学习 leedl-tutorial cnn reinforcement-learning transformer rnn Generative Adversarial Network pruning self-attention ChatGPT 教程 diffusion transfer-learning bert

Jupyter Notebook

14976

3010

16 天前

ShangtongZhang / reinforcement-learning-an-introduction

Python Implementation of Reinforcement Learning: An Introduction

reinforcement-learning 人工智能

Python

14001

4912

8 个月前

bulletphysics / bullet3

Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning etc.

Simulation Robotics kinematics 虚拟现实 reinforcement-learning computer-animation 游戏开发 simulator pybullet

C++

13263

2930

3 个月前

kmario23 / deep-learning-drizzle

Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!

机器学习深度学习深度神经网络 pattern-recognition 机器视觉 optimization visual-recognition reinforcement-learning deep-reinforcement-learning 自然语言处理 artificial-neural-networks artificial-intelligence-algorithms bayesian-statistics speech-recognition graph-neural-networks Medical imaging geometric-deep-learning explainable-ai probability

HTML

12544

2949

6 个月前

owainlewis / awesome-artificial-intelligence

A curated list of Artificial Intelligence (AI) courses, books, video lectures and papers.

机器学习人工智能 reinforcement-learning intelligent-systems 深度学习 intelligent-machines statistical-learning unsupervised-learning 神经网络

11803

1969

4 个月前

datawhalechina / easy-rl

强化学习中文教程（蘑菇书🍄），在线阅读地址：https://datawhalechina.github.io/easy-rl/

deep-reinforcement-learning reinforcement-learning dqn ppo a3c q-learning sarsa imitation-learning policy-gradient ddpg double-dqn dueling-dqn td3

Jupyter Notebook

11004

2007

22 天前

aws / amazon-sagemaker-examples

Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.

sagemaker Amazon Web Services reinforcement-learning 机器学习深度学习 Example Jupyter Notebook mlops 数据科学 training inference

Jupyter Notebook

10446

6867

1 个月前

DLR-RM/stable-baselines3

DLR-RM / stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

reinforcement-learning reinforcement-learning-algorithms 机器学习 gym openai baselines toolbox stable-baselines Python PyTorch Robotics sde

Python

10411

1820

8 天前

wandb/wandb

The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.

机器学习 experiment-track 深度学习 Keras Tensorflow PyTorch hyperparameter-search reinforcement-learning mlops 数据科学 collaboration hyperparameter-optimization reproducibility hyperparameter-tuning data-versioning model-versioning ml-platform jax 人工智能

Python

9758

730

6 小时前

Hvass-Labs / TensorFlow-Tutorials

TensorFlow Tutorials with YouTube Videos

Tensorflow 深度学习机器学习 reinforcement-learning python-notebook 教程神经网络 YouTube

Jupyter Notebook

9277

4172

4 年前

MorvanZhou / Reinforcement-learning-with-tensorflow

Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学

reinforcement-learning 教程 q-learning sarsa sarsa-lambda deep-q-network a3c ddpg policy-gradient dqn double-dqn dueling-dqn deep-deterministic-policy-gradient actor-critic Tensorflow proximal-policy-optimization ppo 机器学习

Python

9157

5025

1 年前