Repository navigation

policy-learning

Website
Wikipedia

OpenDriveLab / End-to-end-Autonomous-Driving

[IEEE T-PAMI 2024] All you need for End-to-end Autonomous Driving

end-to-end-autonomous-driving autonomous-driving policy-learning Simulation

3336

306

3 个月前

zubair-irshad / Awesome-Robotics-3D

A curated list of 3D Vision papers relating to Robotics domain in the era of large models i.e. LLMs/VLMs, inspired by awesome-computer-vision, including papers, codes, and related websites

3D benchmarks 机器视觉 gaussian-splatting 大语言模型 manipulation nerf policy-learning pretraining Robotics scene-graph Simulation vision-language-model vlm diffusion-models foundation-models navigation

762

3 个月前

OpenDriveLab / DriveAGI

[CVPR 2024 Highlight] GenAD: Generalized Predictive Model for Autonomous Driving

foundation-model autonomous-driving embodied-ai policy-learning video-generation world-models

Python

760

3 个月前

DataCanvasIO / YLearn

YLearn, a pun of "learn why", is a python package for causal inference

causal-inference causality causal-models causal-discovery uplift-modeling policy-learning

Python

430

3 个月前

OpenDriveLab / PPGeo

[ICLR 2023] Pytorch implementation of PPGeo, a fully self-supervised driving policy pre-training framework to learn from unlabeled driving videos.

end-to-end-autonomous-driving policy-learning self-supervised-learning

Python

135

3 个月前

OpenDriveLab / MPI

[RSS 2024] Learning Manipulation by Predicting Interaction

policy-learning pre-training robot-manipulation

Python

115

3 个月前

metadriverse / ACO

[ECCV 2022] Learning to Drive by Watching YouTube Videos: Action-Conditioned Contrastive Policy Pretraining

policy-learning pretraining 机器视觉

Python

3 年前

grf-labs / policytree

Policy learning via doubly robust empirical welfare maximization over trees

causal-inference policy-learning

2 个月前

mrana6 / euclideanizing_flows

Stable dynamical system (motion policy) learning using Euclideanizing flows

imitation-learning policy-learning dynamical-systems

Python

5 年前

robot-learning-freiburg / TAPAS

PyTorch code for TAPAS-GMM.

人工智能 imitation-learning policy-learning PyTorch Robotics

Jupyter Notebook

10 个月前

CausalML / doubly-robust-dropel

Off-Policy Evaluation and Learning that is both Doubly Robust and Distributionally Robust.

机器学习 policy-learning robustness

Jupyter Notebook

3 年前

mhr / kcpo-icml

Experiment code for "Koopman Constrained Policy Optimization: a Koopman operator theoretic method for differentiable optimal control in robotics" as presented at ICML 2023

mpc optimal-control policy-learning robot-learning

Jupyter Notebook

2 年前

max-eth / racer

Black-box, gradient-free optimization of car-racing policies.

gym policy-learning optimization

Python

5 年前

xiaobaobaochifan / NAC

The official repository for Net Actor-Critic

decision-making 机器学习 optimal-transport policy-learning reinforcement-learning offline-reinforcement-learning

Python

7 个月前

AIRI-Institute / SPOWL

[ECAI-2025] SPOWL: A JAX-based Safe RL framework that adaptively combines planning and policy learning with dynamic safety thresholds.

planning policy-learning

1 个月前

GermanPaul12 / Space-Invaders-Pygame-RL-Genetic-Agents

genetic-algorithm policy-learning pygame Python reinforcement-learning space-invaders

Python

4 个月前

hackerx004 / Car_Black_Box

# Car_Black_Box This smart black box monitors key vehicle metrics in real-time, alerting drivers to unsafe conditions. With AI-driven insights, fleet managers can reduce accidents and maintenance costs effectively. 🚗💻

automobile car embedded-systems gym Microcontroller obd2 optimization policy-learning Simulation

3 个月前

bansal-yash / Neural-Networks-Policy-Planning

Neural network and reinforcement learning models for efficient decision-making on classical planning benchmarks

神经网络 planning reinforcement-learning policy-learning

Python

3 个月前

suraj5424 / Q-Learning-for-Blackjack-in-different-environments

This repository 📂 implements Q-Learning 🤖 in Blackjack 🃏, comparing it with random action selection 🎲 and basic strategies 📋. Includes experiments 🔬 with various strategies, rule variations ⚖️, and deck numbers 🃏📦 to evaluate performance 📈.

agent-based-modeling 人工智能 blackjack 机器学习 policy-learning q-learning reinforcement-learning sarsa

Jupyter Notebook

6 个月前

aditKadepurkar / basic_diffusion_policy

Implementation of a basic diffusion policy in jax with a full pipeline of data collection -> data augmentation -> training -> inference/evaluation

diffusion policy-learning Robotics

Python

9 个月前