Repository navigation

#

policy-learning

[IEEE T-PAMI 2024] All you need for End-to-end Autonomous Driving

3336
3 个月前

A curated list of 3D Vision papers relating to Robotics domain in the era of large models i.e. LLMs/VLMs, inspired by awesome-computer-vision, including papers, codes, and related websites

762
3 个月前

[CVPR 2024 Highlight] GenAD: Generalized Predictive Model for Autonomous Driving

Python
760
3 个月前

YLearn, a pun of "learn why", is a python package for causal inference

Python
430
3 个月前

[ICLR 2023] Pytorch implementation of PPGeo, a fully self-supervised driving policy pre-training framework to learn from unlabeled driving videos.

Python
135
3 个月前

[RSS 2024] Learning Manipulation by Predicting Interaction

Python
115
3 个月前

[ECCV 2022] Learning to Drive by Watching YouTube Videos: Action-Conditioned Contrastive Policy Pretraining

Python
85
3 年前

Policy learning via doubly robust empirical welfare maximization over trees

R
84
2 个月前

Stable dynamical system (motion policy) learning using Euclideanizing flows

Python
14
5 年前

Off-Policy Evaluation and Learning that is both Doubly Robust and Distributionally Robust.

Jupyter Notebook
9
3 年前

Experiment code for "Koopman Constrained Policy Optimization: a Koopman operator theoretic method for differentiable optimal control in robotics" as presented at ICML 2023

Jupyter Notebook
8
2 年前

Black-box, gradient-free optimization of car-racing policies.

Python
3
5 年前

[ECAI-2025] SPOWL: A JAX-based Safe RL framework that adaptively combines planning and policy learning with dynamic safety thresholds.

1
1 个月前

# Car_Black_Box This smart black box monitors key vehicle metrics in real-time, alerting drivers to unsafe conditions. With AI-driven insights, fleet managers can reduce accidents and maintenance costs effectively. 🚗💻

C
0
3 个月前

Neural network and reinforcement learning models for efficient decision-making on classical planning benchmarks

Python
0
3 个月前

This repository 📂 implements Q-Learning 🤖 in Blackjack 🃏, comparing it with random action selection 🎲 and basic strategies 📋. Includes experiments 🔬 with various strategies, rule variations ⚖️, and deck numbers 🃏📦 to evaluate performance 📈.

Jupyter Notebook
0
6 个月前

Implementation of a basic diffusion policy in jax with a full pipeline of data collection -> data augmentation -> training -> inference/evaluation

Python
0
9 个月前