PBRL ★ PINNED
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning
Python ★ 29 4y ago
Tensorflow-TCN ★ PINNED
Tensorflow eager implementation of Temporal Convolutional Network (TCN)
Python ★ 129 7y ago
COPO ★ PINNED
Online Preference Alignment for Language Models via Count-based Exploration
Python ★ 20 1y ago
UTDS ★ PINNED
Pessimistic Value Iteration for Multi-Task Data Sharing in Offline RL
Python ★ 18 2y ago
OB2I ★ PINNED
Code for "Principled Exploration via Optimistic Bootstrapping and Backward Induction"
Python ★ 9 5y ago
DB ★ PINNED
Dynamic Bottleneck for Robust Self-Supervised Exploration
Python ★ 6 4y ago
GHER
G-HER algorithm
Python ★ 18 7y ago
MINE
Mutual Information Neural Estimation (Pytorch)
Jupyter Notebook ★ 13 6y ago
Pix2Pix-eager
Tensorflow eager implementation of Pix2Pix (Image-to-image translation with conditional adversarial networks)
Python ★ 12 6y ago
Contrastive-UCB
Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning
Python ★ 11 4y ago
MQN-offline
Monotonic Quantile Network for Worst-Case Offline Reinforcement Learning
Python ★ 7 4y ago
BHER
Code for "Addressing Hindsight Bias in Multi-Goal Reinforcement Learning"
Python ★ 7 5y ago
Gumbel-softmax
Tensorflow eager for "categorical variational autoencoder using the Gumbel-Softmax estimator"
Python ★ 7 7y ago
CeSD
Constrained Ensemble Exploration for Unsupervised Skill Discovery
Python ★ 6 2y ago
NMT-eager
tensorflow implementation of neural machine translation with attention
Python ★ 6 7y ago
mc-dropout
Tensorflow eager code for mc dropout. [Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning]
Python ★ 5 6y ago
VDM
Code for "Variational Dynamic for Self-Supervised Exploration in Deep Reinforcement Learning"
Python ★ 5 4y ago
CycleGAN-eager
Tensorflow implementation of CycleGAN "Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks"
Python ★ 5 6y ago
Resnet
tensorflow implementation of Resnet50 with tf.keras.model and eager
Python ★ 5 7y ago
Graduation-design
No description.
Python ★ 5 10y ago
EmbodiedAI-Book
The code implementation of EmbodiedAI-Book
Python ★ 4 1mo ago
DeepLearning-Self_Driving
DeepLearning for self driving cars
Python ★ 4 9y ago
CIFAR-10-basic
using basic neural network and softmax/SVM loss function to classify the CIFAR-10 image set
Python ★ 4 9y ago
Segmentation-keras
Keras implementation of "Multi-scale context aggregation by dilated convolutions (ICLR 2016)"
Python ★ 3 7y ago
Rainbow ⑂
Rainbow: Combining Improvements in Deep Reinforcement Learning
★ 2 5y ago
BeCL-MI-Entropy
Mutual Information and Entropy estimation of BeCL and baseline methods
Jupyter Notebook ★ 2 3y ago
C3D-tensorflow-eager
Implementation of the C3D model for video recognition proposed in "Learning Spatiotemporal Features with 3D Convolutional Networks"
Python ★ 2 7y ago
Non-local-nets-tensorflow
Implementation of "Non-local neural networks." (CVPR 18) with tensorflow eager execution.
Python ★ 2 7y ago
IMDB-eager
tensorflow implementation of IMDB dataset classification
Python ★ 2 7y ago
RNN-Sequential-Mnist
Tensorflow eager implementation for solving the Sequential-Mnist classification problem using a recurrent neural network
Python ★ 1 7y ago
Embodied-Survey
Figures for Embodied Survey
★ 1 2y ago
TD3
The basic implementation of TD3/DDPG algorithm with Tensorflow 2
Python ★ 1 6y ago
probabilistic-ensemble
No description.
Python ★ 1 5y ago
offline_safe_rl
No description.
★ 1 4y ago
VAEAC
Tensorflow version of Variational Autoencoder with Arbitrary Conditioning
Python ★ 1 6y ago
DCGAN-eager
Tensorflow eager implementation of "DCGAN"
Python ★ 1 6y ago
Style-transfer
Neural Style Transfer with tensorflow eager
Jupyter Notebook ★ 1 6y ago
zeroshot-imitation ⑂
[ICLR 2018] TensorFlow code for zero-shot visual imitation by self-supervised exploration
Python ★ 1 8y ago
noreward-rl ⑂
[ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep Reinforcement Learning
Python ★ 1 7y ago
AI-blog ⑂
Accompanying repository for Let's make a DQN / A3C series.
Python ★ 1 7y ago
Hierarchical-Actor-Critc-HAC- ⑂
This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm.
Python ★ 1 7y ago
keract ⑂
Activation Maps (Layers Outputs) and Gradients in Keras.
Python ★ 1 7y ago
large-scale-curiosity ⑂
Code for the paper "Large-Scale Study of Curiosity-Driven Learning"
Python ★ 1 7y ago
exploration-by-disagreement ⑂
[ICML 2019] TensorFlow Code for Self-Supervised Exploration via Disagreement
Python ★ 1 7y ago
BreakOut-A3C
A3C method in Breakout game
Python ★ 1 7y ago
Mnist-tf-eager
Mnist classification with tensorflow eager execution
Python ★ 1 7y ago
VAE-basic
tensorflow and keras for variational auto-encoder (VAE) model
Python ★ 1 7y ago
Transformer
Basic implementation of Transformer model (Attention is all you need) with Tensorflow eager
Python ★ 1 7y ago
DRAW_RNN_ImageGeneration
TensorFlow Implementation of "DRAW: A Recurrent Neural Network For Image Generation"
Python ★ 1 7y ago
NeuralTuringMachine
Tensorflow implementation of a Neural Turing Machine (simple version)
Python ★ 1 7y ago
PTB-eager
tensorflow implementation of language model in PTB dataset
Python ★ 1 7y ago
Leetcode_train
Leetcode coded by me
C++ ★ 1 10y ago
cluster-code
No description.
Python ★ 1 10y ago
baichenjia.github.io
github pages
HTML ★ 0 12h ago
-
Code accompanying the embodied intelligence book
★ 0 1mo ago
lerobot
test of lerobot dataset
Python ★ 0 3mo ago
LearningHumanoidWalking ⑂
Training a humanoid robot for locomotion using Reinforcement Learning
★ 0 2y ago
SELM
SELM
★ 0 1y ago
humanplus ⑂
HumanPlus: Humanoid Shadowing and Imitation from Humans
★ 0 1y ago
BeCL ⑂
BeCL: Behavior Contrastive Learning for Unsupervised Skill Discovery.
★ 0 3y ago
Temporary_D3IL ⑂
No description.
★ 0 2y ago
tdmpc ⑂
Code for "Temporal Difference Learning for Model Predictive Control"
★ 0 3y ago
parkour ⑂
[CoRL 2023] Robot Parkour Learning
★ 0 2y ago
cliport ⑂
CLIPort: What and Where Pathways for Robotic Manipulation
★ 0 3y ago
walk-these-ways ⑂
Sim-to-real RL training and deployment tools for the Unitree Go1 robot.
★ 0 3y ago
CODAC ⑂
No description.
★ 0 4y ago
EBU
Reproduction of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update" (NeurIPS 2019) with Tensorflow
Python ★ 0 5y ago
CVAE_exploration
No description.
Python ★ 0 5y ago
CAVE_NoisyMinist
No description.
Python ★ 0 5y ago
O-RAAC ⑂
Offline Risk-Averse Actor-Critic (O-RAAC). A model-free RL algorithm for risk-averse RL in a fully offline setting
★ 0 5y ago
SunRise-Constrain
SunRise-Constrain
Python ★ 0 5y ago
sunrise ⑂
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning
★ 0 6y ago
offline-rl-neurips.github.io ⑂
No description.
★ 0 5y ago
Rebuttal-OEB3
Rebuttal of OEB3
★ 0 5y ago
Bayesian-DQN
Efficient Exploration through Bayesian Deep Q-Networks
★ 0 5y ago
CB
Curiosity Bottleneck in Self-Supervised Learning Setting
Python ★ 0 5y ago
CPC ⑂
Keras implementation of Representation Learning with Contrastive Predictive Coding
★ 0 7y ago
PPO
The basic algorithm of policy gradients
Python ★ 0 6y ago
rltf ⑂
Reinforcement Learning implementations and research prototyping in TensorFlow
★ 0 7y ago
Stable-Baselines-Basic
Basic use of stable baselines for reinforcement learning
Python ★ 0 6y ago
ts_tutorial ⑂
No description.
★ 0 8y ago
Bayesian-linear-regression
Tensorflow implementation of Bayesian linear regression (Weight Uncertainty in Neural Networks)
Python ★ 0 6y ago
ImageCaptionWithAttention
Tensorflow implementation of Image Caption with Attention
Python ★ 0 7y ago
responsereviewers ⑂
A basic LaTeX template document for creating journal response to reviewers letters
TeX ★ 0 12y ago