Chenjia Bai

@Baichenjia · baichenjia.github.io

Embodied AI, Reinforcement Learning, LLMs

84 repos
130 followers
16 following

Python 91%
Jupyter Notebook 6%
HTML 2%
C++ 2%

69 contributions in the last year

1-day current streak · 3-day longest streak

All public repos (84)

PBRL ★ PINNED

Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning

Python ★ 29 4y ago
Tensorflow-TCN ★ PINNED

Tensorflow eager implementation of Temporal Convolutional Network (TCN)

Python ★ 129 7y ago
COPO ★ PINNED

Online Preference Alignment for Language Models via Count-based Exploration

Python ★ 20 1y ago
UTDS ★ PINNED

Pessimistic Value Iteration for Multi-Task Data Sharing in Offline RL

Python ★ 18 2y ago
OB2I ★ PINNED

Code for "Principled Exploration via Optimistic Bootstrapping and Backward Induction"

Python ★ 9 5y ago
DB ★ PINNED

Dynamic Bottleneck for Robust Self-Supervised Exploration

Python ★ 6 4y ago
GHER

G-HER algorithm

Python ★ 18 7y ago
MINE

Mutual Information Neural Estimation (Pytorch)

Jupyter Notebook ★ 13 6y ago
Pix2Pix-eager

Tensorflow eager implementation of Pix2Pix (Image-to-image translation with conditional adversarial networks)

Python ★ 12 6y ago
Contrastive-UCB

Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning

Python ★ 11 4y ago
MQN-offline

Monotonic Quantile Network for Worst-Case Offline Reinforcement Learning

Python ★ 7 4y ago
BHER

Code for "Addressing Hindsight Bias in Multi-Goal Reinforcement Learning"

Python ★ 7 5y ago
Gumbel-softmax

Tensorflow eager for "categorical variational autoencoder using the Gumbel-Softmax estimator"

Python ★ 7 7y ago
CeSD

Constrained Ensemble Exploration for Unsupervised Skill Discovery

Python ★ 6 2y ago
NMT-eager

tensorflow implementation of neural machine translation with attention

Python ★ 6 7y ago
mc-dropout

Tensorflow eager code for mc dropout. [Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning]

Python ★ 5 6y ago
VDM

Code for "Variational Dynamic for Self-Supervised Exploration in Deep Reinforcement Learning"

Python ★ 5 4y ago
CycleGAN-eager

Tensorflow implementation of CycleGAN "Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks"

Python ★ 5 6y ago
Resnet

tensorflow implementation of Resnet50 with tf.keras.model and eager

Python ★ 5 7y ago
Graduation-design

No description.

Python ★ 5 10y ago
EmbodiedAI-Book

The code implementation of EmbodiedAI-Book

Python ★ 4 1mo ago
DeepLearning-Self_Driving

DeepLearning for self driving cars

Python ★ 4 9y ago
CIFAR-10-basic

Using a basic neural network with softmax/SVM loss functions to classify the CIFAR-10 image set

Python ★ 4 9y ago
Segmentation-keras

Keras implementation of "Multi-scale context aggregation by dilated convolutions (ICLR 2016)"

Python ★ 3 7y ago
Rainbow ⑂

Rainbow: Combining Improvements in Deep Reinforcement Learning

★ 2 5y ago
BeCL-MI-Entropy

Mutual Information and Entropy estimation of BeCL and baseline methods

Jupyter Notebook ★ 2 3y ago
C3D-tensorflow-eager

Implementation of the C3D model for video recognition proposed in "Learning Spatiotemporal Features with 3D Convolutional Networks"

Python ★ 2 7y ago
Non-local-nets-tensorflow

Implementation of "Non-local neural networks." (CVPR 18) with tensorflow eager execution.

Python ★ 2 7y ago
IMDB-eager

tensorflow implementation of IMDB dataset classification

Python ★ 2 7y ago
RNN-Sequential-Mnist

Tensorflow eager implementation for solving the Sequential-MNIST classification problem using a recurrent neural network

Python ★ 1 7y ago
Embodied-Survey

Figures for Embodied Survey

★ 1 2y ago
TD3

The basic implementation of TD3/DDPG algorithm with Tensorflow 2

Python ★ 1 6y ago
probabilistic-ensemble

No description.

Python ★ 1 5y ago
offline_safe_rl

No description.

★ 1 4y ago
VAEAC

Tensorflow version of Variational Autoencoder with Arbitrary Conditioning

Python ★ 1 6y ago
DCGAN-eager

Tensorflow eager implementation of "DCGAN"

Python ★ 1 6y ago
Style-transfer

Neural Style Transfer with tensorflow eager

Jupyter Notebook ★ 1 6y ago
zeroshot-imitation ⑂

[ICLR 2018] TensorFlow code for zero-shot visual imitation by self-supervised exploration

Python ★ 1 8y ago
noreward-rl ⑂

[ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep Reinforcement Learning

Python ★ 1 7y ago
AI-blog ⑂

Accompanying repository for Let's make a DQN / A3C series.

Python ★ 1 7y ago
Hierarchical-Actor-Critc-HAC- ⑂

This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm.

Python ★ 1 7y ago
keract ⑂

Activation Maps (Layers Outputs) and Gradients in Keras.

Python ★ 1 7y ago
large-scale-curiosity ⑂

Code for the paper "Large-Scale Study of Curiosity-Driven Learning"

Python ★ 1 7y ago
exploration-by-disagreement ⑂

[ICML 2019] TensorFlow Code for Self-Supervised Exploration via Disagreement

Python ★ 1 7y ago
BreakOut-A3C

A3C method in Breakout game

Python ★ 1 7y ago
Mnist-tf-eager

Mnist classification with tensorflow eager execution

Python ★ 1 7y ago
VAE-basic

tensorflow and keras for variational auto-encoder (VAE) model

Python ★ 1 7y ago
Transformer

Basic implementation of Transformer model (Attention is all you need) with Tensorflow eager

Python ★ 1 7y ago
DRAW_RNN_ImageGeneration

TensorFlow Implementation of "DRAW: A Recurrent Neural Network For Image Generation"

Python ★ 1 7y ago
NeuralTuringMachine

Tensorflow implementation of a Neural Turing Machine (simple version)

Python ★ 1 7y ago
PTB-eager

tensorflow implementation of a language model on the PTB dataset

Python ★ 1 7y ago
Leetcode_train

My Leetcode solutions

C++ ★ 1 10y ago
cluster-code

No description.

Python ★ 1 10y ago
baichenjia.github.io

github pages

HTML ★ 0 12h ago
-

Code accompanying the embodied AI book

★ 0 1mo ago
lerobot

test of lerobot dataset

Python ★ 0 3mo ago
LearningHumanoidWalking ⑂

Training a humanoid robot for locomotion using Reinforcement Learning

★ 0 2y ago
SELM

SELM

★ 0 1y ago
humanplus ⑂

HumanPlus: Humanoid Shadowing and Imitation from Humans

★ 0 1y ago
BeCL ⑂

BeCL: Behavior Contrastive Learning for Unsupervised Skill Discovery.

★ 0 3y ago
Temporary_D3IL ⑂

No description.

★ 0 2y ago
tdmpc ⑂

Code for "Temporal Difference Learning for Model Predictive Control"

★ 0 3y ago
parkour ⑂

[CoRL 2023] Robot Parkour Learning

★ 0 2y ago
cliport ⑂

CLIPort: What and Where Pathways for Robotic Manipulation

★ 0 3y ago
walk-these-ways ⑂

Sim-to-real RL training and deployment tools for the Unitree Go1 robot.

★ 0 3y ago
CODAC ⑂

No description.

★ 0 4y ago
EBU

Reproduction of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update" (NeurIPS 2019) with Tensorflow

Python ★ 0 5y ago
CVAE_exploration

No description.

Python ★ 0 5y ago
CAVE_NoisyMinist

No description.

Python ★ 0 5y ago
O-RAAC ⑂

Offline Risk-Averse Actor-Critic (O-RAAC). A model-free RL algorithm for risk-averse RL in a fully offline setting

★ 0 5y ago
SunRise-Constrain

SunRise-Constrain

Python ★ 0 5y ago
sunrise ⑂

SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning

★ 0 6y ago
offline-rl-neurips.github.io ⑂

No description.

★ 0 5y ago
Rebuttal-OEB3

Rebuttal of OEB3

★ 0 5y ago
Bayesian-DQN

Efficient Exploration through Bayesian Deep Q-Networks

★ 0 5y ago
CB

Curiosity Bottleneck in Self-Supervised Learning Setting

Python ★ 0 5y ago
CPC ⑂

Keras implementation of Representation Learning with Contrastive Predictive Coding

★ 0 7y ago
PPO

The basic algorithm of policy gradients

Python ★ 0 6y ago
rltf ⑂

Reinforcement Learning implementations and research prototyping in TensorFlow

★ 0 7y ago
Stable-Baselines-Basic

Basic use of stable baselines for reinforcement learning

Python ★ 0 6y ago
ts_tutorial ⑂

No description.

★ 0 8y ago
Bayesian-linear-regression

Tensorflow implementation of Bayesian linear regression (Weight Uncertainty in Neural Networks)

Python ★ 0 6y ago
ImageCaptionWithAttention

Tensorflow implementation of Image Caption with Attention

Python ★ 0 7y ago
responsereviewers ⑂

A basic LaTeX template document for creating journal response to reviewers letters

TeX ★ 0 12y ago
