64-day longest streak
Hi there 👋 <!-- Benjamin-eecs/benjamin-eecs is a ✨ _special_ ✨ repository because its README.md (this file) appears on your GitHub profile. Here are some ideas to get you started: 🔭…
Hi there 👋

<!--
Benjamin-eecs/benjamin-eecs is a ✨ _special_ ✨ repository because its README.md (this file) appears on your GitHub profile.

Here are some ideas to get you started:
- 🔭 I’m currently working on ...
- 🌱 I’m currently learning ...
- 👯 I’m looking to collaborate on ...
- 🤔 I’m looking for help with ...
- 💬 Ask me about ...
- 📫 How to reach me: ...
- 😄 Pronouns: ...
- ⚡ Fun fact: ...
-
DeepSeek-VL ★ PINNED ⑂
DeepSeek-VL: Towards Real-World Vision-Language Understanding
Python ★ 0 2y agoExplain → -
torchopt ★ PINNED ⑂
No description.
Python ★ 0 2y agoExplain → -
envpool ★ PINNED ⑂
C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.
C++ ★ 1 3y agoExplain → -
Natural-language-RL ★ PINNED ⑂
Natural Language Reinforcement Learning
Python ★ 0 1y agoExplain → -
Theoretical-GMRL
NeurIPS-2022: A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning.
Python ★ 4 3y agoExplain → -
camel ⑂
🐫 CAMEL: Communicative Agents for “Mind” Exploration of Large Scale Language Model Society
Python ★ 1 3y agoExplain → -
Adan ⑂
Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
Python ★ 1 2y agoExplain → -
Benjamin-eecs
No description.
★ 0 4y agoExplain → -
github-readme-stats ⑂
:zap: Dynamically generated stats for your github readmes
JavaScript ★ 0 1mo agoExplain → -
OpenCLI ⑂
Make Any Website & Tool Your CLI. A universal CLI Hub and AI-native runtime. Transform any website, Electron app, or local binary into a standardized command-line interface. Built for AI Agents to discover, learn, and execute tools seamlessly via a unified AGENT.md integration.
JavaScript ★ 0 2d agoExplain → -
TextArena ⑂
A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning
Python ★ 0 1y agoExplain → -
Interplay-LM-Reasoning ⑂
No description.
★ 0 6mo agoExplain → -
socialrl-lab-website ⑂
Wesbite for the Social Reinforcement Learning Lab at the University of Washington
★ 0 6mo agoExplain → -
tinker-cookbook ⑂
Post-training with Tinker
★ 0 7mo agoExplain → -
simply ⑂
Minimal and scalable research codebase in JAX, designed for rapid iteration on frontier research in LLM and other autoregressive models.
★ 0 7mo agoExplain → -
nanochat ⑂
The best ChatGPT that $100 can buy.
★ 0 8mo agoExplain → -
gem ⑂
A Gym for Generalist LLMs
Python ★ 0 8mo agoExplain → -
fairseq2 ⑂
FAIR Sequence Modeling Toolkit 2
★ 0 1y agoExplain → -
DeepSpeed ⑂
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
★ 0 2y agoExplain → -
ngram ⑂
The n-gram Language Model
★ 0 1y agoExplain → -
llm.c ⑂
LLM training in simple, raw C/CUDA
★ 0 2y agoExplain → -
tianshou ⑂
An elegant PyTorch deep reinforcement learning library.
★ 0 3y agoExplain → -
simple-evals ⑂
No description.
★ 0 2y agoExplain → -
evals ⑂
Evals is a framework for evaluating OpenAI models and an open-source registry of benchmarks.
Python ★ 0 3y agoExplain → -
the-incredible-pytorch ⑂
The Incredible PyTorch: a curated list of tutorials, papers, projects, communities and more relating to PyTorch.
★ 0 3y agoExplain → -
optree ⑂
OpTree: Optimized PyTree
Python ★ 0 3y agoExplain → -
mujoco ⑂
Multi-Joint dynamics with Contact. A general purpose physics simulator.
C ★ 0 4y agoExplain → -
gym-docs ⑂
Code for Gym documentation website
★ 0 4y agoExplain → -
NAC ⑂
NeurIPS-2021: Neural Auto-Curricula in Two-Player Zero-Sum Games.
★ 0 4y agoExplain → -
rl ⑂
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
Python ★ 0 3y agoExplain → -
day-day-up ⑂
NOOOOO_noFORK
Go ★ 0 7y agoExplain → -
openbilibili ⑂
哔哩哔哩 bilibili go-common
Go ★ 0 7y agoExplain → -
go-common-bilibili ⑂
No description.
Go ★ 0 7y agoExplain →
No repos match these filters.