-
Flash-RL
Implementation for FP8/INT8 Rollout for RL training without performence drop.
Python ★ 304 7mo agoExplain → -
DenseMixer
Official implementation for DenseMixer: Improving MoE Post-Training with Precise Router Gradient
Python ★ 67 10mo agoExplain → -
ReaL
Implementation and datasets for "Training Language Models to Generate Quality Code with Program Analysis Feedback"
Python ★ 41 11mo agoExplain → -
verl ⑂
verl: Volcano Engine Reinforcement Learning for LLMs
Python ★ 21 7mo agoExplain → -
LER
Source code and dataset for AAAI2023 paper "Unsupervised Legal Evidence Retrieval via Contrastive Learning with Approximated Positive"
Python ★ 1 2y agoExplain → -
branch-prediction
No description.
C ★ 1 2y agoExplain → -
Reasoning360 ⑂
A repo for open research on building large reasoning models
★ 0 11mo agoExplain → -
cse234-w25 ⑂
Website for CSE 234, Winter 2025
SCSS ★ 0 1y agoExplain → -
cse256-pa2
No description.
Python ★ 0 2y agoExplain → -
mergekit ⑂
Tools for merging pretrained large language models.
Python ★ 0 2y agoExplain → -
lm-evaluation-harness ⑂
A framework for few-shot evaluation of language models.
★ 0 2y agoExplain → -
LLaMA-Factory ⑂
Unify Efficient Fine-tuning of 100+ LLMs
★ 0 2y agoExplain → -
grouped_gemm ⑂
PyTorch bindings for CUTLASS grouped GEMM.
★ 0 2y agoExplain → -
llm ⑂
No description.
★ 0 2y agoExplain → -
AutoGPT ⑂
An experimental open-source attempt to make GPT-4 fully autonomous.
★ 0 2y agoExplain → -
ElementMatchingContest
No description.
Python ★ 0 4y agoExplain → -
pytorch-worker ⑂
A framework for training, evaluating and testing models in pytorch.
★ 0 4y agoExplain →
No repos match these filters.