-
petals
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
Python ★ 10k 1y agoExplain → -
promptsource
Toolkit for creating, sharing and using natural language prompts.
Python ★ 3.0k 2y agoExplain → -
Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Python ★ 1.4k 2y agoExplain → -
bigscience
Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.
Shell ★ 1.0k 1y agoExplain → -
xmtf
Crosslingual Generalization through Multitask Finetuning
Jupyter Notebook ★ 536 1y agoExplain → -
biomedical
Tools for curating biomedical training data for large-scale language modeling
Python ★ 500 1y agoExplain → -
t-zero
Reproduce results and replicate training fo T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)
Python ★ 463 3y agoExplain → -
data-preparation
Code used for sourcing and cleaning the BigScience ROOTS corpus
Jupyter Notebook ★ 318 3y agoExplain → -
lm-evaluation-harness ⑂
A framework for few-shot evaluation of autoregressive language models.
Python ★ 105 3y agoExplain → -
architecture-objective ⑂
No description.
Python ★ 100 2y agoExplain → -
data_tooling
Tools for managing datasets for governance and training.
HTML ★ 90 26d agoExplain → -
lam
Libraries, Archives and Museums (LAM)
★ 89 3y agoExplain → -
multilingual-modeling
BLOOM+1: Adapting BLOOM model to support a new unseen language
Python ★ 74 2y agoExplain → -
evaluation
Code and Data for Evaluation WG
Python ★ 42 4y agoExplain → -
data_sourcing
This directory gathers the tools developed by the Data Sourcing Working Group
Python ★ 31 4y agoExplain → -
metadata
Experiments on including metadata such as URLs, timestamps, website descriptions and HTML tags during pretraining.
Python ★ 30 3y agoExplain → -
model_card
No description.
★ 26 4y agoExplain → -
carbon-footprint
A repository for `codecarbon` logs.
Jupyter Notebook ★ 13 3y agoExplain → -
tokenization
No description.
Python ★ 11 4y agoExplain → -
bigscience-workshop.github.io ⑂
Alternative to https://github.com/Dynalon/mdwiki-seed
HTML ★ 10 4y agoExplain → -
ShadesofBias
Evaluation for Shades of Bias in Text
HTML ★ 10 1y agoExplain → -
bloom-dechonk
A repo for running model shrinking experiments
Python ★ 10 4y agoExplain → -
massive-probing-framework ⑂
Framework for BLOOM probing
Python ★ 9 2y agoExplain → -
pii_processing
PII Processing code to detect and remediate PII in BigScience datasets. Reference implementation for the PII Hackathon
Python ★ 9 3y agoExplain → -
catalogue_data
Scripts to prepare catalogue data
Jupyter Notebook ★ 8 4y agoExplain → -
historical_texts
BigScience working group on language models for historical texts
Jupyter Notebook ★ 8 4y agoExplain → -
transformers ⑂
🤗 Transformers: State-of-the-art Natural Language Processing for Pytorch, TensorFlow, and JAX.
★ 6 4y agoExplain → -
training_dynamics
No description.
★ 5 4y agoExplain → -
amazon-sagemaker-mlflow-fargate ⑂
Managing your machine learning lifecycle with MLflow and Amazon SageMaker
★ 3 5y agoExplain → -
bibliography
A list of BigScience publications
TeX ★ 3 3y agoExplain → -
multilingual-modeling-1 ⑂
No description.
★ 2 3y agoExplain → -
scaling-laws-tokenization
scaling-laws-tokenization
★ 2 4y agoExplain → -
evaluation-robustness-consistency
Tools for evaluating model robustness and consistency
Python ★ 2 4y agoExplain → -
datasets_stats
Generate statistics over datasets used in the context of BS
Makefile ★ 2 4y agoExplain → -
interpretability-ideas
No description.
★ 1 4y agoExplain → -
codecarbon ⑂
Track emissions from Compute and recommend ways to reduce their impact on the environment.
★ 0 4y agoExplain →
No repos match these filters.