BigScience Workshop ORG

@bigscience-workshop ·bigscience.huggingface.co

Research workshop on large language models - The Summer of Language Models 21

36 repos
1.4k followers
0 following

Python 57%
Jupyter Notebook 22%
HTML 9%
Shell 4%
TeX 4%

All public repos (36)

Show forks Show archived

petals

🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading

Petals lets you run massive AI language models like Llama 3.1 (405B) on consumer hardware by splitting the model across multiple volunteer computers over the internet, like BitTorrent for AI inference.

Python ★ 10k 1y ago
Explain →
promptsource

Toolkit for creating, sharing and using natural language prompts.

Python ★ 3.0k 2y ago
Explain →
Megatron-DeepSpeed

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python ★ 1.4k 2y ago
Explain →
bigscience

Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.

Shell ★ 1.0k 1y ago
Explain →
xmtf

Crosslingual Generalization through Multitask Finetuning

Jupyter Notebook ★ 536 1y ago
Explain →
biomedical

Tools for curating biomedical training data for large-scale language modeling

Python ★ 500 1y ago
Explain →
t-zero

Reproduce results and replicate training fo T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)

Python ★ 463 3y ago
Explain →
data-preparation

Code used for sourcing and cleaning the BigScience ROOTS corpus

Jupyter Notebook ★ 318 3y ago
Explain →
lm-evaluation-harness ⑂

A framework for few-shot evaluation of autoregressive language models.

Python ★ 105 3y ago
Explain →
architecture-objective ⑂

No description.

Python ★ 100 2y ago
Explain →
data_tooling

Tools for managing datasets for governance and training.

HTML ★ 90 26d ago
Explain →
lam

Libraries, Archives and Museums (LAM)

★ 89 3y ago
Explain →
multilingual-modeling

BLOOM+1: Adapting BLOOM model to support a new unseen language

Python ★ 74 2y ago
Explain →
evaluation

Code and Data for Evaluation WG

Python ★ 42 4y ago
Explain →
data_sourcing

This directory gathers the tools developed by the Data Sourcing Working Group

Python ★ 31 4y ago
Explain →
metadata

Experiments on including metadata such as URLs, timestamps, website descriptions and HTML tags during pretraining.

Python ★ 30 3y ago
Explain →
model_card

No description.

★ 26 4y ago
Explain →
carbon-footprint

A repository for `codecarbon` logs.

Jupyter Notebook ★ 13 3y ago
Explain →
tokenization

No description.

Python ★ 11 4y ago
Explain →
bigscience-workshop.github.io ⑂

Alternative to https://github.com/Dynalon/mdwiki-seed

HTML ★ 10 4y ago
Explain →
ShadesofBias

Evaluation for Shades of Bias in Text

HTML ★ 10 1y ago
Explain →
bloom-dechonk

A repo for running model shrinking experiments

Python ★ 10 4y ago
Explain →
massive-probing-framework ⑂

Framework for BLOOM probing

Python ★ 9 2y ago
Explain →
pii_processing

PII Processing code to detect and remediate PII in BigScience datasets. Reference implementation for the PII Hackathon

Python ★ 9 3y ago
Explain →
catalogue_data

Scripts to prepare catalogue data

Jupyter Notebook ★ 8 4y ago
Explain →
historical_texts

BigScience working group on language models for historical texts

Jupyter Notebook ★ 8 4y ago
Explain →
transformers ⑂

🤗 Transformers: State-of-the-art Natural Language Processing for Pytorch, TensorFlow, and JAX.

★ 6 4y ago
Explain →
training_dynamics

No description.

★ 5 4y ago
Explain →
amazon-sagemaker-mlflow-fargate ⑂

Managing your machine learning lifecycle with MLflow and Amazon SageMaker

★ 3 5y ago
Explain →
bibliography

A list of BigScience publications

TeX ★ 3 3y ago
Explain →
multilingual-modeling-1 ⑂

No description.

★ 2 3y ago
Explain →
scaling-laws-tokenization

scaling-laws-tokenization

★ 2 4y ago
Explain →
evaluation-robustness-consistency

Tools for evaluating model robustness and consistency

Python ★ 2 4y ago
Explain →
datasets_stats

Generate statistics over datasets used in the context of BS

Makefile ★ 2 4y ago
Explain →
interpretability-ideas

No description.

★ 1 4y ago
Explain →
codecarbon ⑂

Track emissions from Compute and recommend ways to reduce their impact on the environment.

★ 0 4y ago
Explain →