Members
-
tree-of-thought-llm ★ PINNED
[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models
Python ★ 6.0k 1y agoExplain → -
LLM-Shearing ★ PINNED
[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
Python ★ 640 2y agoExplain → -
SimCSE ★ PINNED
[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
Python ★ 3.7k 1y agoExplain → -
SimPO ★ PINNED
[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward
Python ★ 954 1y agoExplain → -
MeZO
[NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333
Python ★ 1.2k 2y agoExplain → -
PURE
[NAACL 2021] A Frustratingly Easy Approach for Entity and Relation Extraction https://arxiv.org/abs/2010.12812
Python ★ 813 4y agoExplain → -
LM-BFF
[ACL 2021] LM-BFF: Better Few-shot Fine-tuning of Language Models https://arxiv.org/abs/2012.15723
Python ★ 727 3y agoExplain → -
DensePhrases
[ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too https://arxiv.org/abs/2012.12624
Python ★ 607 4y agoExplain → -
WebShop
[NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents
Python ★ 563 1y agoExplain → -
LESS
[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning
Jupyter Notebook ★ 528 1y agoExplain → -
ALCE
[EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627
Python ★ 520 1y agoExplain → -
AutoCompressors
[EMNLP 2023] Adapting Language Models to Compress Long Contexts
Python ★ 335 1y agoExplain → -
ProLong
Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"
Python ★ 258 9mo agoExplain → -
intercode
[NeurIPS 2023 D&B] Code repository for InterCode benchmark https://arxiv.org/abs/2306.14898
Python ★ 250 2y agoExplain → -
HELMET
The HELMET Benchmark
Jupyter Notebook ★ 218 2mo agoExplain → -
QuRating
[ICML 2024] Selecting High-Quality Data for Training Language Models
Python ★ 204 6mo agoExplain → -
CoFiPruning
[ACL 2022] Structured Pruning Learns Compact and Accurate Models https://arxiv.org/abs/2204.00408
Python ★ 198 3y agoExplain → -
TRIME
[EMNLP 2022] Training Language Models with Memory Augmentation https://arxiv.org/abs/2205.12674
Python ★ 194 3y agoExplain → -
CEPE
[ACL 2024] Long-Context Language Modeling with Parallel Encodings
Python ★ 169 2y agoExplain → -
OptiPrompt
[NAACL 2021] Factual Probing Is [MASK]: Learning vs. Learning to Recall https://arxiv.org/abs/2104.05240
Python ★ 168 3y agoExplain → -
TransformerPrograms
[NeurIPS 2023] Learning Transformer Programs
Python ★ 165 2y agoExplain → -
CharXiv
[NeurIPS 2024] CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs
Python ★ 156 1y agoExplain → -
EntityQuestions
EMNLP'2021: Simple Entity-centric Questions Challenge Dense Retrievers https://arxiv.org/abs/2109.08535
Python ★ 148 4y agoExplain → -
LLMBar
[ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following
Python ★ 138 2y agoExplain → -
MQuAKE
[EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions
Jupyter Notebook ★ 125 1y agoExplain → -
USACO
Can Language Models Solve Olympiad Programming?
Python ★ 124 1y agoExplain → -
DinkyTrain
Princeton NLP's pre-training library based on fairseq with DeepSpeed kernel integration 🚃
Python ★ 117 3y agoExplain → -
LitSearch
[EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search
Python ★ 109 1y agoExplain → -
NLProofS
EMNLP 2022: Generating Natural Language Proofs with Verifier-Guided Search https://arxiv.org/abs/2205.12443
Python ★ 87 1y agoExplain → -
LM-Kernel-FT
A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643
Python ★ 78 2y agoExplain → -
calm-textgame
[EMNLP 2020] Keep CALM and Explore: Language Models for Action Generation in Text-based Games
Python ★ 74 5y agoExplain → -
c-sts
[EMNLP 2023] C-STS: Conditional Semantic Textual Similarity
Python ★ 74 2y agoExplain → -
Edge-Pruning
[NeurIPS 2024 Spotlight] Code and data for the paper "Finding Transformer Circuits with Edge Pruning".
Python ★ 70 10mo agoExplain → -
MADE
EMNLP 2021: Single-dataset Experts for Multi-dataset Question-Answering
Python ★ 68 4y agoExplain → -
Collie
[ICLR 2024] COLLIE: Systematic Construction of Constrained Text Generation Tasks
Jupyter Notebook ★ 63 2y agoExplain → -
DataMUX
[NeurIPS 2022] DataMUX: Data Multiplexing for Neural Networks
Jupyter Notebook ★ 60 3y agoExplain → -
ShortcutGrammar
EMNLP 2022: Finding Dataset Shortcuts with Grammar Induction https://arxiv.org/abs/2210.11560
Jupyter Notebook ★ 59 1y agoExplain → -
corpus-poisoning
[EMNLP 2023] Poisoning Retrieval Corpora by Injecting Adversarial Passages https://arxiv.org/abs/2310.19156
Python ★ 51 2y agoExplain → -
LM-Science-Tutor
No description.
Python ★ 50 1y agoExplain → -
benign-data-breaks-safety
No description.
Python ★ 47 1y agoExplain → -
EvalConvQA
[ACL 2022] Ditch the Gold Standard: Re-evaluating Conversational Question Answering
Python ★ 43 4y agoExplain → -
MABEL
EMNLP 2022: "MABEL: Attenuating Gender Bias using Textual Entailment Data" https://arxiv.org/abs/2210.14975
Python ★ 38 2y agoExplain → -
PTP
Improving Language Understanding from Screenshots. Paper: https://arxiv.org/abs/2402.14073
Python ★ 32 2y agoExplain → -
unintentional-unalignment
[ICLR 2025] Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization
Python ★ 32 5mo agoExplain → -
rationale-robustness
NAACL 2022: Can Rationalization Improve Robustness? https://arxiv.org/abs/2204.11790
Python ★ 27 3y agoExplain → -
lwm
We develop world models that can be adapted with natural language. Intergrating these models into artificial agents allows humans to effectively control these agents through verbal communication.
Python ★ 25 2y agoExplain → -
InstructEval
[NAACL 2024 Findings] Evaluation suite for the systematic evaluation of instruction selection methods.
Jupyter Notebook ★ 24 2y agoExplain → -
ELIZA-Transformer
[NAACL 2025] Representing Rule-based Chatbots with Transformers
Python ★ 23 1y agoExplain → -
WhatICLLearns
[ACL 2023 Findings] What In-Context Learning “Learns” In-Context: Disentangling Task Recognition and Task Learning
Python ★ 21 3y agoExplain → -
Cognac
Repo for paper: Controllable Text Generation with Language Constraints
Python ★ 20 3y agoExplain → -
SRL-NLC
Safe Reinforcement Learning with Natural Language Constraints
★ 17 4y agoExplain → -
semsup
Semantic Supervision: Enabling Generalization over Output Spaces
Python ★ 16 3y agoExplain → -
XTX
[ICLR 2022 Spotlight] Multi-Stage Episodic Control for Strategic Exploration in Text Games
Python ★ 15 4mo agoExplain → -
datamux-pretraining
MUX-PLMs: Pretraining LMs with Data Multiplexing
Python ★ 15 3y agoExplain → -
CopyCat
No description.
Python ★ 14 2y agoExplain → -
dyck-transformer
[ACL 2021] Self-Attention Networks Can Process Bounded Hierarchical Languages
Python ★ 13 5y agoExplain → -
align-mlm
No description.
Python ★ 13 3y agoExplain → -
MultilingualAnalysis
Repository for the paper titled: "When is BERT Multilingual? Isolating Crucial Ingredients for Cross-lingual Transfer"
Python ★ 13 4y agoExplain → -
SocraticAI ⑂
Problem solving by engaging multiple AI agents in conversation with each other and the user.
★ 12 2y agoExplain → -
impersona
No description.
Python ★ 12 9mo agoExplain → -
continual-factoid-memorization
Continual Memorization of Factoids in Large Language Models
Python ★ 12 1y agoExplain → -
il-scaling-in-games
Official code repo of "Scaling Laws for Imitation Learning in Single-Agent Games"
Python ★ 11 1y agoExplain → -
blindfold-textgame
[NAACL 2021] Reading and Acting while Blindfolded: The Need for Semantics in Text Game Agents
Python ★ 11 5y agoExplain → -
metric-wsd
NAACL'2021: Non-Parametric Few-Shot Learning for Word Sense Disambiguation
Python ★ 10 5y agoExplain → -
semsup-xc
SemSup-XC: Semantic Supervision for Extreme Classification
Jupyter Notebook ★ 10 3y agoExplain → -
Heuristic-Core
[ACL 2024] The Heuristic Core: Understanding Subnetwork Generalization in Pretrained Language Models - https://arxiv.org/abs/2403.03942
Python ★ 9 2y agoExplain → -
NegotiationToM
Code release for Improving Dialog Systems for Negotiation with Personality Modeling.
Python ★ 8 4y agoExplain → -
CARETS
No description.
Python ★ 7 4y agoExplain → -
AttentionRules
Extracting rule-based descriptions of attention rules
Python ★ 6 11mo agoExplain → -
attribute-tagging
[LaReL 2022] Towards an Enhanced, Faithful, and Adaptable Web Interaction Environment
Python ★ 5 3y agoExplain → -
SPARTAN
SPARTAN: Sparse Hierarchical Memory for Parameter-Efficient Transformers
Python ★ 5 3y agoExplain → -
MoQA
No description.
Python ★ 3 2y agoExplain → -
lmql ⑂
A query language for programming (large) language models.
★ 2 3y agoExplain → -
cocoa ⑂
Framework for learning dialogue agents in a two-player game setting.
Python ★ 2 5y agoExplain → -
decision-transformer ⑂
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
★ 2 4y agoExplain → -
gym-minigrid ⑂
Minimalistic gridworld package for OpenAI Gym
Python ★ 1 5y agoExplain → -
MC-LAVE-RL ⑂
ICLR 2021: "Monte-Carlo Planning and Learning with Language Action Value Estimates"
Python ★ 1 4y agoExplain → -
DeepSpeed ⑂
DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
Python ★ 0 5y agoExplain → -
RTFM ⑂
Experiment code for the ICLR 2020 paper "RTFM: Generalising to New Environment Dynamics via Reading".
★ 0 4y agoExplain → -
semsup_vae ⑂
Semantic Supervision: Enabling Generalization over Output Spaces
Python ★ 0 3y agoExplain → -
SGI ⑂
No description.
★ 0 5y agoExplain → -
alfred ⑂
ALFRED - A Benchmark for Interpreting Grounded Instructions for Everyday Tasks
★ 0 5y agoExplain → -
TabularSemanticParsing ⑂
Translating natural language questions to a structured query language
Jupyter Notebook ★ 0 5y agoExplain →
No repos match these filters.