-
UltraChat ★ PINNED
Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)
Python ★ 2.9k 2y ago
-
OpenDelta ★ PINNED
A plug-and-play library for parameter-efficient-tuning (Delta Tuning)
Python ★ 1.0k 1y ago
-
OpenPrompt ★ PINNED
An Open-Source Framework for Prompt-Learning.
Python ★ 4.9k 1y ago
-
OpenNRE ★ PINNED
An Open-Source Package for Neural Relation Extraction (NRE)
Python ★ 4.5k 2y ago
-
OpenKE ★ PINNED
An Open-Source Package for Knowledge Embedding (KE)
Python ★ 4.0k 2y ago
-
OpenNE ★ PINNED
An Open-Source Package for Network Embedding (NE)
Python ★ 1.7k 2y ago
-
GNNPapers
Must-read papers on graph neural networks (GNN)
★ 17k 2y ago
-
WantWords
An open-source online reverse dictionary.
JavaScript ★ 7.1k 4y ago
-
PromptPapers
Must-read papers on prompt-based tuning for pre-trained language models.
★ 4.3k 2y ago
-
PLMpapers
Must-read Papers on pre-trained language models.
★ 3.4k 3y ago
-
NRLPapers
Must-read papers on network representation learning (NRL) / network embedding (NE)
TeX ★ 2.5k 5y ago
-
THULAC-Python
An Efficient Lexical Analyzer for Chinese
Python ★ 2.1k 4y ago
-
TAADpapers
Must-read Papers on Textual Adversarial Attack and Defense
Python ★ 1.6k 1y ago
-
KRLPapers
Must-read papers on knowledge representation learning (KRL) / knowledge embedding (KE)
TeX ★ 1.5k 4y ago
-
KB2E
Knowledge Graph Embeddings including TransE, TransH, TransR and PTransE
C++ ★ 1.4k 3y ago
-
ERNIE
Source code and dataset for ACL 2019 paper "ERNIE: Enhanced Language Representation with Informative Entities"
Python ★ 1.4k 2y ago
-
THUOCL
THUOCL (THU Open Chinese Lexicon), a Chinese word lexicon
★ 1.1k 3y ago
-
NREPapers
Must-read papers on neural relation extraction (NRE)
TeX ★ 1.0k 5y ago
-
OpenCLaP
Open Chinese Language Pre-trained Model Zoo
★ 985 6y ago
-
ToolLearningPapers
No description.
★ 922 1y ago
-
WebCPM
Official codes for ACL 2023 paper "WebCPM: Interactive Web Search for Chinese Long-form Question Answering"
HTML ★ 911 2y ago
-
RCPapers
Must-read papers on Machine Reading Comprehension
★ 889 6y ago
-
LLMxMapReduce
No description.
Python ★ 874 3mo ago
-
THULAC
An Efficient Lexical Analyzer for Chinese
C++ ★ 832 3y ago
-
NRE
Neural Relation Extraction, including CNN, PCNN, CNN+ATT, PCNN+ATT
C++ ★ 809 5y ago
-
Chinese_Rumor_Dataset
Chinese rumor data
★ 782 6y ago
-
OpenAttack
An Open-Source Package for Textual Adversarial Attack.
Python ★ 779 2y ago
-
FewRel
A Large-Scale Few-Shot Relation Extraction Dataset
Python ★ 746 4y ago
-
OPD
Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe
Python ★ 689 21d ago
-
DocRED
Dataset and codes for ACL 2019 DocRED: A Large-Scale Document-Level Relation Extraction Dataset.
Python ★ 650 5y ago
-
OpenHowNet
Core Data of HowNet and OpenHowNet Python API
Python ★ 638 4y ago
-
ProactiveAgent
An LLM-based agent that predicts its tasks proactively.
Python ★ 612 1mo ago
-
TensorFlow-TransX
An implementation of TransE and its extended models for Knowledge Representation Learning on TensorFlow
Python ★ 513 3y ago
-
CAIL
Chinese AI & Law Challenge
★ 510 7y ago
-
LegalPapers
Must-read Papers on Legal Intelligence
★ 498 5y ago
-
BERT-KPE
No description.
Python ★ 447 3y ago
-
OpenMatch ▣
An Open-Source Package for Information Retrieval.
Python ★ 442 3y ago
-
LLaVA-UHD
LLaVA-UHD v3: Progressive Visual Compression for Efficient Native-Resolution Encoding in MLLMs
Python ★ 424 6mo ago
-
Fast-TransX
An Efficient implementation of TransE and its extended models for Knowledge Representation Learning
C++ ★ 405 3y ago
-
InfLLM
The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Memory"
Python ★ 404 2y ago
-
Few-NERD
Code and data of ACL 2021 paper "Few-NERD: A Few-shot Named Entity Recognition Dataset"
Python ★ 400 2y ago
-
TensorFlow-Summarization
No description.
Python ★ 386 8y ago
-
BMCourse
The repo for Tsinghua summer course: Interdisciplinary Seminar on Big Models
Python ★ 371 3y ago
-
LEGENT
Open Platform for Embodied Agents
Python ★ 341 1y ago
-
THULAC-Java
An Efficient Lexical Analyzer for Chinese
Java ★ 339 8y ago
-
ChatEval
Codes for our paper "ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate"
Python ★ 337 1y ago
-
SOS4NLP
Survey of Surveys for Natural Language Processing (SOS4NLP)
★ 327 5y ago
-
NSC
Neural Sentiment Classification
Python ★ 287 8y ago
-
DeltaPapers
Must-read Papers of Parameter-Efficient Tuning (Delta Tuning) Methods on Pre-trained Models.
★ 283 3y ago
-
JustRL
[ICLR 2026 Blogpost Track Poster] JustRL: Scaling a 1.5B LLM with a Simple RL Recipe
Python ★ 280 2mo ago
-
Chinese_NRE
Source code for ACL 2019 paper "Chinese Relation Extraction with Multi-Grained Information and External Linguistic Knowledge"
Python ★ 274 6y ago
-
PL-Marker
Source code for "Packed Levitated Marker for Entity and Relation Extraction"
Python ★ 272 3y ago
-
THUCTC
An Efficient Chinese Text Classifier
Java ★ 214 7y ago
-
OpenBackdoor
An open-source toolkit for textual backdoor attack and defense (NeurIPS 2022 D&B, Spotlight)
Python ★ 209 3y ago
-
KnowledgeablePromptTuning
kpt code
Python ★ 209 3y ago
-
OpenQA
The source code of ACL 2018 paper "Denoising Distantly Supervised Open-Domain Question Answering".
Python ★ 205 7y ago
-
SCPapers
Must-read Papers on Sememe Computation
★ 201 3y ago
-
SE-WRL
Improved Word Representation Learning with Sememes
C ★ 196 7y ago
-
LegalPLMs
Source code and checkpoints for legal pre-trained language models.
Python ★ 194 5y ago
-
HATT-Proto
Code and dataset of AAAI2019 paper Hybrid Attention-Based Prototypical Networks for Noisy Few-Shot Relation Classification
Python ★ 190 7y ago
-
JointNRE
Joint Neural Relation Extraction with Text and KGs
Python ★ 186 3y ago
-
KernelGAT
The source codes for Fine-grained Fact Verification with Kernel Graph Attention Network.
Python ★ 182 3y ago
-
NLP-THU
NLP Course Material & QA
★ 175 4y ago
-
Auto_CLIWC
Code for Chinese LIWC Lexicon Expansion via Hierarchical Classification of Word Embeddings with Sememe Attention (AAAI18)
Python ★ 168 8y ago
-
OOP-THU
OOP Course Material & QA
★ 165 6y ago
-
PTR
Prompt Tuning with Rules
Python ★ 162 3y ago
-
MoEfication
No description.
Python ★ 145 1y ago
-
TritonBench
TritonBench: Benchmarking Large Language Model Capabilities for Generating Triton Operators
Python ★ 134 1y ago
-
DeepNote
No description.
Python ★ 134 1y ago
-
attribute_charge
The source code of our COLING'18 paper "Few-Shot Charge Prediction with Discriminative Legal Attributes".
Python ★ 132 7y ago
-
THUCKE
THU Chinese Keyphrase Extraction Toolkit
C++ ★ 124 8y ago
-
LEVEN
Source code and dataset for ACL2022 Findings Paper "LEVEN: A Large-Scale Chinese Legal Event Detection dataset"
Python ★ 123 2y ago
-
ConceptFlow
No description.
Python ★ 120 3y ago
-
Ouroboros
Ouroboros: Speculative Decoding with Large Model Enhanced Drafting (EMNLP 2024 main)
Python ★ 117 1y ago
-
MatPlotAgent
No description.
Python ★ 116 1y ago
-
CAIL2018
No description.
Python ★ 115 8y ago
-
AGE
Source code and dataset for KDD 2020 paper "Adaptive Graph Encoder for Attributed Graph Embedding"
Python ★ 114 3y ago
-
MultiRD
Code and data of the AAAI-20 paper "Multi-channel Reverse Dictionary Model"
Python ★ 110 5y ago
-
RE-Context-or-Names
Bert-based models(BERT, MTB, CP) for relation extraction.
Python ★ 103 4y ago
-
TransNet
Source code and datasets of IJCAI2017 paper "TransNet: Translation-Based Network Representation Learning for Social Relation Extraction".
Jupyter Notebook ★ 101 8y ago
-
GEAR
Source code for ACL 2019 paper "GEAR: Graph-based Evidence Aggregating and Reasoning for Fact Verification"
Python ★ 100 1y ago
-
TopJudge
No description.
Python ★ 100 7y ago
-
Prompt-Transferability ▣
On Transferability of Prompt Tuning for Natural Language Processing
Python ★ 99 2y ago
-
HNRE
Hierarchical Neural Relation Extraction
Python ★ 93 5y ago
-
KV-PLM
Source code for "A Deep-learning System Bridging Molecule Structure and Biomedical Text with Comprehension Comparable to Human Professionals"
Python ★ 89 2y ago
-
Migician
[ACL2025 Findings] Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models
Python ★ 89 1y ago
-
XQA
Dataset and baseline for ACL 2019 paper "XQA: A Cross-lingual Open-domain Question Answering Dataset"
Python ★ 89 4y ago
-
SememePSO-Attack
Code and data of the ACL 2020 paper "Word-level Textual Adversarial Attacking as Combinatorial Optimization"
Python ★ 88 5y ago
-
HMEAE
Source code for EMNLP-IJCNLP 2019 paper "HMEAE: Hierarchical Modular Event Argument Extraction".
Python ★ 87 4y ago
-
DebugBench
The repository for the paper "DebugBench: Evaluating Debugging Capability of Large Language Models".
Python ★ 86 1y ago
-
ERICA ▣
Source code for ACL 2021 paper "ERICA: Improving Entity and Relation Understanding for Pre-trained Language Models via Contrastive Learning"
Python ★ 85 5y ago
-
TKRL
Representation Learning of Knowledge Graphs with Hierarchical Types (IJCAI-2016)
C++ ★ 82 7y ago
-
ChartCoder
[ACL'25 Main] ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code Generation
Python ★ 79 6mo ago
-
CLAIM
No description.
★ 79 6y ago
-
Advbench
Code and data of the EMNLP 2022 paper "Why Should Adversarial Perturbations be Imperceptible? Rethink the Research Paradigm in Adversarial NLP".
Python ★ 78 3y ago
-
NeuIRPapers
Must-read Papers on Neural Information Retrieval
★ 74 6y ago
-
TLNN
Source code for EMNLP-IJCNLP 2019 paper "Event Detection with Trigger-Aware Lattice Neural Network".
Python ★ 74 6y ago
-
MMDW
Max-margin DeepWalk
Java ★ 73 9y ago
-
Optima
Code for paper "Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System"
Python ★ 72 1y ago
-
SelectiveMasking
Source code for "Train No Evil: Selective Masking for Task-Guided Pre-Training"
Python ★ 70 3y ago
-
KARL
KARL: Knowledge-Aware Reasoning and Reinforcement Learning for Knowledge-Intensive Visual Grounding
Python ★ 68 2mo ago
-
ConversationQueryRewriter
Code and Data for SIGIR 2020 Paper "Few-Shot Generative Conversational Query Rewriting"
Roff ★ 68 3y ago
-
CorefBERT
Source code for EMNLP 2020 paper "Coreferential Reasoning Learning for Language Representation"
Python ★ 67 3y ago
-
H-Neurons
The official implementation of the paper: H-Neurons: On the Existence, Impact, and Origin of Hallucination-Associated Neurons in LLMs
Python ★ 66 5mo ago
-
Muffin
No description.
Python ★ 65 2y ago
-
jec-qa
The repository of jec-qa.
Python ★ 63 6y ago
-
Knowledge-Plugin
[ACL 2023] Plug-and-Play Knowledge Injection for Pre-trained Language Models
Python ★ 61 2y ago
-
Adaptive-Note
No description.
Python ★ 60 1y ago
-
Delta-CoMe
Delta-CoMe can achieve near-lossless 1-bit compression; the paper has been accepted by NeurIPS 2024.
Python ★ 59 1y ago
-
MuGNN
Source code for ACL2019 paper "Multi-Channel Graph Neural Network for Entity Alignment".
Python ★ 59 5y ago
-
EmbodiedEval
Evaluate Multimodal LLMs as Embodied Agents
Python ★ 58 1y ago
-
DIAG-NRE
Source code for ACL 2019 paper "DIAG-NRE: A Neural Pattern Diagnosis Framework for Distantly Supervised Neural Relation Extraction".
Python ★ 56 7y ago
-
FR-Spec
[ACL 2025 main] FR-Spec: Frequency-Ranked Speculative Sampling
C++ ★ 55 11mo ago
-
paragraph2vec
Paragraph Vector Implementation
Python ★ 55 9y ago
-
PEVL
Source code for EMNLP 2022 paper “PEVL: Position-enhanced Pre-training and Prompt Tuning for Vision-language Models”
Python ★ 49 3y ago
-
SE-WRL-SAT
Revised Version of SAT Model in "Improved Word Representation Learning with Sememes"
C ★ 49 5y ago
-
TADW
Network Representation Learning with Rich Text Information (IJCAI 2015)
Matlab ★ 48 9y ago
-
TR-BERT
Source code for NAACL 2021 paper "TR-BERT: Dynamic Token Reduction for Accelerating BERT Inference"
Python ★ 48 4y ago
-
duplex-model
No description.
TypeScript ★ 46 1y ago
-
StyleAttack
Code and data of the EMNLP 2021 paper "Mind the Style of Text! Adversarial and Backdoor Attacks Based on Text Style Transfer"
Python ★ 46 3y ago
-
HiddenKiller
Code and data of the ACL-IJCNLP 2021 paper "Hidden Killer: Invisible Textual Backdoor Attacks with Syntactic Trigger"
Python ★ 45 3y ago
-
SubCharTokenization
No description.
Python ★ 45 3y ago
-
MNRE
The code and data for ACL2017 paper "Neural Relation Extraction with Multi-lingual Attention"
C++ ★ 45 9y ago
-
IKRL
Image-embodied Knowledge Representation Learning (IJCAI-2017)
C++ ★ 44 4y ago
-
ConvDR
Code repo for SIGIR 2021 paper "Few-Shot Conversational Dense Retrieval"
Python ★ 43 4y ago
-
THULAC.so
An Efficient Lexical Analyzer for Chinese
C++ ★ 43 6y ago
-
CKRL
Does William Shakespeare REALLY Write Hamlet? Knowledge Representation Learning with Confidence (AAAI-2018)
C++ ★ 43 7y ago
-
VERNet
Source codes of Neural Quality Estimation with Multiple Hypotheses for Grammatical Error Correction
Python ★ 42 5y ago
-
relation-similarity
codes accompanying ACL 2019 paper Quantifying the Similarity between Relations with Fact Distributions
Python ★ 42 1y ago
-
ACDiT
ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer
Python ★ 42 4mo ago
-
EmbodiedAIxLLMPapers
Papers on integrating large language models with embodied AI
★ 39 2y ago
-
Seq1F1B ⑂
Sequence-level 1F1B schedule for LLMs.
Python ★ 37 9mo ago
-
APB
Official Implementation of APB (ACL 2025 main Oral) and Spava (ACL 2026 main).
C++ ★ 37 2mo ago
-
Knowledge-Inheritance ▣
Source code for paper: Knowledge Inheritance for Pre-trained Language Models
Python ★ 37 4y ago
-
ONION
Official implementation of the EMNLP 2021 paper "ONION: A Simple and Effective Defense Against Textual Backdoor Attacks"
Python ★ 37 4y ago
-
iAgents
No description.
Python ★ 37 1y ago
-
hybrid-linear-attention
Code and models for the paper: Hybrid Linear Attention Done Right: Efficient Distillation and Effective Architectures for Extremely Long Contexts
Python ★ 36 2mo ago
-
COS960
COS960: A Chinese Word Similarity Dataset of 960 Word Pairs
Python ★ 36 7y ago
-
DSDocRE
Source code for EMNLP 2020 paper "Denoising Relation Extraction from Document-level Distant Supervision"
Python ★ 34 4y ago
-
Sememe-SC
Source code and data for ACL 2019 paper "Modeling Semantic Compositionality with Sememe Knowledge"
Python ★ 34 5y ago
-
CodRED
No description.
Python ★ 32 4y ago
-
SparsingLaw
The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".
Python ★ 32 1y ago
-
COVID19-Social-Datasets
Datasets of NCP, containing news, rumors and legal documents.
★ 32 5y ago
-
CED
source code for TKDE paper “CED: Credible Early Detection of Social Media Rumors”
Python ★ 32 5y ago
-
explore-and-evaluate
Code for EMNLP2020 paper "Exploring and Evaluating Attributes, Values, and Structures for Entity Alignment".
Python ★ 31 4y ago
-
CokeBERT
CokeBERT: Contextual Knowledge Selection and Embedding towards Enhanced Pre-Trained Language Models
Python ★ 30 3y ago
-
SE-Bench
Official repo for "SE-Bench: Benchmarking Self-Evolution with Knowledge Internalization"
Python ★ 28 2mo ago
-
Model_Emotion
Neuron Activation
Python ★ 27 1y ago
-
DeepTHULAC
A High-Performance Lexical Analyzer for Chinese
Python ★ 27 2y ago
-
Modularity-Analysis
[ACL 2023 Findings] Emergent Modularity in Pre-trained Transformers
Python ★ 26 3y ago
-
LoRAFlow
ACL 2024: LoRA-Flow: Dynamic LoRA Fusion for Large Language Models in Generative Tasks
Python ★ 25 1y ago
-
Document-Plugin
Plug-and-Play Document Modules for Pre-trained Models
Python ★ 25 3y ago
-
SDLM-pytorch
Code accompanying EMNLP 2018 paper Language Modeling with Sparse Product of Sememe Experts
Python ★ 25 7y ago
-
VisualDS
No description.
Python ★ 24 4y ago
-
Character-enhanced-Sememe-Prediction
Code accompanying Incorporating Chinese Characters of Words for Lexical Sememe Prediction (ACL2018) https://arxiv.org/abs/1806.06349
Python ★ 24 7y ago
-
KG-Infused-RAG
Official implementation for the paper "KG-Infused RAG: Augmenting Corpus-Based RAG with External Knowledge Graphs"
Python ★ 23 5mo ago
-
SchemaReinforcementLearning
Learning to Generate STRUCTURED Output with Schema Reinforcement Learning
Python ★ 23 1y ago
-
AutoForm
Code for paper "Beyond Natural Language: LLMs Leveraging Alternative Formats for Enhanced Reasoning and Communication"
Python ★ 23 2y ago
-
FalseQA
Repo for ACL2023 paper "Won't Get Fooled Again: Answering Questions with False Premises"
Python ★ 22 3y ago
-
OHRE
Source code of paper 'Open Hierarchical Relation Extraction' (NAACL 2021)
Python ★ 22 4y ago
-
BabelNet-Sememe-Prediction
Code and data of the AAAI-20 paper "Towards Building a Multilingual Sememe Knowledge Base: Predicting Sememes for BabelNet Synsets"
Python ★ 20 5y ago
-
BlockFFN
Source codes for paper "BlockFFN: Towards End-Side Acceleration-Friendly Mixture-of-Experts with Chunk-Level Activation Sparsity".
Python ★ 19 5mo ago
-
CLSP
Code and data for EMNLP 2018 paper "Cross-lingual Lexical Sememe Prediction"
C ★ 19 7y ago
-
LME
Neural Entity Typing with Language Model Enhancement
Python ★ 18 7y ago
-
NOSA
The official implementation of NOSA
Python ★ 17 10d ago
-
DANCE
No description.
Python ★ 16 4y ago
-
BkdAtk-LWS
Code and data of the ACL 2021 paper "Turn the Combination Lock: Learnable Textual Backdoor Attacks via Word Substitution"
Python ★ 16 5y ago
-
LEAD
Enhancing Legal Case Retrieval via Scaling High-quality Synthetic Query-Candidate Pairs (EMNLP 2024)
Python ★ 16 1y ago
-
EREN
Official codes for COLING 2024 paper "Robust and Scalable Model Editing for Large Language Models": https://arxiv.org/abs/2403.17431v1
Python ★ 14 2y ago
-
SememeWSD
Code and data for the COLING 2020 paper "Try to Substitute: An Unsupervised Chinese Word Sense Disambiguation Method Based on HowNet"
Python ★ 14 5y ago
-
thunlp.github.io
No description.
HTML ★ 13 3y ago
-
hyperbolic_llm
No description.
Python ★ 13 2y ago
-
LLM-generated-text-detection
No description.
Python ★ 13 2y ago
-
Tell_Me_More
Repo for ACL 2024 paper "Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents"
Python ★ 13 2y ago
-
ClueAnchor
[EMNLP 2025 Findings] ClueAnchor: Clue-Anchored Knowledge Reasoning Exploration and Optimization for Retrieval-Augmented Generation
Python ★ 12 1y ago
-
Chujian
A large-scale dataset of Chu bamboo slip scripts and a multi-granularity tokenizer for ancient Chinese scripts
Python ★ 12 1y ago
-
THUCBERT
A Chinese Character BERT Trained with Multi-Level Masking
★ 12 2y ago
-
KACC
KACC: A Multi-task Benchmark for Knowledge Abstraction, Concretization and Completion
★ 12 4y ago
-
CSS-LM
CSS-LM: Contrastive Semi-supervised Fine-tuning of Pre-trained Language Models
Python ★ 12 3y ago
-
oknlp
Open and Knowledgeable NLP Toolkit including CWS, POS Tagging, NER, and Entity Typing
C++ ★ 10 4y ago
-
BurstEngine
BurstEngine is an efficient framework designed to train LLMs on long-sequence data.
Python ★ 9 8mo ago
-
AutoJudge
No description.
Python ★ 9 5y ago
-
CA-LoRA
CA-LoRA: Adapting Existing LoRA for Compressed LLMs to Enable Efficient Multi-Tasking on Personal Devices (COLM 2024)
Python ★ 9 1y ago
-
ChemTrans
No description.
Python ★ 9 2y ago
-
SMP
Single-Shot Meta-Pruning (SMP) for attention heads of Transformers
Python ★ 8 5y ago
-
AgentRM
[ACL 2025 main] AgentRM: Enhancing Agent Generalization with Reward Modeling
Python ★ 6 8mo ago
-
SememeRNN
Code and data of the TASLP paper "Improving Sequence Modeling Ability of Recurrent Neural Networks via Sememes"
Python ★ 6 5y ago
-
SIR-Bench
No description.
Python ★ 5 9mo ago
-
FFD
Source code for NAACL 2019 paper "Fact Discovery from Knowledge Base via Facet Decomposition".
Python ★ 5 7y ago
-
LexChain
No description.
Python ★ 4 2mo ago
-
ToLeaP
No description.
Python ★ 4 1y ago
-
cost-optimal-gqa
The code for the paper "Cost-Optimal Grouped-Query Attention for Long-Context Modeling"
Python ★ 4 9mo ago
-
StateX
The official implementation of the paper "StateX: Enhancing RNN Recall via Post-training State Expansion".
Python ★ 3 7mo ago
-
PretrainingRecommender
Source code for Knowledge Transfer via Pre-training for Recommendation: A Review and Prospect
Python ★ 3 5y ago
-
rethinking-hybrid-attention
Rethinking the Role of Efficient Attention in Hybrid Architectures
Python ★ 2 4d ago
-
DECO
Source code for paper "DECO: Sparse Mixture-of-Experts with Dense-Comparable Performance on End-Side Devices".
Python ★ 2 1mo ago
-
DIET
Official code for "The Overthinker's DIET: Cutting Token Calories with DIfficulty-AwarE Training"
Python ★ 2 1y ago
-
CPMobius
No description.
Python ★ 1 1mo ago
-
LexRel
No description.
Python ★ 1 1mo ago
-
stuffed-mamba
The code of the paper Stuffed Mamba: Oversized States Lead to the Inability to Forget
Python ★ 1 8mo ago