<!-- --> <!-- --> <!-- --> Discord English | 中文 | 日本語 Introduction ModelScope is built upon the notion of “Model-as-a-Service” (MaaS). It seeks to bring together most advanced machine…

<!--  -->





<!--  -->
<!--  -->
Discord
English |
中文 |
日本語
Introduction
[ModelScope]( https://www.modelscope.cn) is built upon the notion of “Model-as-a-Service” (MaaS). It seeks to bring together most advanced machine learning models from the AI community, and streamlines the process of leveraging AI models in real-world applications. The core ModelScope library open-sourced in this repository provides the interfaces and implementations that allow developers to perform model inference, training and evaluation.
In particular, with rich layers of API-abstraction, the ModelScope library offers unified experience to explore state-of-the-art models spanning across domains such as CV, NLP, Speech, Multi-Modality, and Scientific-computation. Model contributors of different areas can integrate models into the ModelScope ecosystem through the layered-APIs, allowing easy and unified access to their models. Once integrated, model inference, fine-tuning, and evaluations can be done with only a few lines of codes. In the meantime, flexibilities are also provided so that different components in the model applications can be customized wherever necessary.
Apart from harboring implementations of a wide range of different models, ModelScope library also enables the necessary interactions with ModelScope backend services, particularly with the Model-Hub and Dataset-Hub. Such interactions facilitate management of various entities (models and datasets) to be performed seamlessly under-the-hood, including entity lookup, version control, cache management, and many others.
Models and Online Accessibility
Hundreds of models are made publicly available on [ModelScope]( https://www.modelscope.cn) (700+ and counting), covering the latest development in areas such as NLP, CV, Audio, Multi-modality, and AI for Science, etc. Many of these models represent the SOTA in their specific fields, and made their open-sourced debut on ModelScope. Users can visit ModelScope(modelscope.cn) and experience first-hand how these models perform via online experience, with just a few clicks. Immediate developer-experience is also possible through the ModelScope Notebook, which is backed by ready-to-use CPU/GPU development environment in the cloud - only one click away on ModelScope.
Some representative examples include:
LLM:
Multi-Modal:
CV:Audio:
AI for Science:
Note: Most models on ModelScope are public and can be downloaded directly from the website, please refer to instructions for model download, for downloading models with api provided by modelscope library or git.QuickTour
We provide unified interface for inference using pipeline, fine-tuning and evaluation using Trainer for different tasks.
For any given task with any type of input (image, text, audio, video...), inference pipeline can be implemented with only a few lines of code, which will automatically load the underlying model to get inference result, as is exemplified below:
python
>>> from modelscope.pipelines import pipeline
>>> word_segmentation = pipeline('word-segmentation',model='damo/nlp_structbert_word-segmentation_chinese-base')
>>> word_segmentation('今天天气不错,适合出去游玩')
{'output': '今天 天气 不错 , 适合 出去 游玩'}
Given an image, portrait matting (aka. background-removal) can be accomplished with the following code snippet:

python
>>> import cv2
>>> from modelscope.pipelines import pipeline
>>> portrait_matting = pipeline('portrait-matting')
>>> result = portrait_matting('https://modelscope.oss-cn-beijing.aliyuncs.com/test/images/image_matting.png')
>>> cv2.imwrite('result.png', result['output_img'])
The output image with the background removed is:

Fine-tuning and evaluation can also be done with a few more lines of code to set up training dataset and trainer, with the heavy-lifting work of training and evaluation a model encapsulated in the implementation of trainer.train() andtrainer.evaluate() interfaces.
For example, the gpt3 base model (1.3B) can be fine-tuned with the chinese-poetry dataset, resulting in a model that can be used for chinese-poetry generation.
python
>>> from modelscope.metainfo import Trainers
>>> from modelscope.msdatasets import MsDataset
>>> from modelscope.trainers import build_trainer
>>> train_dataset = MsDataset.load('chinese-poetry-collection', split='train'). remap_columns({'text1': 'src_txt'})
>>> eval_dataset = MsDataset.load('chinese-poetry-collection', split='test').remap_columns({'text1': 'src_txt'})
>>> max_epochs = 10
>>> tmp_dir = './gpt3_poetry'
>>> kwargs = dict(
model='damo/nlp_gpt3_text-generation_1.3B',
train_dataset=train_dataset,
eval_dataset=eval_dataset,
max_epochs=max_epochs,
work_dir=tmp_dir)
>>> trainer = build_trainer(name=Trainers.gpt3_trainer, default_args=kwargs)
>>> trainer.train()
Why should I use ModelScope library
1. A unified and concise user interface is abstracted for different tasks and different models. Model inferences and training can be implemented by as few as 3 and 10 lines of code, respectively. It is convenient for users to explore models in different fields in the ModelScope community. All models integrated into ModelScope are ready to use, which makes it easy to get started with AI, in both educational and industrial settings.
2. ModelScope offers a model-centric development and application experience. It streamlines the support for model training, inference, export and deployment, and facilitates users to build their own MLOps based on the ModelScope ecosystem.
3. For the model inference and training process, a modular design is put in place, and a wealth of functional module implementations are provided, which is convenient for users to customize their own model inference, training and other processes.
4. For distributed model training, especially for large models, it provides rich training strategy support, including data parallel, model parallel, hybrid parallel and so on.
Installation
Docker
ModelScope Library currently supports popular deep learning framework for model training and inference, including PyTorch, TensorFlow and ONNX. All releases are tested and run on Python 3.7+, Pytorch 1.8+, Tensorflow1.15 or Tensorflow2.0+.
To allow out-of-box usage for all the models on ModelScope, official docker images are provided for all releases. Based on the docker image, developers can skip all environment installation and configuration and use it directly. Currently, the latest version of the CPU image and GPU image can be obtained from:
CPU docker imageshell
# py37
registry.cn-hangzhou.aliyuncs.com/modelscope-repo/modelscope:ubuntu20.04-py37-torch1.11.0-tf1.15.5-1.6.1
# py38
registry.cn-hangzhou.aliyuncs.com/modelscope-repo/modelscope:ubuntu20.04-py38-torch2.0.1-tf2.13.0-1.9.5
GPU docker image
shell
# py37
registry.cn-hangzhou.aliyuncs.com/modelscope-repo/modelscope:ubuntu20.04-cuda11.3.0-py37-torch1.11.0-tf1.15.5-1.6.1
# py38
registry.cn-hangzhou.aliyuncs.com/modelscope-repo/modelscope:ubuntu20.04-cuda11.8.0-py38-torch2.0.1-tf2.13.0-1.9.5
Setup Local Python Environment
One can also set up local ModelScope environment using pip and conda. ModelScope supports python3.7 and above.
We suggest anaconda for creating local python environment:
shell
conda create -n modelscope python=3.8
conda activate modelscope
PyTorch or TensorFlow can be installed separately according to each model's requirements.
After installing the necessary machine-learning framework, you can install modelscope library as follows: If you only want to play around with the modelscope framework, of trying out model/dataset download, you can install the core modelscope components:
shell
pip install modelscope
If you want to use multi-modal models:
shell
pip install modelscope[multi-modal]
If you want to use nlp models:
shell
pip install modelscope[nlp] -f https://modelscope.oss-cn-beijing.aliyuncs.com/releases/repo.html
If you want to use cv models:
shell
pip install modelscope[cv] -f https://modelscope.oss-cn-beijing.aliyuncs.com/releases/repo.html
If you want to use audio models:
shell
pip install modelscope[audio] -f https://modelscope.oss-cn-beijing.aliyuncs.com/releases/repo.html
If you want to use science models:
shell
pip install modelscope[science] -f https://modelscope.oss-cn-beijing.aliyuncs.com/releases/repo.html
Notes:
1. Currently, some audio-task models only support python3.7, tensorflow1.15.4 Linux environments. Most other models can be installed and used on Windows and Mac (x86).
shell
sudo apt-get update
sudo apt-get install libsndfile1
3. Some models in computer vision need mmcv-full, you can refer to mmcv installation guide, a minimal installation is as follows:
shell
pip uninstall mmcv # if you have installed mmcv, uninstall it
pip install -U openmim
mim install mmcv-full
Learn More
We provide additional documentations including:
- More detailed Installation Guide
- Introduction to tasks
- Use pipeline for model inference
- Finetuning example
- Preprocessing of data
- Evaluation
- Contribute your own model to ModelScope
License
This project is licensed under the [Apache License (Version 2.0)](https://git
…
Members
-
modelscope ★ PINNED
ModelScope: bring the notion of Model-as-a-Service to life.
Python ★ 9.0k 16h agoExplain → -
FunASR ★ PINNED
Industrial-grade speech recognition toolkit: 170x realtime, 50+ languages, speaker diarization, emotion detection, streaming, and OpenAI-compatible API.
Python ★ 18k 1h agoExplain → -
DiffSynth-Studio ★ PINNED
Enjoy the magic of Diffusion models!
Python ★ 13k 3d agoExplain → -
ms-swift ★ PINNED
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-V4, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Gemma4, Llava, Phi4, ...) (AAAI 2025).
Python ★ 15k 2d agoExplain → -
evalscope ★ PINNED
A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.
Python ★ 3.0k 2d agoExplain → -
ms-agent ★ PINNED
MS-Agent: a lightweight framework to empower agentic execution of complex tasks
Python ★ 4.3k 3d agoExplain → -
facechain
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
Jupyter Notebook ★ 9.5k 1y agoExplain → -
FunClip
Open-source, accurate and easy-to-use video speech recognition & clipping tool. LLM-based AI clipping integrated.
Python ★ 5.8k 1h agoExplain → -
ClearerVoice-Studio
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
Python ★ 4.3k 10mo agoExplain → -
3D-Speaker
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Python ★ 3.0k 6mo agoExplain → -
AgentEvolver
AgentEvolver: Towards Efficient Self-Evolving Agent System
Python ★ 1.5k 2mo agoExplain → -
modelscope-classroom
No description.
Jupyter Notebook ★ 1.4k 1mo agoExplain → -
sirchmunk
🐿️ Sirchmunk: Raw data to self-evolving intelligence, real-time.
Python ★ 1.1k 2d agoExplain → -
scepter
SCEPTER is an open-source framework used for training, fine-tuning, and inference with generative models.
Python ★ 550 1y agoExplain → -
KAN-TTS
KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-to-speech
Python ★ 524 2y agoExplain → -
richdreamer
[CVPR2024 (Highlight)] RichDreamer: A Generalizable Normal-Depth Diffusion Model for Detail Richness in Text-to-3D. Live Demo:https://modelscope.cn/studios/Damo_XR_Lab/3D_AIGC
Python ★ 479 1y agoExplain → -
AdaSeq
AdaSeq: An All-in-One Library for Developing State-of-the-Art Sequence Understanding Models
Python ★ 454 2y agoExplain → -
FunCodec
FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.
Python ★ 443 2y agoExplain → -
awesome-deep-reasoning
Collect every awesome work about r1!
Python ★ 432 1y agoExplain → -
DiffSynth-Engine
No description.
Python ★ 416 8d agoExplain → -
easydistill
a toolkit on knowledge distillation for large language models
Python ★ 411 3mo agoExplain → -
motionagent
MotionAgent is your AI assistent to convert ideas into motion pictures.
Python ★ 311 1y agoExplain → -
Nexus-Gen
No description.
Python ★ 294 10mo agoExplain → -
dash-infer
DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including CUDA, x86 and ARMv9.
C ★ 272 10mo agoExplain → -
MCPBench
The evaluation benchmark on MCP servers
Python ★ 247 9mo agoExplain → -
twinkle
Twinkle✨: Training workbench to make your model glow.
Python ★ 241 2h agoExplain → -
AgentJet
Cutting-edge platform for LLM agent tuning. Deliver RL tuning with flexibility, reliability, speed, multi-agent optimization and realtime community benchmarking.
Python ★ 217 16d agoExplain → -
kws-training-suite
No description.
Python ★ 160 3y agoExplain → -
ultron
Ultron: Collective Intelligence System — Shared Memories, Skills, and Harnesses Across Every Agent
Python ★ 156 5d agoExplain → -
modelscope-studio
A third-party component library based on Gradio. Integrates Ant Design, Ant Design X, Monaco Editor and more advanced components to help you build applications efficiently.
Python ★ 143 1mo agoExplain → -
normal-depth-diffusion
No description.
Python ★ 132 2y agoExplain → -
Awesome-Vibe-Research
An open, collaboratively-built repository for AI-assisted scientific research — collecting and curating agents, skills, workflows, tools, and best practices across the full research lifecycle. 面向 AI 辅助科研的开放共建仓库 收集和沉淀科研全流程中的 agents、skills、workflows、tools 与最佳实践
Python ★ 129 2h agoExplain → -
lite-sora
An initiative to replicate Sora
Python ★ 104 2y agoExplain → -
mcore-bridge
MCore-Bridge: Providing Megatron-Core model definitions for state-of-the-art large models and making Megatron training as simple as Transformers — with support for 300+ large language models (Qwen3-Next, GLM-5.1, Deepseek-V4, MiniMax-2.7, ...) and 200+ multimodal large models (Qwen3.5, Qwen3-Omni, Gemma4, ...).
Python ★ 77 2d agoExplain → -
flowra
No description.
Python ★ 72 6mo agoExplain → -
ms-enclave
A modular and stable agent sandbox runtime environment.
Python ★ 54 26d agoExplain → -
ImagePulse
Open Image Curation Tools
Python ★ 47 1y agoExplain → -
PromptScope
Enjoy easier conversations with LLM
Python ★ 46 1y agoExplain → -
mcp-central
Collection of model-centric MCP servers
Python ★ 26 1y agoExplain → -
Katz
[ATC'25] Katz is a high-performance serving system designed specifically for diffusion model workflows with multiple adapters.
Python ★ 23 1y agoExplain → -
modelscope-mcp-server
ModelScope's official MCP Server (in active development).
Python ★ 21 6mo agoExplain → -
sirchmunk-web
The source code for: https://modelscope.github.io/sirchmunk-web/
HTML ★ 17 2d agoExplain → -
r-chain
No description.
Python ★ 11 1y agoExplain → -
Trinity-Studio
No description.
JavaScript ★ 8 1y agoExplain → -
modelscope_hub
The official Python client to connect with ModelScope Hub.
Python ★ 6 3d agoExplain → -
langchain-modelscope
Langchain integration for ModelScope
Python ★ 5 9mo agoExplain → -
comfyscope
Collection of various Comfy components.
Python ★ 4 1y agoExplain → -
twinkle-web
The github website for twinkle-kit
CSS ★ 2 5d agoExplain → -
modelscope-skills
Equip your agents with skills to interact with ModelScope
★ 1 1mo agoExplain → -
diffusion-templates-web
No description.
HTML ★ 0 1mo agoExplain →
No repos match these filters.