
MindNLP



News 📢

  • 🔥 Latest Features

• Fully compatible with 🤗HuggingFace: MindNLP enables seamless execution of any Transformers/Diffusers model on MindSpore across all hardware platforms (GPU/Ascend/CPU).

      You may still invoke models through MindNLP as shown in the example code below:

      from mindnlp.transformers import AutoTokenizer, AutoModel
      
      tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
      model = AutoModel.from_pretrained("bert-base-uncased")
      
inputs = tokenizer("Hello world!", return_tensors="ms")  # return MindSpore tensors so the model call below works
      outputs = model(**inputs)

You can also use the native HuggingFace libraries (such as transformers, diffusers, etc.) directly, as demonstrated in the example code below:

      # for huggingface transformers
      import mindspore
      import mindnlp
      from transformers import pipeline
      
      chat = [
          {"role": "system", "content": "You are a sassy, wise-cracking robot as imagined by Hollywood circa 1986."},
          {"role": "user", "content": "Hey, can you tell me any fun things to do in New York?"}
      ]
      
      pipeline = pipeline(task="text-generation", model="meta-llama/Meta-Llama-3-8B-Instruct", ms_dtype=mindspore.bfloat16, device_map="auto")
      response = pipeline(chat, max_new_tokens=512)
      print(response[0]["generated_text"][-1]["content"])

# for huggingface diffusers
import mindspore
import mindnlp
from diffusers import DiffusionPipeline
      
      pipeline = DiffusionPipeline.from_pretrained("stable-diffusion-v1-5/stable-diffusion-v1-5", ms_dtype=mindspore.float16)
      pipeline("An image of a squirrel in Picasso style").images[0]

Notice ⚠️: Due to differences in autograd and parallel execution mechanisms, any training or distributed execution code must utilize the interfaces provided by MindNLP.
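
As a hedged sketch of what this means in practice, the example below assumes that mindnlp.engine exposes Trainer and TrainingArguments interfaces ported from 🤗Transformers; train_dataset and eval_dataset are placeholder names for tokenized MindSpore datasets you prepare separately, so this is an illustration rather than code that runs as-is.

from mindnlp.transformers import AutoModelForSequenceClassification
# Assumption: HF-style training interfaces live in mindnlp.engine
from mindnlp.engine import Trainer, TrainingArguments

model = AutoModelForSequenceClassification.from_pretrained("bert-base-uncased", num_labels=2)

# Illustrative hyperparameters; adjust to your task and hardware.
args = TrainingArguments(
    output_dir="./checkpoints",
    num_train_epochs=3,
    per_device_train_batch_size=16,
)

# train_dataset / eval_dataset are placeholders for MindSpore datasets.
trainer = Trainer(
    model=model,
    args=args,
    train_dataset=train_dataset,
    eval_dataset=eval_dataset,
)
trainer.train()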

Installation

Install from PyPI

You can install the official release of MindNLP, which is published on PyPI.

pip install mindnlp

Daily build

You can download the MindNLP daily build wheel from here.

Install from source

To install MindNLP from source, please run:

pip install git+https://github.com/mindspore-lab/mindnlp.git
# or
git clone https://github.com/mindspore-lab/mindnlp.git
cd mindnlp
bash scripts/build_and_reinstall.sh

Version Compatibility

MindNLP version | MindSpore version  | Supported Python version
--------------- | ------------------ | ------------------------
master          | daily build        | >=3.7.5, <=3.9
0.1.1           | >=1.8.1, <=2.0.0   | >=3.7.5, <=3.9
0.2.x           | >=2.1.0            | >=3.8, <=3.9
0.3.x           | >=2.1.0, <=2.3.1   | >=3.8, <=3.9
0.4.x           | >=2.2.x, <=2.5.0   | >=3.9, <=3.11
0.5.x           | >=2.5.0            | >=3.10, <=3.11
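
For example, to pair a 0.4.x release of MindNLP with a compatible MindSpore build from the table above, one hedged approach is to pin both packages with pip (this assumes a CPU wheel of MindSpore from PyPI; Ascend/GPU builds follow MindSpore's own installation guide):

pip install "mindnlp>=0.4,<0.5" "mindspore>=2.2,<=2.5.0"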

Introduction

MindNLP is an open-source NLP library based on MindSpore. It provides a platform for solving natural language processing tasks and packages many common NLP approaches, helping researchers and developers construct and train models more conveniently and rapidly.

The master branch works with MindSpore master.

Major Features

  • Comprehensive data processing: Several classical NLP datasets are packaged into friendly modules for easy use, such as Multi30k, SQuAD, CoNLL, etc. (a hedged loading sketch follows this list).
  • Friendly NLP model toolset: MindNLP provides various configurable components, making it easy to customize models with MindNLP.
  • Easy-to-use engine: MindNLP simplifies the complicated training process in MindSpore and provides Trainer and Evaluator interfaces to train and evaluate models easily.
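
As a minimal hedged sketch of the data-processing side, the snippet below assumes mindnlp.dataset exposes a load_dataset entry point that wraps 🤗Datasets and returns MindSpore-compatible datasets; the dataset name, split, and inspection call are illustrative only.

# Assumption: an HF-datasets-style loader is available in mindnlp.dataset
from mindnlp.dataset import load_dataset

# Load the training split of the IMDB review dataset.
imdb_train = load_dataset("imdb", split="train")

# Assumption: the returned object behaves like a MindSpore dataset,
# so standard inspection APIs such as get_dataset_size() apply.
print(imdb_train.get_dataset_size())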

Supported models

The list of supported models is too long to include in this README; please check it here.

License

This project is released under the Apache 2.0 license.

Feedbacks and Contact

The dynamic version is still under development. If you find any issues or have ideas for new features, please don't hesitate to contact us via GitHub Issues.

MindSpore NLP SIG

MindSpore NLP SIG (Natural Language Processing Special Interest Group) is the main development team behind the MindNLP framework. It aims to collaborate with developers from industry and academia who are interested in NLP research, application development, and practical deployment. Our goal is to build the best NLP framework on top of MindSpore. We also regularly hold NLP technology sharing sessions and offline events. Interested developers can join our SIG group using the QR code below.

Acknowledgement

MindSpore is an open source project that welcomes any contribution and feedback.
We hope that the toolbox and benchmark can serve the growing research
community by providing a flexible and standardized toolkit to re-implement existing methods
and develop new NLP methods.

Citation

If you find this project useful in your research, please consider citing:

@misc{mindnlp2022,
    title={{MindNLP}: Easy-to-use and high-performance NLP and LLM framework based on MindSpore},
    author={MindNLP Contributors},
    howpublished = {\url{https://github.com/mindlab-ai/mindnlp}},
    year={2022}
}

