-
Beijing Institute of Technology
- Beijing, China
Stars
The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
🩹Editing large language models within 10 seconds⚡
A generative AI extension for JupyterLab
A high-throughput and memory-efficient inference and serving engine for LLMs
A 13B large language model developed by Baichuan Intelligent Technology
Chinese-LLaMA 1&2、Chinese-Falcon 基础模型;ChatFlow中文对话模型;中文OpenLLaMA模型;NLP预训练/指令微调数据集
MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。
[NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333
Example models using DeepSpeed
BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
Train transformer language models with reinforcement learning.
An Open-Source Framework for Prompt-Learning.
总结梳理自然语言处理工程师(NLP)需要积累的各方面知识,包括面试题,各种基础知识,工程能力等等,提升核心竞争力
Conversational RPA SDK for Chatbot Makers. Join our Discord: https://discord.gg/7q8NBZbQzt
Code for COLING 2020 paper "Improving Document-level Sentiment Analysis with User and Product Context"
Alibaba Java Coding Guidelines pmd implements and IDE plugin
2023-2025中国节假日、调休、补班日历,ICS格式,可供IPhone、Google Calendar、Outlook等客户端订阅,包含节假日API
Code for ACL 2021 paper "ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information"
Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.
古文语言理解测评基准 Classical Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
A Multi-modal Model Chinese Spell Checker Released on ACL2021.