Skip to content
View duj12's full-sized avatar

Block or report duj12

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
duj12/README.md
  • 👋 Hi, I’m @duj12. I graduated from Tsinghua University, Department of Engineering Physics.
  • 👀 I’m interested in Speech and Spoken Language Processing and Understanding, and Voice Generation.
  • 🌱 I mainly focus on Automatic Speech Recognition, Voice Activity Detection, Key Word Spotting, Language Modeling and related fields.
  • ⏳ Now I'm working on Text to Speech, Zero-Shot Speech Synthesis, and Voice Cloning.
  • 💞️ Hoping to communicate with you in the fields of deep learning, generative artificial intelligence, and large language models, and so on.
  • 📫 How to reach me: [email protected].

Pinned Loading

  1. duj12 duj12 Public

    Config files for my GitHub profile.

  2. cnn-lstm-based-malware-document-classification cnn-lstm-based-malware-document-classification Public

    use cnn/lstm and ensembling model to classify different documents, according to the api sequences each document calls.

    Python 12 3

  3. ss-vad ss-vad Public

    self-supervised vad

    Python 7 2

  4. wekws wekws Public

    Forked from wenet-e2e/wekws

    Production First and Production Ready End-to-End Keyword Spotting Toolkit

    Python 7 2

  5. ASR-2Pass ASR-2Pass Public

    ASR 2Pass onnxruntime and websocket server, based on FunASR(https://github.com/alibaba-damo-academy/FunASR).

    HTML 52 7

  6. kws_demo kws_demo Public

    KWS demo based on CTC prefix beam search.

    Python 12 2