jason-huang03
  • Tsinghua University, NVIDIA
  • Beijing, China

Highlights

  • Pro

Organizations

@thu-nics @thu-ml

Pinned

  1. thu-ml/SageAttention Public

    Quantized Attention that achieves speedups of 2.1-3.1x and 2.7-5.1x compared to FlashAttention2 and xformers, respectively, without losing end-to-end metrics across various models.

    Cuda 959 59

  2. SPH_Project Public

    SPH Realization of Fluid Simulation. Featuring Large Scale Simulation, Rigid-Fluid Coupling and High Viscosity Fluid.

    Python 156 11

  3. mit-han-lab/llm-awq Public

    [MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

    Python 2.7k 230

  4. thu-nics/MoA Public

    The official implementation of the paper <MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression>

    Python 114 6

  5. mit-han-lab/qserve Public

    QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving

    Python 496 31

112 contributions in the last year

Contribution activity

February 2025

Created 3 commits in 2 repositories
Opened 1 pull request in 1 repository
thu-ml/SageAttention 1 merged