- 👋 Hi, I’m @yiakwy-xpu-ml-framework-team
- 👀 I’m interested in accelerating the word through algorithms, chips and intelligence. (compiler/transpiler, c++ ops development/optimization for critical path of overall performance and python bindings for HPC application.)
- 🌱 I’m currently working on core framework infrastracture and AI compilier technologies.
- 📫 Please drop me a message through [email protected]
-
Graphcore
- Bristol
-
18:48
(UTC -12:00)
Popular repositories Loading
-
NV_grouped_gemm
NV_grouped_gemm PublicForked from fanshiqing/grouped_gemm
PyTorch bindings for CUTLASS grouped GEMM for MoE.
Cuda 4
-
Tooklkit-remote-pdb-for-pytorch-distributed
Tooklkit-remote-pdb-for-pytorch-distributed PublicDebugging torch distributed program
Python 3
-
GC-OXFORD-CVPR2021-gbp-poplar
GC-OXFORD-CVPR2021-gbp-poplar PublicForked from joeaortiz/gbp-poplar
Poplar implementation of "Bundle Adjustment on a Graph Processor" (CVPR 2020)
C++ 2
-
NV-DOCA-code-examples
NV-DOCA-code-examples PublicForked from openhackathons-org/NVIDIA-DOCA-App-Code-Sharing
DOCA Application code sharing Contest
-
-
llama.cpp
llama.cpp PublicForked from ggerganov/llama.cpp
Port of Facebook's LLaMA model in C/C++
C 1
If the problem persists, check the GitHub status page or contact support.