-
Notifications
You must be signed in to change notification settings - Fork 217
Pull requests: pytorch/ao
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Experimental] Enable kleidi AI examples to run on graviton3
#1721
opened Feb 17, 2025 by
akote123
Loading…
CPUOffload: only offload parameters above a certain size
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#1720
opened Feb 16, 2025 by
ngc92
Loading…
Make TorchAO cpp/Python extension
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
fb-exported
topic: not user facing
Use this tag if you don't want this PR to show up in release notes
#1719
opened Feb 14, 2025 by
drisspg
Loading…
fix tensor parallelism for float8 training with rowwise scaling
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
topic: improvement
Use this tag if this PR is an improvement (doesn't fit into any of the other categories)
#1718
opened Feb 14, 2025 by
vkuzo
Loading…
Make FakeQuantizer expose useful config details
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
topic: improvement
Use this tag if this PR is an improvement (doesn't fit into any of the other categories)
#1717
opened Feb 14, 2025 by
andrewor14
Loading…
Update version.txt to 0.10.0
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
topic: not user facing
Use this tag if you don't want this PR to show up in release notes
#1714
opened Feb 14, 2025 by
HDCharles
Loading…
SAM2: Use torch.export for VOS
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#1708
opened Feb 13, 2025 by
cpuhrsch
Loading…
Outline Benchmark Quant APIs
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
topic: for developers
Use this tag if this PR is mainly developer facing
topic: new feature
Use this tag if this PR adds a new feature
#1706
opened Feb 12, 2025 by
jainapurva
•
Draft
[ROCm] preshuffled weight mm
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
module: rocm
fix overflow fp16 act woq int8
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#1698
opened Feb 11, 2025 by
leslie-fang-intel
Loading…
Fix float8nocompile CI workflow
ci
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
topic: not user facing
Use this tag if you don't want this PR to show up in release notes
#1695
opened Feb 10, 2025 by
danielvegamyhre
Loading…
Support MXFP6 packing and fused unpack-dequantise kernel
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#1687
opened Feb 10, 2025 by
alex-titterton
Loading…
Fix This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
topic: bug fix
Use this tag for PRs that fix bugs
DDP
with nf4
CLA Signed
#1684
opened Feb 9, 2025 by
jeromeku
Loading…
ROCm OCP FP8 Support
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
float8
module: rocm
topic: improvement
Use this tag if this PR is an improvement (doesn't fit into any of the other categories)
#1677
opened Feb 6, 2025 by
petrex
Loading…
Add CUTLASS-based row-wise scaled sparse FP8 kernel
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
float8
sparsity
topic: new feature
Use this tag if this PR adds a new feature
#1671
opened Feb 5, 2025 by
alexsamardzic
•
Draft
Feat/blockwise fp8 quant
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#1668
opened Feb 5, 2025 by
Degnel
Loading…
Tensor parallel support for fpx dtype
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#1660
opened Feb 4, 2025 by
jainapurva
•
Draft
Attempt to switch everything to cmake
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#1659
opened Feb 4, 2025 by
drisspg
Loading…
Tensor parallel support for uintx dtype
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
topic: improvement
Use this tag if this PR is an improvement (doesn't fit into any of the other categories)
#1656
opened Feb 3, 2025 by
jainapurva
•
Draft
draft ukernel selection logic
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
[Fix]: Fallback to KleidiAI channelwise kernel groupsize isnt suitable
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#1647
opened Jan 31, 2025 by
nikhil-arm
Loading…
Tests jeanschmidt/NVIDIA_IMEX_CHANNELS changes
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#1644
opened Jan 30, 2025 by
jeanschmidt
Loading…
Create separate float8 tensor subclass
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#1636
opened Jan 29, 2025 by
jainapurva
•
Draft
[not for land] hook up MX to CUDA 12.8 cuBLAS MX gemm
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
topic: not user facing
Use this tag if you don't want this PR to show up in release notes
#1625
opened Jan 27, 2025 by
vkuzo
Loading…
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.