-
Notifications
You must be signed in to change notification settings - Fork 437
Issues: openxla/xla
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[ROCm] failed to legalize operation 'math.exp' for exponential op with bf16 dtype
#19700
opened Nov 22, 2024 by
hugomano
[GPU bug] Memcpy local p2p leads to numerical issues (mem access problems).
#19555
opened Nov 20, 2024 by
yliu120
stablehlo.cbrt
crashes with complex numbers when StableHLO spec supports it
#19482
opened Nov 19, 2024 by
mofeing
The tiles of some instructions in Triton fusion are not powers of 2. How did this issue arise?
#19478
opened Nov 19, 2024 by
yulong-chen551
is set access to the previous GPUs at cudamallocasync_allocator.cc cause allocateRaw slower?
#19431
opened Nov 18, 2024 by
zjjott
[cpu] LLVM error during compilation of bf16 convert on aarch64
CPU
Related to XLA on CPU
#19105
opened Nov 6, 2024 by
hawkinsp
XLA:CPU performance regression with the min alignment changed from 16 to 64
#18611
opened Oct 22, 2024 by
snadampal
jax_cpu_enable_async_dispatch is degrading the inference performance on x86 and arm64 cpu backend
#18608
opened Oct 22, 2024 by
snadampal
Unexpected slow-down with
@jit
on simple functions that only use element-wise operations and jnp.roll()
on CPUs
#18478
opened Oct 18, 2024 by
pmocz
Unexpected speedup from wrapping function call in trivial jax.lax.cond statement
#18440
opened Oct 17, 2024 by
cgiovanetti
Deserialization of executables fails on non-zero ranks when deserializing single-device executable
#18286
opened Oct 14, 2024 by
jaro-sevcik
error: call of overloaded 'TileAssignment(<brace-enclosed initializer list>)' is ambiguous with gcc 10
#18140
opened Oct 10, 2024 by
elistevens
Inadequate memory consumption when using HSDP without gradient accumulation
#18090
opened Oct 9, 2024 by
qGentry
AVX512 quantization (cast from float to uint8) returns wrong results
#17800
opened Oct 1, 2024 by
Flamefire
[XLA:CPU] Support limiting LLVM codegen in Aarch64 and other new x86 instructions
CPU
Related to XLA on CPU
enhancement
New feature or request
#17758
opened Sep 30, 2024 by
penpornk
ElementalIrEmitterExecutionTest.ConvertFloatsToF8E4FN failed -0.0586 vs -0.0625
#17324
opened Sep 18, 2024 by
apivovarov
ElementalIrEmitterExecutionTest.IotaF8E4M3FN - Invalid LLVM IR
#17323
opened Sep 18, 2024 by
apivovarov
Previous Next
ProTip!
Updated in the last three days: updated:>2024-11-20.