v0.4.0
Metal v0.4.0
Closed issues:
- Restore mtlcall (#17)
- mapreduce has poor performance (#87)
- Native code reflection (#95)
- rand! with Bools sometimes fails in tests in 1.9 (#141)
- LLVM assertion failures (#153)
- Time macro similar to CUDA.@time (#160)
- bug in rand!? (#162)
- Why not support threadIdx().x, blockIdx().x, blockDim().x etc? (#163)
- Incorrect(?) darwin version in 1.8 with
Metal.versioninfo()
(#179)
Merged pull requests:
- Add native code reflection. (#96) (@maleadt)
- Move MPSKernels into a dedicated file (#155) (@tgymnich)
- [LU decomposition] Fix types (#156) (@tgymnich)
- Update manifest (#161) (@github-actions[bot])
- Implement Time macro (#164) (@christiangnrd)
- Fix some references to CUDA (#165) (@christiangnrd)
- Fix GPUArrays RNG interface implementation. (#166) (@maleadt)
- Bump the LLVM back-end. (#169) (@maleadt)
- Update manifest (#170) (@github-actions[bot])
- Update manifest (#171) (@github-actions[bot])
- Update manifest (#172) (@github-actions[bot])
- Bump GPUCompiler to v0.20 (#173) (@christiangnrd)
- Detect mapreduce threadgroup limits instead of guessing. (#176) (@maleadt)
- Remove reference to no longer used library in README.md (#177) (@christiangnrd)
- Report package versions as part of versioninfo() (#180) (@christiangnrd)
- Fix Darwin version indentification (#181) (@christiangnrd)
- Topk for MPSMatrix (#182) (@christiangnrd)
- Update manifest (#183) (@github-actions[bot])
- Don't rely on thread adoption for command buffer callbacks. (#184) (@maleadt)