v1.1.0
Metal v1.1.0
Merged pull requests:
- Add resize! (#279) (@mtfishman)
- Initial MTLTexture support (#280) (@christiangnrd)
- Avoid redundant pointer conversions for threadgroup memory. (#283) (@maleadt)
- Re-implement metallib generation in Julia. (#284) (@maleadt)
- CompatHelper: add new compat entry for SHA at version 0.7, (keep existing compat) (#286) (@github-actions[bot])
- Support more of the metallib format (#288) (@maleadt)
- Address potentially buggy mtl behaviour. (#290) (@christiangnrd)
- CompatHelper: add new compat entry for CodecBzip2 at version 0.8, (keep existing compat) (#292) (@github-actions[bot])
- Remove an unneeded pointer method. (#293) (@maleadt)
- Use NSAutoreleasePool to clean up memory. (#294) (@maleadt)
- adapt_storage-related improvements (#296) (@christiangnrd)
- CompatHelper: bump compat for ObjectiveC to 2, (keep existing compat) (#297) (@github-actions[bot])
- Add support for signposts (#300) (@maleadt)
- Retain NSError we rethrow to avoid a UAF. (#302) (@maleadt)
- Minor mapreduce improvements (#303) (@maleadt)
- Specialize broadcast to avoid integer divisions. (#304) (@maleadt)
- Better Support for Unified Memory (#305) (@tgymnich)
- Add 1.11 CI (#306) (@christiangnrd)
- Remove unused files (#307) (@tgymnich)
- Skip profiling tests on macOS 14.4/M1. (#310) (@maleadt)
- Increase test timeout limit to accommodate 1.8 (#311) (@christiangnrd)
- Test all storage modes (#314) (@christiangnrd)
- Fix doctests (#315) (@christiangnrd)
- Fix KernelAbstractions for Unified Memory (#316) (@tgymnich)
- CompatHelper: add new compat entry for Preferences at version 1, (keep existing compat) (#318) (@github-actions[bot])
- Minor cleanup (#319) (@christiangnrd)
- Create MtlArray using memory allocated by Array (#320) (@christiangnrd)
- Re-enable profiling tests on M1/14.4 when using Xcode 15.3. (#322) (@maleadt)
- Small typo and doc fixup (#325) (@christiangnrd)
- BFloat16s.jl extension and related improvements (#326) (@christiangnrd)
- Support for Julia 1.11 (#327) (@maleadt)
Closed issues:
- Validation-related back-end crash on macOS Ventura (#34)
- slow broadcast copy in 2D (#41)
- Poor performance of mapreduce (#46)
- Multiplication with SubArrays (#47)
- Add support to creating MtlArray using a memory allocated by Array (#62)
- Improve use of unified memory (#86)
- Use Autoreleasepools with Metal (#103)
- Unknown RFLT tag generated by macOS 13 Metal compiler (#167)
- mapreduce allocates a lot on the CPU (#211)
- Legalization errors with vectorized code (#257)
- Compilation Failure due to undefined symbols (#276)
- resize!, append! not defined (#277)
- tag new version (#278)
- Panic during profiling tests on 14.4 beta (#281)
- M3 backend cannot handle atomics with complicated pointer conversions (#282)
- Int128 does not compile (#287)
- Two suspicious mtl-related behaviours (#289)
- LU factorization: add allowsingular keyword argument (#299)
- Autorelease changes lead to use after free with errors (#301)
- Reductions don't work on Shared Arrays (#312)