v0.16.0
Highlights
@mx.custom_function
for customvjp
/jvp
/vmap
transforms- Up to 2x faster Metal GEMV and fast masked GEMV
- Fast
hadamard_transform
Core
- Metal 3.2 support
- Reduced CPU binary size
- Added quantized GPU ops to JIT
- Faster GPU compilation
- Added grads for bitwise ops + indexing
Bug Fixes
- 1D scatter bug
- Strided sort bug
- Reshape copy bug
- Seg fault in
mx.compile
- Donation condition in compilation
- Compilation of accelerate on iOS