Tsunami v0.3.0
v0.3.0
Breaking changes:
fit!
returnsnothing
instead of aFitState
object. TheFitState
object can be accessed viatrainer.fit_state
.on_before_pullback
has been removed. Useon_train_batch_start
instead.on_*_batch_start
now receives the batch on device.- Some of the hooks now take more inputs.
Highlights:
-
Now Tsunami uses
MLDataDevices.DeviceIterator
to wrap dataloaders for more efficient device memory management. -
training_step
,validation_step
, andtest_step
can now return a named tuple
for flexibility. One of the fields of the named tuple should beloss
which is used to compute the loss value.
Merged pull requests:
- deprecate return fit_state (#86) (@CarloLucibello)
- more tests for enzyme (#87) (@CarloLucibello)
- bfloat16 initial support (#90) (@CarloLucibello)
- add a flow matching example (#91) (@CarloLucibello)
- changes for v0.3 (#94) (@CarloLucibello)
- update examples + fix callbacks (#96) (@CarloLucibello)
Closed issues: