
Training added on top of flux_impl #147

Draft · wants to merge 36 commits into main

Conversation

@ksikiric (Contributor) commented Feb 12, 2025

Linked to #146

I've added the training code on top of https://github.com/AI-Hypercomputer/maxdiffusion/tree/flux_impl. This PR is meant to be merged after #146.

Along with the training code, I have also added a pipeline for Flux, which can be used for inference as well.

google-cla bot commented Feb 12, 2025

Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

View this failed invocation of the CLA check for more information.

For the most up to date status, view the checks section at the bottom of the pull request.

@entrpn entrpn mentioned this pull request Feb 13, 2025
@ksikiric ksikiric force-pushed the kris/flux-impl-training branch from ba2d028 to f56234e Compare February 13, 2025 12:11
@ksikiric (Contributor, Author) commented Feb 13, 2025

Background in #148

@entrpn, I've rebased on flux_lora and aligned the pipeline with the changes you made to generate_flux.py. Inference is working as expected, but I am a bit suspicious about the training. Please try it out, and let's discuss how to move forward with this.

In the meantime, I will prepare another PR adding FA for GPUs, similar to how it is done in maxtext.

@ksikiric (Contributor, Author) commented

Hi @entrpn, have you had a chance to test this PR? I think we can try to merge it soon if it looks alright to you.

@entrpn (Collaborator) commented Feb 19, 2025

> Hi @entrpn, have you had a chance to test this PR? I think we can try to merge this soon if you think it looks alright

I started to take a look at it. The pipeline fails for me during the data pipeline due to memory constraints in my environment: during text encoding, I get an OOM. The code will need to be refactored so that this can run on CPU or with at most 32 GB of accelerator memory (preferably 16), since the T5 encoder cannot be sharded at the moment. I remember solving something similar before by batching the captions in the data pipeline. I can take a look at it next week and try to get that part working.
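The caption-batching idea above can be sketched roughly as follows. This is a minimal, hypothetical illustration, not maxdiffusion's actual data-pipeline code: `encode_fn` stands in for the (unsharded) T5 encoder call, and all names and the batch size are assumptions. The point is simply that encoding captions in small chunks bounds peak memory instead of materializing the whole dataset's encoder inputs at once.

```python
# Sketch: batch captions through the text encoder to bound peak memory.
# All names here (batched, encode_captions, encode_fn) are illustrative.
from typing import Callable, Iterator, List, Sequence


def batched(items: Sequence[str], batch_size: int) -> Iterator[Sequence[str]]:
    """Yield fixed-size slices of `items`; the last slice may be smaller."""
    for start in range(0, len(items), batch_size):
        yield items[start:start + batch_size]


def encode_captions(
    captions: Sequence[str],
    encode_fn: Callable[[Sequence[str]], List[List[float]]],
    batch_size: int = 8,
) -> List[List[float]]:
    """Run the text encoder chunk by chunk so only `batch_size` captions
    are resident in accelerator memory at any time."""
    embeddings: List[List[float]] = []
    for chunk in batched(captions, batch_size):
        embeddings.extend(encode_fn(chunk))
    return embeddings


# Toy stand-in encoder: maps each caption to a trivial one-dim "embedding".
toy_encode = lambda chunk: [[float(len(c))] for c in chunk]
print(encode_captions(["a cat", "a dog on a mat", "sunset"], toy_encode, batch_size=2))
```

In the real pipeline the chunked embeddings would be written out (or streamed into the training batches) rather than accumulated in one Python list, but the chunking structure is the same.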

3 participants