[Feature Request] expose `unidirectional` (causal) attribute of GQA #23409

hann-wang · 2025-01-17T04:38:48Z

Describe the feature request

We have a unidirectional attribute in MHA, but it is missing in GQA because most LLMs are causal. Transformer-based Text-to-Image models, on the other hand, are not causal. We should expose this attribute in GQA to help facilitate the deployment of Text-to-Image models.

Describe scenario use case

Text-to-Image generation models (like Stable Diffusion 3) require attention implemented with causal=False.

PR

#23412

The text was updated successfully, but these errors were encountered:

hann-wang added the feature request request for unsupported feature or enhancement label Jan 17, 2025

hann-wang changed the title ~~[Feature Request] expose is_unidirectional (causal) attribute of GQA~~ [Feature Request] expose unidirectional (causal) attribute of GQA Jan 17, 2025

hann-wang mentioned this issue Jan 17, 2025

add unidirectional attribute to GQA #23412

Draft

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature Request] expose `unidirectional` (causal) attribute of GQA #23409

[Feature Request] expose `unidirectional` (causal) attribute of GQA #23409

hann-wang commented Jan 17, 2025 •

edited

Loading

[Feature Request] expose unidirectional (causal) attribute of GQA #23409

[Feature Request] expose unidirectional (causal) attribute of GQA #23409

Comments

hann-wang commented Jan 17, 2025 • edited Loading

Describe the feature request

Describe scenario use case

PR

[Feature Request] expose `unidirectional` (causal) attribute of GQA #23409

[Feature Request] expose `unidirectional` (causal) attribute of GQA #23409

hann-wang commented Jan 17, 2025 •

edited

Loading