Fix attention_weights referenced before assigned bug
llama_diffusion_model.py  CHANGED  (+2, -1)
@@ -51,7 +51,8 @@ class BidirectionalLlamaAttention(LlamaAttention):
         # if attention_mask is not None:
         #     causal_mask = attention_mask[:, :, :, : key_states.shape[-2]]
         #     attn_weights = attn_weights + causal_mask
-
+
+        attn_weights = torch.matmul(query, key_states.transpose(2, 3)) * scaling
         if attention_mask is not None:
             # Convert bool -> float with -inf where masked
             attn_mask = attention_mask.masked_fill(~attention_mask, float('-inf')).to(query.dtype)
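
For context, a minimal sketch of the pattern the fix restores: the attention scores are computed before the masking branch, so attn_weights is bound on every path and can no longer be "referenced before assigned" when attention_mask is None. Only the torch.matmul(query, key_states.transpose(2, 3)) * scaling line mirrors the added code; the function name, tensor shapes, boolean-mask handling, and the softmax/value step below are illustrative assumptions, not the Space's implementation.

import torch

def attention_sketch(query, key_states, value_states, attention_mask=None):
    # query, key_states, value_states: (batch, heads, seq_len, head_dim); hypothetical shapes.
    scaling = query.shape[-1] ** -0.5
    # Scores are computed unconditionally, so attn_weights exists even when
    # attention_mask is None; this is the path the commit repairs.
    attn_weights = torch.matmul(query, key_states.transpose(2, 3)) * scaling
    if attention_mask is not None:
        # Bool mask (True = attend): masked positions get -inf so softmax zeroes them out.
        attn_weights = attn_weights.masked_fill(~attention_mask, float('-inf'))
    attn_weights = torch.softmax(attn_weights, dim=-1)
    return torch.matmul(attn_weights, value_states)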