Removed custom bidirectional layer as it is not needed when using the Llama attention_masks b57b92e verified Ruurd committed 2 days ago
Create safe fallback for models not yet initialized with masking_type f2ca6a6 verified Ruurd committed 20 days ago
Overhaul code for appropriate masking for the full model instead of just the attention layers b43e862 verified Ruurd committed 20 days ago
Implement improved attention masking for bidirectional_masked 1723639 verified Ruurd committed 20 days ago
Change LoRA size from 256 to 512, also back to bidirectional_masked 620a6cd verified Ruurd committed 23 days ago
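Commits b57b92e and 1723639 suggest that bidirectional attention is now driven entirely through the mask handed to the stock Llama layers rather than a custom layer. A minimal sketch of what that could look like, assuming a transformers version that accepts a pre-built 4D additive mask via attention_mask; the function name and tensor layout are illustrative, not the repo's actual code:

```python
import torch

def bidirectional_4d_mask(padding_mask: torch.Tensor, dtype: torch.dtype) -> torch.Tensor:
    """Build a (batch, 1, seq, seq) additive mask with no causal triangle.

    padding_mask: (batch, seq) with 1 for real tokens, 0 for padding.
    Returns 0.0 where attention is allowed and dtype-min where it is blocked,
    so every token can attend to every non-padding token in both directions.
    """
    bsz, seq_len = padding_mask.shape
    keep = padding_mask[:, None, None, :].to(dtype)   # (bsz, 1, 1, seq)
    keep = keep.expand(bsz, 1, seq_len, seq_len)      # broadcast over query positions
    return (1.0 - keep) * torch.finfo(dtype).min

# e.g. model(input_ids, attention_mask=bidirectional_4d_mask(pad_mask, model.dtype))
```

How such a 4D mask is consumed varies between transformers versions and attention backends (eager vs. SDPA vs. flash attention), so the exact call site in the repo may differ.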
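Commit f2ca6a6 describes a defensive default for checkpoints saved before masking_type existed in the config. One common way to write that fallback, sketched here with an assumed default of "causal" and an illustrative set of valid values:

```python
# Assumed shape of the fallback: older configs without `masking_type`
# silently fall back to ordinary causal masking instead of raising.
masking_type = getattr(config, "masking_type", "causal")
if masking_type not in ("causal", "bidirectional", "bidirectional_masked"):
    raise ValueError(f"Unknown masking_type: {masking_type!r}")
```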
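For 620a6cd, assuming "LoRA size" refers to the adapter rank r, a PEFT configuration along these lines would capture the change; the target modules, alpha, and dropout are placeholders rather than the repo's actual settings:

```python
from peft import LoraConfig, get_peft_model

lora_config = LoraConfig(
    r=512,                      # raised from 256 per the commit message
    lora_alpha=512,             # assumption: alpha scaled with the rank
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],  # placeholder list
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)
model = get_peft_model(base_model, lora_config)  # base_model: the Llama backbone
```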