Bump transformers to 4.25.1 #151
Conversation
huggingface-hub==0.11.1
transformers==4.25.1
protobuf>=3.20.3,<4.0dev
hivemind==1.1.3
Also going to bump it, but that's a separate PR.
@@ -0,0 +1,74 @@
"""
This file is not new: it was renamed from model.py, but git does not recognize the rename in the diff.
borzunov left a comment
We've found some bugs, so this is pending their resolution.
for i in range(0, num_embeddings, self.chunk_size):
    chunk = word_embeddings[i : i + self.chunk_size].float()
    output[..., i : i + self.chunk_size] = F.linear(hidden_states, chunk)
Not sure if this is worth doing, but maybe you can do torch.matmul(hidden_states, chunk, out=output[..., i : i + self.chunk_size]) to avoid allocating memory for the intermediate result?
Tried to do the same thing, but to no avail (see the sketch below):
- On GPU, F.linear appears to have better support for some optimizations like TF32 (enabled by default).
- On CPU, this has no effect.
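For reference, a minimal runnable sketch of the two variants being compared. Only the loop body and the F.linear-vs-matmul trade-off come from the diff and comments above; the chunked_lm_head wrapper, tensor sizes, and chunk_size value are made up for illustration.

```python
import torch
import torch.nn.functional as F

def chunked_lm_head(hidden_states: torch.Tensor, word_embeddings: torch.Tensor, chunk_size: int) -> torch.Tensor:
    """Compute LM-head logits chunk by chunk to bound peak memory (illustrative wrapper)."""
    num_embeddings = word_embeddings.shape[0]
    output = torch.empty(*hidden_states.shape[:-1], num_embeddings, dtype=torch.float32)
    for i in range(0, num_embeddings, chunk_size):
        chunk = word_embeddings[i : i + chunk_size].float()
        # Variant kept in the PR: F.linear benefits from GPU optimizations such as TF32.
        output[..., i : i + chunk_size] = F.linear(hidden_states, chunk)
        # Suggested alternative, writing into the output slice directly to skip the
        # intermediate allocation; per the discussion above, it gave no speedup in practice:
        # torch.matmul(hidden_states, chunk.t(), out=output[..., i : i + chunk_size])
    return output

hidden = torch.randn(2, 8, 512)        # [batch, seq, hidden_size] (made-up sizes)
embeddings = torch.randn(50_000, 512)  # [vocab_size, hidden_size]
logits = chunked_lm_head(hidden, embeddings, chunk_size=4096)
print(logits.shape)                    # torch.Size([2, 8, 50000])
```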
key_past = key_cache.flatten(0, 1)[:, :, :prefix_length]  # [batch * num_heads, head_dim, kv_length]
value_past = value_cache.flatten(0, 1)[:, :prefix_length, :]  # [batch * num_heads, kv_length, head_dim]
Can't you just directly reshape the past tensors to these shapes like you've done in src/petals/server/handler.py?
Nope, we can't:
- hypo_ids need the cache to have shape [2, batch_size, ...]
- training needs the key as [batch_size * heads, ..., length] and the value as [..., length, :], which makes them non-concatenable
- the handler needs them to be concatenable into a single tensor
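A shape-only toy version of the two cache views from the diff above may make the layouts clearer. The names key_cache/value_cache and the commented output shapes come from the diff; the concrete sizes and the zero-filled tensors are placeholders.

```python
import torch

batch, num_heads, head_dim, max_length, prefix_length = 2, 4, 8, 16, 5

# Keys are cached transposed ([..., head_dim, max_length]) while values keep
# [..., max_length, head_dim], matching the shapes in the comments above.
key_cache = torch.zeros(batch, num_heads, head_dim, max_length)
value_cache = torch.zeros(batch, num_heads, max_length, head_dim)

key_past = key_cache.flatten(0, 1)[:, :, :prefix_length]      # [batch * num_heads, head_dim, kv_length]
value_past = value_cache.flatten(0, 1)[:, :prefix_length, :]  # [batch * num_heads, kv_length, head_dim]

print(key_past.shape, value_past.shape)
# torch.Size([8, 8, 5]) torch.Size([8, 5, 8])
```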
Co-authored-by: Max Ryabinin <[email protected]>