add capability to output intermediate hidden states #451

sarahwie · 2024-02-13T06:38:53Z

Adds the output_hidden_states functionality that HF model forward calls have.
A very similar update could add the output_attentions functionality, though I haven't written it yet.

Note that the HF code "appends" hidden states to a tuple object, but I accumulate them in a list, as I think that's better practice. The function output is therefore cast to a tuple to match HF's expected output convention.

See allenai#446.
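To illustrate the list-then-tuple pattern described above, here is a minimal sketch. It is not the code in this PR: the `forward` function, the `Hidden` alias, and the toy layers are hypothetical, and plain Python lists stand in for tensors so the example runs without PyTorch. It assumes the HF-style layout of one entry per layer input plus the final output.

```python
from typing import Callable, List, Optional, Sequence, Tuple

# Stand-in for a tensor so the sketch runs without PyTorch (assumption).
Hidden = List[float]


def forward(
    x: Hidden,
    layers: Sequence[Callable[[Hidden], Hidden]],
    output_hidden_states: bool = False,
) -> Tuple[Hidden, Optional[Tuple[Hidden, ...]]]:
    # Accumulate in a list; "appending" to a tuple rebuilds it on every step.
    all_hidden_states: List[Hidden] = []
    for layer in layers:
        if output_hidden_states:
            all_hidden_states.append(x)  # input to this layer
        x = layer(x)
    if output_hidden_states:
        all_hidden_states.append(x)  # output of the last layer
    # Cast once at the end so callers receive the tuple they expect.
    hidden_states = tuple(all_hidden_states) if output_hidden_states else None
    return x, hidden_states


def double(h: Hidden) -> Hidden:
    return [2 * v for v in h]


def add_one(h: Hidden) -> Hidden:
    return [v + 1 for v in h]


out, states = forward([1.0, 2.0], [double, add_one], output_hidden_states=True)
```

With two layers, `states` holds three entries (the input, the output of the first layer, and the final output), and it is `None` when the flag is left off.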

natolambert · 2024-02-13T16:35:51Z

Have you tested this @sarahwie ? Looks close to what I would expect but I don't use the OLMo repo much :)

AkshitaB · 2024-02-13T19:31:16Z

Hi @sarahwie this is great! Do you mind also adding the use of the two flags to the HF wrapper here: https://github.com/allenai/OLMo/blob/main/hf_olmo/modeling_olmo.py#L48
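For context, a wrapper that forwards such flags might look roughly like the sketch below. Every name here (`InnerModel`, `InnerOutput`, `Wrapper`, `default_output_hidden_states`) is hypothetical and not taken from hf_olmo/modeling_olmo.py; the sketch only shows a flag being resolved in the wrapper, passed to the inner forward call, and the result surfaced to the caller.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple

Hidden = List[float]  # stand-in for a tensor (assumption)


@dataclass
class InnerOutput:
    logits: Hidden
    hidden_states: Optional[Tuple[Hidden, ...]] = None


class InnerModel:
    """Hypothetical inner model with a single toy layer."""

    def forward(self, x: Hidden, output_hidden_states: bool = False) -> InnerOutput:
        h = [v + 1.0 for v in x]
        states = (x, h) if output_hidden_states else None
        return InnerOutput(logits=h, hidden_states=states)


class Wrapper:
    """Hypothetical HF-style wrapper that forwards the flag to the inner model."""

    def __init__(self, model: InnerModel, default_output_hidden_states: bool = False):
        self.model = model
        self.default_output_hidden_states = default_output_hidden_states

    def forward(self, x: Hidden, output_hidden_states: Optional[bool] = None) -> Dict[str, object]:
        # Fall back to the configured default when the caller does not say.
        if output_hidden_states is None:
            output_hidden_states = self.default_output_hidden_states
        out = self.model.forward(x, output_hidden_states=output_hidden_states)
        # Surface the inner result instead of dropping it.
        return {"logits": out.logits, "hidden_states": out.hidden_states}


wrapped = Wrapper(InnerModel())
with_states = wrapped.forward([1.0], output_hidden_states=True)
without_states = wrapped.forward([1.0])
```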

sarahwie · 2024-02-14T01:44:50Z

I've tested this locally but haven't rebuilt the pip package. Let me try that to make sure it still works as expected, @natolambert.

Co-authored-by: 玄钛 <[email protected]> Co-authored-by: Pete <[email protected]> Co-authored-by: epwalsh <[email protected]>

sarahwie · 2024-02-16T00:51:37Z

Temporarily added code to throw an error if output_attentions=True (#449) since that functionality hasn't been coded yet as @natolambert pointed out.
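A guard of this kind could be sketched as follows. The helper name `check_output_flags`, the exception type, and the message are assumptions for illustration, not necessarily what the commit uses.

```python
from typing import Optional


def check_output_flags(
    output_attentions: Optional[bool] = None,
    output_hidden_states: Optional[bool] = None,
) -> bool:
    """Hypothetical helper: reject the unimplemented flag, resolve the other."""
    if output_attentions:
        # Fail loudly rather than silently returning no attentions.
        raise ValueError("output_attentions is not yet supported")
    return bool(output_hidden_states)


try:
    check_output_flags(output_attentions=True)
    raised = False
except ValueError:
    raised = True
```

Raising here means a caller who asks for attentions gets an explicit failure instead of a quiet `None`.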

Tested this, and everything works as expected. Should I merge?

natolambert · 2024-02-16T17:13:49Z

@sarahwie probably, but I'm not a core contributor. If you changed a bunch more, you can ask @AkshitaB to review the recent changes.

sarahwie and others added 6 commits February 12, 2024 22:34

add capability to output intermediate hidden states

df19064

Pass input embeddings from HF OLMo to inner model forward

e129014

Use input embeddings instead of input ids when provided

3da61c4

Run Ruff

37f43be

Update Changelog

aab4607

Require Python>=3.9 for now

2f58100


sarahwie mentioned this pull request Feb 13, 2024

Make clear that output_hidden_states and output_attentions aren't implemented #449

Closed

sarahwie added 3 commits February 12, 2024 23:07

black format, improving datatype, & mypy typing fix

e8081d9

add docstring and changelog update

d271f00

Merge branch 'main' into main

2fdb23c

AkshitaB approved these changes Feb 14, 2024


hxdtest and others added 3 commits February 15, 2024 16:10

Add support for Python 3.8 (allenai#448)

904c740

Co-authored-by: 玄钛 <[email protected]> Co-authored-by: Pete <[email protected]> Co-authored-by: epwalsh <[email protected]>

throw error for output_attentions flag (allenai#449)

a8a955a

Merge branch 'main' into main

0b0cb1d

sarahwie mentioned this pull request Feb 16, 2024

Output Hidden States seems to return None on forward pass of OLMO model #447

Closed

sarahwie merged commit 7f7abbb into allenai:main Feb 16, 2024
11 checks passed
