Makes sense! I hate strict requirements myself, but for attention_sinks I'm somewhat forced into it. This project works by overriding the forward method of the ...Attention classes in transformers, which are often updated, even in minor versions. Any mismatch can cause failures, so I can really only support one transformers version at a time. At least, until huggingface/transformers#26681 is merged and Attention Sinks can be implemented that way.
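To illustrate why the pin is strict, here is a rough sketch of the override pattern. It is not the actual attention_sinks code: `ToyAttention`, `patched_forward`, `apply_patch`, and `SUPPORTED_VERSION` are all hypothetical names made up for this example, and the toy class only stands in for a real `...Attention` class.

```python
import types

# Hypothetical pin: the single upstream version the patch was written against.
SUPPORTED_VERSION = "1.0.0"


class ToyAttention:
    """Hypothetical stand-in for an upstream `...Attention` class."""

    def forward(self, hidden_states):
        # Upstream behaviour, which may change between releases.
        return [2 * x for x in hidden_states]


def patched_forward(self, hidden_states):
    # Replacement logic, written against one specific upstream forward().
    return [x + 1 for x in hidden_states]


def apply_patch(module, installed_version):
    # Refuse to patch if the installed version is not the one the
    # replacement was written for, since signatures may have changed.
    if installed_version != SUPPORTED_VERSION:
        raise RuntimeError(
            f"patch targets {SUPPORTED_VERSION}, found {installed_version}"
        )
    # Bind the replacement to this instance only; the class is untouched.
    module.forward = types.MethodType(patched_forward, module)
    return module
```

Calling `apply_patch(ToyAttention(), "1.0.0")` swaps in the replacement, while any other version string raises instead of silently running a replacement that may no longer match the upstream method.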
Makes it hard to upgrade. Thanks!