Add Mega: Moving Average Equipped Gated Attention#21766
Merged
sgugger merged 66 commits intohuggingface:mainfrom Mar 24, 2023
Merged
Add Mega: Moving Average Equipped Gated Attention#21766sgugger merged 66 commits intohuggingface:mainfrom
sgugger merged 66 commits intohuggingface:mainfrom
Commits
Commits on Feb 8, 2023
Commits on Feb 10, 2023
Commits on Feb 13, 2023
Commits on Feb 14, 2023
refactored MovingAverageGatedAttention to remove stateful k/v history and use unified attention mask
committed- committed
- committed
- committed
- committed
Commits on Feb 16, 2023
Commits on Feb 17, 2023
- committed
- committed
- committed
- committed
- committed
- committed
Commits on Feb 21, 2023
- committed
- committed
- committed
- committed
- committed
Commits on Feb 22, 2023
- committed
- committed
- committed
- committed
Commits on Feb 23, 2023
- committed
- committed
- committed
- committed
- committed
- committed
Commits on Mar 3, 2023
- andauthored
Commits on Mar 7, 2023
- committed
- committed
- committed
- committed
Commits on Mar 8, 2023
- committed
- committed
- committed
- committed
- committed
- committed
Commits on Mar 15, 2023
- andauthored
Commits on Mar 17, 2023
- committed
- committed
- committed
- committed
- committed
Commits on Mar 22, 2023
Commits on Mar 23, 2023
- andauthored
- andauthored
- committed
- committed
- committed