Prophetnet optimization #9453

guillaume-be · 2021-01-07T09:12:33Z

What does this PR do?

This PR proposes an optimization for the ProphetNet model. The current implementation calculates an attention bias mask by looping through the position to unmask. It performs a high number of assignments (ngram * sequence_length) which can be in the order of ~1000. Single tensor assignments, especially on accelerators, are inefficient.

This PR proposes a vectorized implementation which performs at most ngram assignments, which should be significantly lower than ngram * sequence_length.

A quick experiment shown at https://gist.github.com/guillaume-be/e6b862c701fac1b54765e7af7e71c641 shows that:

this ngram_attention_bias calculation is very expensive, taking close to 230ms (!) on a GPU
the vectorized implementation is several orders of magnitude faster (the same calculation takes less than 1ms on the same example)

Who can review?

@patrickvonplaten maybe you would be a good candidate? I could not find anyone assigned for ProphetNet

edit: pushed some further optimization, further accelerating by ~40%

src/transformers/models/prophetnet/modeling_prophetnet.py

patrickvonplaten

Thanks a lot @guillaume-be ! Very nice to remove the for i in range(sequence_length) call -> this does indeed look very suboptimal.

I'm fine with the PR! When all changes are made I'll run the slow tests of ProphetNet locally to make sure nothing broke unexpectantly and merge then :-)

patrickvonplaten · 2021-01-07T10:41:55Z

All slow tests are passing! Very nice PR - thanks a mille @guillaume-be

guillaume-be added 3 commits January 7, 2021 10:00

Vectorized ngram_attention_bias calculation

561abb8

updated formatting with black

287f9cb

Further optimization

3becee4

patrickvonplaten reviewed Jan 7, 2021

View reviewed changes

src/transformers/models/prophetnet/modeling_prophetnet.py Outdated Show resolved Hide resolved

one (last) optimization

15542f5

patrickvonplaten reviewed Jan 7, 2021

View reviewed changes

patrickvonplaten merged commit 390cf16 into huggingface:master Jan 7, 2021

patrickvonplaten mentioned this pull request Mar 9, 2021

[Wav2Vec2] Improve SpecAugment function by converting numpy based fun… #10494

Closed

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Prophetnet optimization #9453

Prophetnet optimization #9453

Uh oh!

guillaume-be commented Jan 7, 2021 •

edited

Loading

Uh oh!

Uh oh!

patrickvonplaten left a comment

Uh oh!

patrickvonplaten commented Jan 7, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Prophetnet optimization #9453

Prophetnet optimization #9453

Uh oh!

Conversation

guillaume-be commented Jan 7, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Who can review?

Uh oh!

Uh oh!

patrickvonplaten left a comment

Choose a reason for hiding this comment

Uh oh!

patrickvonplaten commented Jan 7, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

guillaume-be commented Jan 7, 2021 •

edited

Loading