Fix the input pixel format when using GPU video encoder #3426

mthrok · 2023-06-08T15:59:37Z

StreamWriter's encoding pipeline looks like the following

convert tensor to AVFrame
pass AVFrame to AVFilter
pass the resulting AVFrame to AVCodecContext (encoder) and AVFormatContext (muxer)

When dealing with CUDA tensor, the AVFilter becomes no-op, as we have not added support for CUDA-compatible filters.

When CUDA frame is passed, the existing solution passes the software pixel format to AVFilter, which issues warning later as what AVFilter sees is AV_PIX_FMT_CUDA.

Since the filter itself is no-op, it functions as expected. But this commit fixes it.

See #3317

pytorch-bot · 2023-06-08T15:59:40Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/audio/3426

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure, 12 Pending, 1 Unrelated Failure

As of commit d8cdced:

NEW FAILURE - The following job has failed:

build (3.8) (gh)

BROKEN TRUNK - The following job failed but were present on the merge base a7fea8a:

👉 Rebase onto the `viable/strict` branch to avoid these failures

unittests-windows-gpu / windows-job (gh)

This comment was automatically generated by Dr. CI and updates every 15 minutes.

StreamWriter's encoding pipeline looks like the following 1. convert tensor to AVFrame 2. pass AVFrame to AVFilter 3. pass the resulting AVFrame to AVCodecContext (encoder) and AVFormatContext (muxer) When dealing with CUDA tensor, the AVFilter becomes no-op, as we have not added support for CUDA-compatible filters. When CUDA frame is passed, the existing solution passes the software pixel format to AVFilter, which issues warning later as what AVFilter sees is AV_PIX_FMT_CUDA. Since the filter itself is no-op, it functions as expected. But this commit fixes it.

facebook-github-bot · 2023-06-08T16:13:15Z

@mthrok has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

facebook-github-bot · 2023-06-08T21:18:45Z

@mthrok has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

facebook-github-bot · 2023-06-09T08:58:05Z

@mthrok merged this pull request in 30afaa9.

github-actions · 2023-06-09T08:58:17Z

Hey @mthrok.
You merged this PR, but labels were not properly added. Please add a primary and secondary label (See https://github.com/pytorch/audio/blob/main/.github/process_commit.py).

Some guidance:

Use 'module: ops' for operations under 'torchaudio/{transforms, functional}', and ML-related components under 'torchaudio/csrc' (e.g. RNN-T loss).

Things in "examples" directory:

'recipe' is applicable to training recipes under the 'examples' folder,
'tutorial' is applicable to tutorials under the “examples/tutorials” folder
'example' is applicable to everything else (e.g. C++ examples)
'module: docs' is applicable to code documentations (not to tutorials).

Regarding examples in code documentations, please also use 'module: docs'.

Please use 'other' tag only when you’re sure the changes are not much relevant to users, or when all other tags are not applicable. Try not to use it often, in order to minimize efforts required when we prepare release notes.

When preparing release notes, please make sure 'documentation' and 'tutorials' occur as the last sub-categories under each primary category like 'new feature', 'improvements' or 'prototype'.

Things related to build are by default excluded from the release note, except when it impacts users. For example:
* Drop support of Python 3.7.
* Add support of Python 3.X.
* Change the way a third party library is bound (so that user needs to install it separately).

facebook-github-bot added the CLA Signed label Jun 8, 2023

mthrok force-pushed the fix-gpu-encoder branch from 302acb4 to d8cdced Compare June 8, 2023 16:13

facebook-github-bot closed this in 30afaa9 Jun 9, 2023

facebook-github-bot added the Merged label Jun 9, 2023

mthrok deleted the fix-gpu-encoder branch June 9, 2023 09:22

mthrok added C++ module: IO improvement labels Jun 9, 2023

mthrok mentioned this pull request Jun 9, 2023

Fix warning of changing video frame properties on the fly #3318

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix the input pixel format when using GPU video encoder #3426

Fix the input pixel format when using GPU video encoder #3426

mthrok commented Jun 8, 2023 •

edited

Loading

pytorch-bot bot commented Jun 8, 2023 •

edited

Loading

facebook-github-bot commented Jun 8, 2023

facebook-github-bot commented Jun 8, 2023

facebook-github-bot commented Jun 9, 2023

github-actions bot commented Jun 9, 2023

Fix the input pixel format when using GPU video encoder #3426

Fix the input pixel format when using GPU video encoder #3426

Conversation

mthrok commented Jun 8, 2023 • edited Loading

pytorch-bot bot commented Jun 8, 2023 • edited Loading

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/audio/3426

❌ 1 New Failure, 12 Pending, 1 Unrelated Failure

facebook-github-bot commented Jun 8, 2023

facebook-github-bot commented Jun 8, 2023

facebook-github-bot commented Jun 9, 2023

github-actions bot commented Jun 9, 2023

Some guidance:

mthrok commented Jun 8, 2023 •

edited

Loading

pytorch-bot bot commented Jun 8, 2023 •

edited

Loading