Support multiple video backends (OpenCV sometimes drops frames resulting in incorrect timecodes) #213

elxy · 2021-04-09T03:08:49Z

Description of Problem & Solution
I want to use FrameTimecode values to drive an ffmpeg process, but the FrameTimecode values differ from what ffmpeg reports.
For the media below, the first 2 scenes detected by the command scenedetect -i Blossoms_at_the_Basin.mp4 detect-content list-scenes -n save-images are:

-----------------------------------------------------------------------
 | Scene # | Start Frame |  Start Time  |  End Frame  |   End Time   |
-----------------------------------------------------------------------
 |      1  |           0 | 00:00:00.000 |         462 | 00:00:19.269 |
 |      2  |         462 | 00:00:19.269 |         635 | 00:00:26.485 |

But the actual end frame number of scene 1 is 508 (counting from 0), not 462.

I think the reason is that VideoCapture drops frames. I suggest using PyAV to read frames, because PyAV exposes the index and pts properties of each decoded frame.
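To put the discrepancy in seconds, here is a minimal sketch that converts both frame numbers to timestamps. It assumes the clip runs at 24000/1001 fps; the thread does not state the rate, this is only inferred from the table above, where frame 462 maps to 00:00:19.269. The helper name frame_to_seconds is made up for illustration.

```python
from fractions import Fraction

# Assumption: 24000/1001 fps (about 23.976), inferred from the scene list
# above, where frame 462 is reported at 00:00:19.269.
FPS = Fraction(24000, 1001)


def frame_to_seconds(frame_number: int, fps: Fraction = FPS) -> float:
    """Convert a zero-based frame number to a timestamp in seconds."""
    return float(frame_number / fps)


reported = frame_to_seconds(462)  # end of scene 1 according to the scene list
actual = frame_to_seconds(508)    # end of scene 1 according to the report
offset = actual - reported

print(f"reported={reported:.3f}s actual={actual:.3f}s offset={offset:.3f}s")
```

Under that assumption, a gap of 46 frames puts a cut passed to ffmpeg almost two seconds early.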

Media Examples:

Blossoms_at_the_Basin.mp4 is the 4K format of https://www.youtube.com/watch?v=WzD_PREISiM

Proposed Implementation:

Here is a demo to read frames with PyAV:

import sys

import av
import cv2
import numpy

from scenedetect.video_manager import compute_downscale_factor


class Video():
    def __init__(self, video):
        self.video = video
        self.container = av.open(video)

        self.stream = self.container.streams.video[0]
        self.width = self.stream.codec_context.width

        def _get_frame_rate(stream: av.video.stream.VideoStream) -> float:
            # Prefer the average frame rate when it is known (it may be None).
            if stream.average_rate:
                return float(stream.average_rate)
            # Otherwise fall back to the inverse of the stream time base.
            if stream.time_base:
                return 1.0 / float(stream.time_base)
            raise ValueError("Unable to determine FPS")

        self.frame_rate = _get_frame_rate(self.stream)

    def frames(self):
        for frame in self.container.decode(video=0):
            yield frame.index, frame.to_ndarray(format='bgra')



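def compute_delta_hsv(i1, i2):
    # Mean absolute difference per pixel, averaged over the H, S and V channels.
    # cv2.split may return a tuple, so convert to a list before reassigning items.
    i1_hsv = list(cv2.split(cv2.cvtColor(i1, cv2.COLOR_BGR2HSV)))
    i2_hsv = list(cv2.split(cv2.cvtColor(i2, cv2.COLOR_BGR2HSV)))
    delta_hsv = [0, 0, 0]
    for i in range(3):
        num_pixels = i1_hsv[i].shape[0] * i1_hsv[i].shape[1]
        # Widen to int32 so the subtraction cannot wrap around in uint8.
        i1_hsv[i] = i1_hsv[i].astype(numpy.int32)
        i2_hsv[i] = i2_hsv[i].astype(numpy.int32)
        delta_hsv[i] = numpy.sum(numpy.abs(i1_hsv[i] - i2_hsv[i])) / float(num_pixels)
    return sum(delta_hsv) / 3.0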


video = Video(sys.argv[1])
threshold = 30.0
factor = compute_downscale_factor(video.width)

last_frame = None
for index, frame in video.frames():
    frame = frame[::factor, ::factor, :3]
    if last_frame is None:
        last_frame = frame
        continue
    hsv = compute_delta_hsv(last_frame, frame)
    if hsv >= threshold:
        print(index)
    last_frame = frame


Breakthrough · 2021-04-09T04:17:45Z

This seems like a good approach, and may solve some other issues (e.g. #93). I need to learn a bit about the overall API to make it compatible with the VideoManager object, e.g. getting the aspect ratio, but it definitely seems feasible (or pass to the VideoManager constructor if you want to use a cv2.VideoCapture or av.video.stream.VideoStream).

Is VideoCapture not using ffmpeg on your system, or using a different version? I'm curious as to why this occurs.

Very interesting, and thank you for the code sample!

Edit: It may be worth supporting several backends for video input, such as decord, and selecting one with a command line parameter.

elxy · 2021-04-09T07:00:33Z

> Is VideoCapture not using ffmpeg on your system, or using a different version? I'm curious as to why this occurs.

VideoCapture also uses ffmpeg (libav) as its backend on my system, but I have no idea why VideoCapture loses frames.

I noticed PyAV because seeking with VideoCapture is slow. PyAV might help speed up VideoManager.seek().

One more suggestion: with PyAV, SceneDetector could detect scenes more accurately or more quickly by using the keyframe property. It may also be possible to control split accuracy without re-encoding (like this).
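As a rough illustration of the keyframe idea: a split that copies the stream without re-encoding can only start cleanly on a keyframe, so one option is to snap each detected cut to the nearest keyframe at or before it. The sketch below assumes the keyframe timestamps have already been collected (for example from PyAV's frame.key_frame and frame.time while decoding). The keyframe list and the helper name snap_to_keyframe are hypothetical; only the two cut times come from the scene list above.

```python
from bisect import bisect_right


def snap_to_keyframe(cut_time, keyframe_times):
    """Return the latest keyframe time at or before cut_time.

    keyframe_times must be sorted in ascending order.
    """
    idx = bisect_right(keyframe_times, cut_time) - 1
    if idx < 0:
        raise ValueError("cut_time precedes the first keyframe")
    return keyframe_times[idx]


# Hypothetical keyframe timestamps in seconds (one keyframe every 5 s).
keyframes = [0.0, 5.0, 10.0, 15.0, 20.0, 25.0]
# Scene boundaries from the scene list above, in seconds.
cuts = [19.269, 26.485]

snapped = [snap_to_keyframe(t, keyframes) for t in cuts]
print(snapped)  # [15.0, 25.0]
```

Snapping backward keeps every frame of the following scene at the cost of a few extra leading frames; whether that trade-off is acceptable depends on the use case.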

Breakthrough · 2021-05-29T02:06:19Z

TODO: Add a command line argument to select the video input library. The current plan is to default to PyAV if it is installed, and otherwise fall back to OpenCV. I will create a separate issue for supporting any other requested IO backends.
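The fallback described in this plan could look roughly like the sketch below. It is not the project's implementation: the function name pick_backend, the backend labels, and the is_installed hook are all hypothetical, and the only library facts used are that PyAV imports as av and OpenCV as cv2.

```python
import importlib.util

# Hypothetical backend labels mapped to the module each one needs.
# Order matters: PyAV is preferred, OpenCV is the fallback.
BACKEND_MODULES = {"pyav": "av", "opencv": "cv2"}


def _module_available(module_name):
    """Return True if the module can be imported in this environment."""
    return importlib.util.find_spec(module_name) is not None


def pick_backend(requested=None, is_installed=_module_available):
    """Pick a backend label, honouring an explicit request if one is given."""
    if requested is not None:
        if requested not in BACKEND_MODULES:
            raise ValueError(f"Unknown backend: {requested}")
        if not is_installed(BACKEND_MODULES[requested]):
            raise RuntimeError(f"Backend {requested} is not installed")
        return requested
    for name, module_name in BACKEND_MODULES.items():
        if is_installed(module_name):
            return name
    raise RuntimeError("No supported video backend is installed")


# Simulate a system where only OpenCV is present.
print(pick_backend(is_installed=lambda module: module == "cv2"))  # opencv
```

Passing the availability check in as a parameter keeps the selection logic testable without having either library installed.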

It may also be possible to use PyAV directly for re-encoding videos, rather than invoking ffmpeg by command line. One major advantage of that approach would be that it could avoid passing timestamps to an external tool, ensuring everything lines up frame-by-frame.

This may also influence how FrameTimecodes work. In particular, different backends could theoretically use different objects with different representations. For now, though, I will probably use what you posted above as a basis to start the transition.
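One possible alternative representation, sketched here purely for illustration, stores a presentation timestamp (pts) together with the stream time base instead of a frame count and frame rate. The class name PtsTimecode and the sample values are hypothetical and not part of PySceneDetect; the time base of 1/24000 is an assumption chosen so that the result matches the first scene boundary listed earlier in the thread.

```python
from dataclasses import dataclass
from fractions import Fraction


@dataclass(frozen=True)
class PtsTimecode:
    """Timestamp expressed as pts * time_base, independent of frame counting."""

    pts: int
    time_base: Fraction

    @property
    def seconds(self) -> Fraction:
        # Exact rational arithmetic avoids accumulating float rounding error.
        return self.pts * self.time_base

    def to_timecode(self) -> str:
        """Format as HH:MM:SS.mmm, the style used in the scene list."""
        total_ms = round(self.seconds * 1000)
        hours, rem = divmod(total_ms, 3_600_000)
        minutes, rem = divmod(rem, 60_000)
        secs, ms = divmod(rem, 1000)
        return f"{hours:02d}:{minutes:02d}:{secs:02d}.{ms:03d}"


# Hypothetical values: pts 462462 with a time base of 1/24000.
tc = PtsTimecode(pts=462462, time_base=Fraction(1, 24000))
print(tc.to_timecode())  # 00:00:19.269
```

Because the timestamp comes from the stream rather than from a running frame counter, a dropped frame would not shift every later timecode.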

Breakthrough · 2021-08-21T20:32:56Z

@elxy did you download the video using youtube-dl? I'll try that on my end when I get the new backend working, but was hoping you could share the exact video format you downloaded (or exact commands you used with youtube-dl).

I plan on starting this as the first major task for the v1.0 refactor as this should also resolve several other linked issues.

elxy · 2021-08-22T02:17:13Z

> @elxy did you download the video using youtube-dl? I'll try that on my end when I get the new backend working, but was hoping you could share the exact video format you downloaded (or exact commands you used with youtube-dl).
>
> I plan on starting this as the first major task for the v1.0 refactor as this should also resolve several other linked issues.

I downloaded the 4K format of https://www.youtube.com/watch?v=WzD_PREISiM with youtube-dl. I just checked, and its 4K format is 313 (webm, vp9).


Breakthrough · 2022-03-11T17:57:20Z

Open items before v0.6 release discovered in #262:

Fix video length calculations (sometimes reports 0)
Ensure image sequences are rejected by the CLI when specifying PyAV backend
Create new issue to use multi-threaded decoding, may be addressed for v0.6.1

Breakthrough · 2022-03-12T04:56:39Z

Complete in v0.6-beta3 including multithreaded decoding.

Breakthrough added improvement status: accepted labels Apr 9, 2021

Breakthrough added this to the v0.6 milestone Apr 9, 2021

Breakthrough mentioned this issue Apr 12, 2021

Don't seek in VideoManager start() unless required #212

Merged

Breakthrough changed the title ~~FrameTimecode is wrong due to dropping frame~~ Support multiple video backends (OpenCV sometimes drops frames resulting in incorrect timecodes) May 29, 2021

Breakthrough added the bug label May 29, 2021

This was referenced May 29, 2021

v1.0 Planned API Changes & Feedback #177

Closed

Incorrect Detection of VP9 Encoded Video #86

Closed

Breakthrough mentioned this issue Aug 14, 2021

Scenedetect fails on mp4 video with multiple audio tracks #179

Closed

Breakthrough mentioned this issue Sep 4, 2021

Dropped scenes if using --copy with split-video #236

Closed

Breakthrough mentioned this issue Sep 25, 2021

Scan stops after 15 / 45 frames for Gopro 7 videos (both AVC and HEVC in .mp4) Breakthrough/DVR-Scan#62

Closed

Breakthrough mentioned this issue Feb 9, 2022

Using an inmemory Bytestream (io.BytesIO object) as videoinput #257

Closed

Breakthrough modified the milestones: v1.0, v0.6 Feb 11, 2022

Breakthrough added in progress and removed status: accepted labels Feb 19, 2022

Breakthrough added a commit that referenced this issue Feb 22, 2022

Implement VideoStreamAv (without seeking for now).

d4c7306

Passes all non-seeking related tests. (#213)

Breakthrough added a commit that referenced this issue Feb 24, 2022

Update TODOs now that PyAV backend is available.

0174d8f

Issues: #213, #257, #258

Breakthrough mentioned this issue Mar 11, 2022

v0.6 Beta Release & Feedback #262

Closed

2 tasks

Breakthrough added a commit that referenced this issue Mar 11, 2022

[backends] Add logging to open_video and TODO as per #213/#262.

f2c4cb3

Breakthrough added a commit that referenced this issue Mar 11, 2022

[VideoStreamAv] Fix duration calculation. #213, #262

2e5e2fb

Breakthrough added a commit that referenced this issue Mar 12, 2022

[VideoStreamAv] Implement multithreaded decoding. #213

3b28522

Breakthrough added status: completed and removed in progress labels Mar 12, 2022

Breakthrough closed this as completed Mar 12, 2022
