Add get_image method to Stream #61918
Conversation
Hey there @hunterjm, @allenporter, mind taking a look at this pull request as it has been labeled with an integration (
Actually this crashed once for me.
    # decode packet (try up to 3 times)
    for _i in range(3):
        # pylint: disable=maybe-no-member
        if frames := last_packet_or_image.decode():
Do you have details of the crash? I assume it's in turbojpeg, but I was curious about this step. I was thinking maybe this step is the problem, since it's produced by the worker thread but run in the main loop. However, reading https://pyav.org/docs/develop/cookbook/basics.html#threading didn't necessarily give me the impression this decode step could be decoupled, or that it's cheap enough that we can do it all the time. That is, it seems both heavyweight and tied to worker state.
How often are there keyframes?
(I was thinking through an alternative where the Stream asks the worker for a frame and it produces it on demand as it iterates, which is a little more complex from an orchestration perspective, but maybe less complex from a thread safety perspective if it's just passing off the final bytes between threads.)
No, there wasn't much in my docker logs - I think the process exited with a 256 error.
After looking at a discussion about threading in libjpeg-turbo, I don't think it's from that side. It might be from the Python wrapper, but there's a good chance it's from the PyAV side.
I haven't been able to repro. It happened when I had 10+ cameras on the same page pulling images every 10 seconds. I don't have a separate dev instance so I took off those settings and only have it pulling on one camera right now, and I haven't seen any problems.
Keyframes depend on the input stream. Generally they're every one second or every few seconds, but some of the modified codecs like H264+/H265+ use really long intervals, like 20 seconds, to save on bandwidth. I use those myself, and I don't mind the lag. With regular keyframe intervals this is not a concern, but for long keyframe intervals an alternative implementation is to wait for the next keyframe instead of using the last one, resulting in a longer time to serve the first request but a shorter "lag" between the displayed picture and real time.
We could try to move it to the worker, and that might solve a threading issue if we have one, but we'd have to add the overhead of worker-to-core events/signaling. Currently the only inter-thread interaction is when the packet is set, but I think that is atomic.
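The worker-to-core handoff being discussed can be sketched with the standard asyncio pattern for delivering a result from a plain thread to the event loop. This is not the PR's actual code, just a minimal stdlib illustration of the shape of that interaction; the names and the `b"jpeg-bytes"` stand-in are hypothetical.

```python
import asyncio
import threading


def worker(loop: asyncio.AbstractEventLoop, fut: "asyncio.Future[bytes]") -> None:
    """Worker thread: stand-in for the decode + jpeg encode step."""
    image = b"jpeg-bytes"  # placeholder for the encoded keyframe
    # call_soon_threadsafe is the safe way to touch loop state from another thread
    loop.call_soon_threadsafe(fut.set_result, image)


async def get_image() -> bytes:
    """Main loop side: hand the worker a future and await the bytes."""
    loop = asyncio.get_running_loop()
    fut: "asyncio.Future[bytes]" = loop.create_future()
    threading.Thread(target=worker, args=(loop, fut)).start()
    return await fut


print(asyncio.run(get_image()))  # b'jpeg-bytes'
```

Only the final bytes cross the thread boundary here, which is the property that makes this simpler to reason about than sharing PyAV objects between threads.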
One more feature we could add is options for quality/scaling, which could help speed up the encode and implement the width/height parameters used in Camera.async_camera_image.
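For the width/height option, the interesting part is mapping the caller's requested bounds onto the source frame. Here is a hypothetical helper (the name `scaled_size` and its behavior of preserving aspect ratio and never upscaling are my assumptions, not something from the PR):

```python
from typing import Optional, Tuple


def scaled_size(
    src_w: int,
    src_h: int,
    width: Optional[int] = None,
    height: Optional[int] = None,
) -> Tuple[int, int]:
    """Fit (src_w, src_h) into the requested bounds, keeping aspect ratio.

    Missing bounds are unconstrained; the frame is never upscaled.
    """
    ratios = [1.0]  # never exceed the source size
    if width is not None:
        ratios.append(width / src_w)
    if height is not None:
        ratios.append(height / src_h)
    ratio = min(ratios)
    return max(1, round(src_w * ratio)), max(1, round(src_h * ratio))


print(scaled_size(1920, 1080, width=640))  # (640, 360)
```

The resulting dimensions would then be passed to the frame reformat/encode step in the worker.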
Moved the encoding into the worker. At first I played around with waiting for the next keyframe before returning the image, which will result in marginally less "lag" between the returned image and real time, but I ended up reverting to the last keyframe. The reason is twofold:
I think this is a very smart compromise to get the prior keyframe, and seems like the correct answer. Overall, this approach of having the worker thread do the work is a little more complicated given the extra cooperation, but the code structure hides the complexity well. I think it's worthwhile given the robustness.
    async def get_image(self) -> bytes:
        """Fetch an image from the Stream and return it as a jpeg in bytes."""
        return await self._keyframe_converter.get_image()
I feel like this deserves a health check before running, e.g. is the worker thread started? Maybe also pre-checking self.available? Or do you consider this the caller's responsibility?
(Either way, I assume we'll want something faster than a 60-second timeout, which seems more like a failsafe in case things get stuck.)
I initially thought the caller should be responsible for this. It seems wasteful to have to check it every time, and the caller should generally know the status of the stream.
However, it might be easier to take care of all this here in stream than to have platforms try to manage things. The problem then becomes managing the stream when given these image calls. I already saw a previous request to be able to lower the default stream timeout, and I'm worried that introducing this separate use case for stream will bring a separate list of demands, including a customizable stream timeout.
Yes, the 60-second timeout was based on waiting for the next keyframe, which could be some time in the future with certain codecs/settings. I'm not sure exactly what we should drop it to, though. If this were purely image generation, something like 0.5 seconds would probably work. However, this is the time from the request until the worker replies, which requires the worker to have gotten a new packet from its input. For RTSP feeds this extra time waiting for a packet is probably negligible, but for a cloud feed like an HLS feed this might be more like 5-10 seconds. I guess this is one argument for moving the image generation process back out of the worker.
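The failsafe timeout being debated here is essentially an `asyncio.wait_for` around the wait for the worker's signal. A minimal stdlib sketch, assuming an event-based signal (the constant value, function names, and the `asyncio.Event` mechanism are illustrative assumptions, not the PR's actual implementation):

```python
import asyncio
from typing import Callable, Optional

KEYFRAME_TIMEOUT = 9.0  # hypothetical value: generous enough for slow HLS feeds


async def wait_for_image(
    image_ready: asyncio.Event, get_bytes: Callable[[], bytes]
) -> Optional[bytes]:
    """Wait for the worker's signal, bounded by a failsafe timeout."""
    try:
        await asyncio.wait_for(image_ready.wait(), KEYFRAME_TIMEOUT)
    except asyncio.TimeoutError:
        return None  # worker never delivered a frame; caller treats as unavailable
    return get_bytes()


async def demo() -> Optional[bytes]:
    ready = asyncio.Event()
    # simulate the worker signalling shortly after the request arrives
    asyncio.get_running_loop().call_later(0.01, ready.set)
    return await wait_for_image(ready, lambda: b"jpeg")


print(asyncio.run(demo()))  # b'jpeg'
```

The trade-off discussed above lives entirely in `KEYFRAME_TIMEOUT`: too short and slow HLS feeds time out spuriously; too long and a stuck worker blocks the caller.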
    stream.stop()

async def test_get_image(hass, record_worker_sync):
Is it possible to add some test coverage for the case of multiple tasks asking for an image thumbnail to be produced? (I assume that is a case that can happen.)
We can add this after we figure out the other stuff.
As the code stands right now, there is an asyncio lock around the call to generate_image() from the main thread, so there can only be one generate_image() per KeyframeConverter/stream at a time. I think a test for that specific case is not necessary, since the lock is straightforward. Adding such a test may also be quite involved, as we'd have to add another pause/sync mechanism inside the get_image call so that the calls overlap; that would be a complicated test with little benefit.
As for the threading problems from before, I think they have mostly been resolved. The last one was due to trying to use the CodecContext object from the existing outputs. It would work for a while until the container was closed, at which point the CodecContext object was deallocated. I've now created a separate CodecContext just for KeyFrameConverter use.
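The serialization property the lock provides can be shown with a tiny stdlib sketch. This is not the PR's actual class, just an illustration that an `asyncio.Lock` keeps concurrent `get_image()` callers from overlapping (the counter fields are added purely to observe the behavior):

```python
import asyncio


class KeyFrameConverter:
    """Minimal sketch of the locking described above; not the real class."""

    def __init__(self) -> None:
        self._lock = asyncio.Lock()
        self._in_flight = 0
        self.max_concurrent = 0  # instrumentation to observe overlap

    async def get_image(self) -> bytes:
        async with self._lock:  # one generate_image() per converter at a time
            self._in_flight += 1
            self.max_concurrent = max(self.max_concurrent, self._in_flight)
            await asyncio.sleep(0)  # stand-in for the worker handoff / encode
            self._in_flight -= 1
            return b"jpeg"


async def main() -> int:
    conv = KeyFrameConverter()
    # five concurrent callers; the lock forces them through one at a time
    await asyncio.gather(*(conv.get_image() for _ in range(5)))
    return conv.max_concurrent


print(asyncio.run(main()))  # 1
```

Since the lock alone guarantees this, a dedicated overlap test would mostly be re-testing `asyncio.Lock` itself.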
    # Keep import here so that we can import stream integration without installing reqs
    # pylint: disable=import-outside-toplevel
    from homeassistant.components.camera.img_util import TurboJPEGSingleton
Is it a problem to make camera a dependency of stream? It seems like they would be loading it anyway.
There's also a circular import problem between the two components, because camera also imports things from stream. The easiest way to avoid this is to rejig the imports (including the conditional ones) so the components stop importing individual classes/functions from each other and instead import the whole modules (i.e. avoid `from x import y` and just use `import x`). However, the former style seems to be preferred in the HA codebase, and the existing stream and camera code follow it. If we want to make those changes, I can do that in this PR or another one.
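The reason `import x` breaks the cycle while `from x import y` does not: a module-level `import x` only binds the (possibly still partially initialized) module object, with attribute lookup deferred until call time. A runnable sketch with two throwaway modules (`camera_mod`/`stream_mod` are hypothetical stand-ins for the real components):

```python
import os
import sys
import tempfile
import textwrap

tmp = tempfile.mkdtemp()

# camera_mod imports stream_mod at module load...
with open(os.path.join(tmp, "camera_mod.py"), "w") as f:
    f.write(textwrap.dedent("""
        import stream_mod  # whole-module import: safe in a cycle

        def snapshot():
            return stream_mod.get_image()
    """))

# ...and stream_mod imports camera_mod right back, completing the cycle.
# Because it only binds the module object (not `from camera_mod import ...`),
# the partially initialized module is fine; lookups happen at call time.
with open(os.path.join(tmp, "stream_mod.py"), "w") as f:
    f.write(textwrap.dedent("""
        import camera_mod  # whole-module import: safe in a cycle

        def get_image():
            return b"jpeg-bytes"
    """))

sys.path.insert(0, tmp)
import camera_mod

print(camera_mod.snapshot())  # b'jpeg-bytes'
```

If `stream_mod` instead did `from camera_mod import snapshot`, the name would be looked up while `camera_mod` was still half-initialized and the import would fail, which is the circular import problem described above.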
Force-pushed 3f61b65 to 6e1b320:
Refactor and do keyframe conversion in worker
Refactor KeyFrameConverter
Add KeyFrameConverter comments
Adjust KEYFRAME_TIMEOUT
Force-pushed 6e1b320 to f549f31.
Sorry, I had to rebase several times to get around frenck's CI issue. Should be good now.
Proposed change
This PR adds a get_image method to Stream which generates a still image from the most recent keyframe in the RTSP feed. This can be used by camera platforms that support Stream but have no still snapshot URL, as an alternative to using haffmpeg to establish a new connection to the camera whenever an image is requested. For this to work properly, the stream must be active (i.e. "preload stream" needs to be on). The jpeg creation step requires turbojpeg to be installed (alternatively we could use Pillow).
Type of change
Additional information
Checklist
- The code has been formatted using Black (black --fast homeassistant tests)

If user exposed functionality or configuration variables are added/changed:

If the code communicates with devices, web services, or third-party tools:
- Updated and included derived files by running: python3 -m script.hassfest.
- New or updated dependencies have been added to requirements_all.txt. Updated by running python3 -m script.gen_requirements_all.
- Untested files have been added to .coveragerc.

The integration reached or maintains the following Integration Quality Scale:

To help with the load of incoming pull requests: