
Broken buffer detection - contrast method #94

Open · wants to merge 6 commits into base: feat-preprocess

Conversation

@MarcelMB (Contributor) commented Jan 16, 2025

Takuya's approach was:

  • Detect broken buffers by comparing each buffer with the buffer at the same position in the previous frame (currently using only the mean error).
  • Remove frames that contain broken buffers. The removed frames are stacked and tracked individually so we can examine which frames got dropped.

This method works fine most of the time, but it has an issue with the data we recorded in December, because the previous frame is often also broken.

I added another method: block_contrast

  • Broken buffers typically have higher contrast, so applying local contrast detection to identify regions with unusually bright or dark pixels can help.

  • A broken buffer looks like black-and-white speckle and therefore has very high contrast; buffers with 'real' neural images don't have this:
    [image: example of a broken buffer]

  • Detect regions with high contrast on a block-by-block basis (not for the entire frame; that didn't work well when I tried it), where each block matches the size of a buffer.

  • The frame is divided into non-overlapping blocks/buffers.
    • Each block is analyzed independently for contrast.

  • For each block, the standard deviation of pixel intensities is calculated.
    • If the standard deviation (contrast) exceeds the threshold, the block is flagged as noisy.
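The block-contrast check described above might be sketched roughly like this (function name, parameter names, and the 5000-pixel buffer size are illustrative, not the PR's actual API):

```python
import numpy as np

def detect_noisy_blocks(frame: np.ndarray, buffer_size: int, std_threshold: float) -> np.ndarray:
    """Flag full-width blocks whose pixel-intensity standard deviation
    exceeds a threshold. Returns a boolean mask with the frame's shape."""
    height, width = frame.shape
    block_height = max(1, buffer_size // width)  # each block spans the entire width
    noisy_mask = np.zeros_like(frame, dtype=bool)
    for y in range(0, height, block_height):
        block = frame[y:y + block_height, :]
        # Broken buffers look like black/white speckle, so their contrast
        # (standard deviation) is far higher than in real neural images.
        if np.std(block) > std_threshold:
            noisy_mask[y:y + block_height, :] = True
    return noisy_mask
```

A flat frame would produce an empty mask, while a speckle-corrupted band would flag only the block containing it.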

This worked well both with test data and with the data from December.

It's a relatively small PR, and I tried to stick strictly to how the code is currently organized, only adding one more method for filtering broken buffers, so it could be merged easily.


📚 Documentation preview 📚: https://miniscope-io--94.org.readthedocs.build/en/94/

@MarcelMB MarcelMB self-assigned this Jan 16, 2025
@t-sasatani (Collaborator) left a comment:

Nice! We can examine the dropped frames with the root denoise branch update. It'll be interesting to compare which frames this and mean_error drop.

One processing concern is that the comparison unit doesn't match the buffer shapes used in data transfer and instead redefines its own block shape. I commented more about this inline. My guess is that these blocks will produce more false positives/false negatives depending on the threshold, but I might be wrong.

Another request is to add some tests and do linting if possible, but as this isn't headed to the main branch, we can also take care of that later.

Comment on lines -158 to -160
self.diff_frames.append(
cv2.absdiff(input_frame, self.previous_frame)
* self.noise_patch_config.diff_multiply
Collaborator:

Could you leave these in so the mean version functions don't break? It's my bad if the primitive tests don't catch this; we need to update those too.

Contributor (Author):

this has changed somewhat with the unified mode that I have now implemented

)
else:
raise ValueError(f"Unsupported noise detection method: {self.noise_patch_config.method}")
Collaborator:

It may be better to validate in the denoise config model. If you can switch, that'll be great, but if not, it's not a big deal so we can leave it.

Comment on lines 142 to 144
# Use buffer_size to calculate the height of each block
block_height = noise_patch_config.buffer_size // width # Block spans entire width
block_height = max(1, block_height) # Ensure at least one row per block
Collaborator:

My intention in using the buffer size and chunking it up in the mean_error method was to match the comparison block with the shape of the buffers because errors are likely to occur within a buffer unit.

I guess you're using buffer_size for the same reason, but I think you need to serialize the frame so we can chunk it the way it's done in data transfer. This is done in the mean_error method, so I think it's worth looking into.
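For illustration, serializing the frame and chunking it into buffer-sized pieces might look like this (a sketch only; the real split_by_length in mio may differ, and the 5000-pixel buffer size is an assumed value):

```python
import numpy as np

def split_by_length(serialized: np.ndarray, chunk_length: int) -> list:
    """Chunk a 1-D serialized frame into buffer-sized pieces, mirroring how
    the data arrives over the wire (sketch, not the actual mio implementation)."""
    return [serialized[i:i + chunk_length]
            for i in range(0, serialized.size, chunk_length)]

frame = np.zeros((200, 200), dtype=np.uint8)
serialized = frame.flatten()            # row-major serialization
chunks = split_by_length(serialized, 5000)
# 200 x 200 = 40000 pixels; with 5000-pixel buffers that gives 8 chunks,
# consistent with the "8 splits for the 200x200" observation in this thread.
```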

Collaborator:

Or might it be better to unify it with the mean_error method as a single noise detection method and make it run different detection functions based on the input options? That way, it's already chunked in buffers (communication packets), and we can also visualize the areas within the frame that the detector determined are noisy, which the mean stuff is doing.

Contributor (Author):

Sounds good.
I will restructure the noise detection logic so that there's one entry point (detect_frame_with_noisy_buffer). Inside this method:
  • Serialize the frame into chunks (buffers).
  • Based on the configuration (e.g., method: mean_error or method: block_contrast), run the appropriate detection function on those chunks.
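A rough sketch of such a unified entry point (helper names, thresholds, and the config shape are illustrative, not the actual mio code):

```python
import numpy as np

def _chunk(frame: np.ndarray, buffer_size: int) -> list:
    """Serialize the frame row-major and split it into buffer-sized chunks."""
    serialized = frame.flatten()
    return [serialized[i:i + buffer_size]
            for i in range(0, serialized.size, buffer_size)]

def _mean_error(chunks, prev_chunks, threshold: float) -> list:
    # A buffer is noisy if its mean absolute difference from the
    # same-position buffer in the previous frame exceeds the threshold.
    return [float(np.mean(np.abs(c.astype(np.int16) - p.astype(np.int16)))) > threshold
            for c, p in zip(chunks, prev_chunks)]

def _block_contrast(chunks, threshold: float) -> list:
    # A buffer is noisy if its pixel-intensity standard deviation is too high.
    return [float(np.std(c)) > threshold for c in chunks]

def detect_frame_with_noisy_buffer(current_frame, previous_frame,
                                   method: str, buffer_size: int, threshold: float):
    """Single entry point: chunk once, then dispatch on the configured method."""
    chunks = _chunk(current_frame, buffer_size)
    if method == "mean_error" and previous_frame is not None:
        return _mean_error(chunks, _chunk(previous_frame, buffer_size), threshold)
    if method == "block_contrast":
        return _block_contrast(chunks, threshold)
    raise ValueError(f"Unsupported noise detection method: {method}")
```

Both methods then operate on the same buffer-aligned chunks, which also makes it easy to visualize which buffers were flagged.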

@@ -121,6 +121,53 @@ def detect_frame_with_noisy_buffer(
noise_output = np.concatenate(noisy_parts)[: self.height * self.width]
noise_patch = noise_output.reshape(self.width, self.height)
return any_buffer_has_noise, np.uint8(noise_patch)

def detect_frame_with_block_contrast(
@t-sasatani (Collaborator) commented Jan 16, 2025:

Maybe standard deviation or SD to be specific?

Contributor (Author):

changed to: def _detect_with_block_contrast_SD

@MarcelMB (Contributor, Author):

Main change in the new commit:
I unified the detection as Takuya suggested:
  • Serialize the frame into chunks (buffers).
  • Based on the configuration (e.g., method: mean_error or method: block_contrast), run the appropriate detection function on those chunks.

I needed to change the mean_error buffer_split from 10 to 8, because splitting the buffer into 10 smaller pieces didn't work; it only splits into up to 8 pieces, and 8 is the number of splits/chunks/buffers for the 200x200 frames.

I also included some logging for debugging.

@t-sasatani (Collaborator):

What do you mean when you say buffer_split 10 doesn't work? Does it just not detect errors correctly, or does it raise an error? (If it's an error, what kind?)

Comment on lines +185 to +186
# Slide through the frame vertically in block_height steps
for y in range(0, height, block_height):
Collaborator:

not sure why we are using a different splitting method here? we already have split_current passed to us (but unused).

logger.debug("Previous frame is None.")

buffer_size = noise_patch_config.buffer_size
split_current = self.split_by_length(serialized_current, buffer_size)
Collaborator:

this seems to be a different value for the second argument buffer_size compared to before buffer_size // buffer_split + 1 - does this affect the other method?

if noise_patch_config.method == "mean_error" and previous_frame is not None:
return self._detect_with_mean_error(split_current, split_previous, noise_patch_config)
elif noise_patch_config.method == "block_contrast":
return self._detect_with_block_contrast_SD(split_current, current_frame, buffer_size, noise_patch_config)
@sneakers-the-rat (Collaborator) commented Jan 17, 2025:

not sure why buffer_size is split out as a separate param when we are passing the config object anyway, seems like the signature here should be just (current_frame, noise_patch_config) (or (split_current, noise_patch_config) if there isn't a reason to have a different splitting method)

continue

mean_intensity = np.mean(block)
std_intensity = np.std(block)
@sneakers-the-rat (Collaborator) commented Jan 17, 2025:

I think what we want here is not the standard deviation of the whole block, but of neighboring pixels. otherwise it seems like this would be tripped by an uncorrupted buffer that just has a very bright region and a very dark region.

For example this image:
[screenshot: donut image]

has a standard deviation of 112.7

and this image

[screenshot: speckle image]

has a standard deviation of 127.5

and i can get the donut image to have the same standard deviation by increasing the size of the donut until half the pixels are 1 and half the pixels are 0.

If we instead use the second derivative (here over just the last axis, but you could also average the diffs over x and y), they are easily distinguishable.

>>> # the random image
>>> np.mean(np.diff(np.diff(speckle)))
np.float64(95.5089898989899)
>>> # the donut image
>>> np.mean(np.diff(np.diff(donut)))
np.float64(2.87030303030303)

[screenshots: the speckle and donut test images]
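To make the comparison concrete, here is a small self-contained version of that metric - mean absolute second derivative along rows (cast to float first, since np.diff on uint8 arrays wraps around) - run against two synthetic images like the ones above (the exact images and values are illustrative):

```python
import numpy as np

def mean_abs_second_diff(block: np.ndarray) -> float:
    """Mean absolute second derivative along rows: large for pixel-to-pixel
    speckle, small for smooth images even when they have high global contrast."""
    b = block.astype(np.float64)  # avoid uint8 wraparound in the differences
    return float(np.mean(np.abs(np.diff(b, n=2, axis=-1))))

rng = np.random.default_rng(0)
speckle = rng.integers(0, 2, size=(64, 64)) * 255   # broken-buffer-like noise
smooth = np.zeros((64, 64))
smooth[16:48, 16:48] = 255                          # bright square on dark background

# Both images have comparable global standard deviation (~127 vs ~110),
# so a plain-std threshold struggles to separate them, while the
# local-roughness metric differs by more than an order of magnitude.
```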

Collaborator:

Sort of related note: we just chatted that we'll probably need to combine detection methods, because there are two modes of broken buffers now: (a) sandstorm and (b) all black (not showing up here, but this happens if the preamble or header is missed). SD won't be good for detecting the latter, and the mean error comparison needs two almost-valid frames, so we'll need a fusion of these methods.

Doesn't have to be this PR, but we'll eventually have to combine these or think of a better detection method to reduce false positives/negatives.

Collaborator:

agreed on having several, separable methods rather than one huge complicated one

block_height = buffer_size // width # Block spans the entire width
block_height = max(1, block_height) # Ensure at least one row per block

noisy_mask = np.zeros_like(current_frame, dtype=np.uint8)
Collaborator:

seems like it could be dtype=bool for memory efficiency

@sneakers-the-rat (Collaborator) left a comment:

Since we're merging this into the preprocessing branch, and I figure we'll need further work there on refactoring these into separable classes, I'm not commenting on the need for that here. But we do need tests for this - two kinds would be ideal:

  • naturalistic, with a short video segment where we have "ground truth" labels for buffers known to be corrupted - confirm that we label those and only those buffers as corrupted
  • unit-like, where we generate a frame with a normal image in it (like that donut image) and then randomly corrupt some buffer-shaped segment within it

I also think we shouldn't just use the plain stdev, as I said in a comment, because it's not very specific to the corruption we're filtering for; I proposed an example alternative in the comments.
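The unit-like fixture might be sketched like this (the function name, the 5000-pixel buffer size, and the synthetic image are all assumptions for illustration):

```python
import numpy as np

def make_corrupted_frame(seed: int = 0, buffer_size: int = 5000):
    """Generate a smooth synthetic 200x200 frame, then overwrite one
    buffer-shaped segment of its serialized form with black/white speckle.
    Returns (frame, start_index, buffer_size) for use in assertions."""
    rng = np.random.default_rng(seed)
    height = width = 200
    y, x = np.mgrid[0:height, 0:width]
    # Smooth "neural-looking" image: a broad bright blob on a dark background
    frame = (200 * np.exp(-((x - 100) ** 2 + (y - 100) ** 2) / 2000)).astype(np.uint8)
    flat = frame.flatten()
    start = 2 * buffer_size  # corrupt the third buffer (arbitrary choice)
    flat[start:start + buffer_size] = rng.integers(0, 2, buffer_size) * 255
    return flat.reshape(height, width), start, buffer_size
```

A test would then run the detector over this frame and assert that exactly the corrupted buffer, and no other, is flagged.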

@sneakers-the-rat (Collaborator) commented Jan 17, 2025:

ok, I linted so the tests would run. @MarcelMB check out https://miniscope-io.readthedocs.io/en/latest/meta/contributing.html#linting - your IDE should be checking this for you (it's way less annoying that way, having the IDE warn you as you're writing and do the autofixes), but otherwise just run pdm run format, or install pre-commit with pip install pre-commit and then run pre-commit install while in the mio directory to run it automatically before committing.

edit: ope, I was thinking of another repo - we don't have tests dependent on code quality checks here; it's the PR not being to main. I'll fix that, one sec.

@sneakers-the-rat (Collaborator):

Added tests with a sample video (very small, just a 60-frame segment) with a lot of the speckle noise error in varying sizes. Currently the tests fail because we miss 5 of the frames with smaller patches. I think the more sensitive method described above would let us set a much lower threshold so we could catch those.
