Possible bug in the audio part of SelectRangeEvery #232

TomArrow · 2021-09-24T14:11:10Z

In avs_core/filters/field.cpp, line 1085:

startframe = (iteration+1) * every;

Seems to be a bug to me. I believe startframe is supposed to track the frame position relative to the output video, but multiplying by every would make it relative to the source video.

As a result, I believe that there are countless glitches in the generated audio, depending on how exactly an application requests the audio and in which order. Misplaced segments, silent areas, etc.

I believe the correct way to do this would be:

startframe = (iteration+1) * length;

The audio code in general could maybe use a revamp to be based solely on audio samples, without bringing frames into it, because it's currently integer-based, but the relationship between framerate and audio sample rate isn't guaranteed to be expressible with an integer. Perhaps it could be solved by converting the length and every parameters into their corresponding audio sample equivalents in the constructor and using those for the audio? However I can also see how in the worst case, that could cause a very slight drift over extremely long videos. But as it is now, I think the current code can result in lost or unpredictable individual samples at the borders of the ranges.

pinterf · 2021-09-25T07:35:35Z

Without digging into the problem deeper, let me have a question.
In Avisynth wiki there is a part regarding the audio:
http://avisynth.nl/index.php/Select
"SelectRangeEvery will normally process audio. To keep the original audio, use audio=false."
Does it explain the problem?

TomArrow · 2021-09-25T13:32:28Z

Without digging into the problem deeper, let me have a question.
In Avisynth wiki there is a part regarding the audio:
http://avisynth.nl/index.php/Select
"SelectRangeEvery will normally process audio. To keep the original audio, use audio=false."
Does it explain the problem?

Thanks for your reply. It doesn't, it's a bug in how exactly the audio is processed when audio=true I believe. It often will even work (seemingly) well for a few seconds or so (at least in my case), which is why it might be easy to overlook. The most noticeable kind of glitch is when the output becomes silent, but I reckon this doesn't tend to happen in the beginning of the output because the offsets are still so small that even when jumping too far ahead, you still end up with some kind of audio instead of silence. I reckon the silence happens when the error becomes so large that it tries to query source audio past its end.

Here's an example with SelectRangeEvery(240,60).

This is the source audio:

And here's two times the result. Nothing was changed, just rendered out twice from VirtualDub:

As you can see, both results are different. I think this is because VirtualDub won't always query the audio frames in the exact same order. So the position of the errors shifts. Some parts remain stable across the two attempts as you can see, others change around, move around borders, etc.

But both have one thing in common: The beginning of the audio seems okay at first glance if you just take a quick listen.

qyot27 · 2021-09-25T23:36:13Z

I think this is because VirtualDub won't always query the audio frames in the exact same order. So the position of the errors shifts. Some parts remain stable across the two attempts as you can see, others change around, move around borders, etc.

What happens with libavformat, though? It could mostly (or entirely) be a bug in VirtualDub, or a consequence of VDub probably having to go through the ACM to decode/render audio (or even the interference of accessing AviSynth through Video for Windows).

TomArrow · 2021-09-26T00:01:50Z

Sorry, I don't know what libavformat is, but I've also tried ffmpeg, if that answers the question. Same issue. Also tried rendering only audio vs with video, which changes the durations requested in one go. With video rendering, the artifacts become more short term and often, because the count of samples requested is typically exactly one video frame (in my example it was 1920 samples per request iirc, but ofc depends on frame & samplerate).

I should perhaps mention that I made my own version of the SelectRangeEvery filter: https://github.com/TomArrow/SelectRangeEveryReversing

That is how I found the error. Making the suggested change fixed the issue for me. If you look at the logic of the function, I think it's pretty clear that it should be *length.

Initially the startframe variable is calculated like this:

int startframe = vi.FramesFromAudioSamples(start);

with start being the value with which GetAudio is called. So startframe is clearly in reference to the request, or in other words, output video.

Then iteration is calculated by dividing startframe by length:

const int iteration = startframe / length;

And lastly this is called to advance the "pointer" startframe to the next section to be rendered:

startframe = (iteration+1) * every;

So we first divide by length and then we multiply by every. Which doesn't really make a whole lot of sense.

qyot27 · 2021-09-26T00:30:04Z

libavformat is the (de)muxing library in FFmpeg.

The AviSynth support in it is implemented by accessing the AviSynth library through the C interface directly, so there's no middleman.

TomArrow · 2021-09-26T00:31:58Z

Ah, gotcha. Well, I never had any issues with VirtualDub in the past that are comparable and I also have the issue with ffmpeg, as stated above.

TomArrow · 2021-09-26T00:47:30Z

Hmm, I just tried again to reproduce with ffmpeg with -vn -acodec copy and failed to get an example that is as drastic as what I showed with VirtualDub. I reckon it's because ffmpeg might be more consistent in how it requests audio frames, maybe always based on video framerate.

That makes it even clearer perhaps why it stayed under the radar for so long. The glitches would be rather small and hard to notice.

However I managed to come up with a method to reproduce them more clearly in ffmpeg too. Simply add ChangeFPS(1,linear=false) (after SelectRangeEvery) to reduce the framerate. My suspicion was that this could force ffmpeg into requesting larger blocks and I think I was correct, based on the results.

Here is the spectrum at 25fps with ffmpeg:

Here it is (otherwise identical .avs file, only with the ChangeFPS addition) at 1 fps:

Hope that's convincing enough. ;)

pinterf · 2021-09-27T11:11:33Z

Can you check this build (with your proposed change - local build, no commit yet), thanks
https://drive.google.com/uc?export=download&id=13-lNFkFHkRg4-mwE2uCI16UbpyrE_REp

TomArrow · 2021-09-27T14:09:19Z

Sure, I can test it, thanks. Where do I put all the files? In my AviSynth+ installation folder there is no AviSynth.dll and no plugins and system folder etc.

pinterf · 2021-09-27T14:55:40Z

Usually I just overwrite existing avisynth.dll in system32 with the one in the x64 folder

TomArrow · 2021-09-27T15:26:57Z

Alright, here's my new test:
Ffmpeg normal:

Ffmpeg at 1 fps:

VirtualDub normal (audio render):

Definitely a lot more consistent. No silent places anymore. The two ffmpeg outputs seem almost identical. The Vdub is still a tiny bit different.

But I think the remaining differences are likely down to rounding errors and such and not related to this bug. Would likely need a major rework to fix those I reckon.

As far as subjective impression, the versions sound pretty close to each other, but the Virtualdub one seems to crackle at the transitions sometimes. But it could just be random, with the slight discrepancies causing crackles on some transitions and not on others.

Thanks!

pinterf closed this as completed in ac9ed0f Sep 28, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Possible bug in the audio part of SelectRangeEvery #232

Possible bug in the audio part of SelectRangeEvery #232

TomArrow commented Sep 24, 2021 •

edited

Loading

pinterf commented Sep 25, 2021

TomArrow commented Sep 25, 2021

qyot27 commented Sep 25, 2021

TomArrow commented Sep 26, 2021

qyot27 commented Sep 26, 2021

TomArrow commented Sep 26, 2021

TomArrow commented Sep 26, 2021 •

edited

Loading

pinterf commented Sep 27, 2021

TomArrow commented Sep 27, 2021

pinterf commented Sep 27, 2021

TomArrow commented Sep 27, 2021

Possible bug in the audio part of SelectRangeEvery #232

Possible bug in the audio part of SelectRangeEvery #232

Comments

TomArrow commented Sep 24, 2021 • edited Loading

pinterf commented Sep 25, 2021

TomArrow commented Sep 25, 2021

qyot27 commented Sep 25, 2021

TomArrow commented Sep 26, 2021

qyot27 commented Sep 26, 2021

TomArrow commented Sep 26, 2021

TomArrow commented Sep 26, 2021 • edited Loading

pinterf commented Sep 27, 2021

TomArrow commented Sep 27, 2021

pinterf commented Sep 27, 2021

TomArrow commented Sep 27, 2021

TomArrow commented Sep 24, 2021 •

edited

Loading

TomArrow commented Sep 26, 2021 •

edited

Loading