How to cut a waveform above certain db level with ffmpeg?

19th Nov 2020 08:17 #1
qo4on

View Profile

View Forum Posts

Private Message
Member

Join Date
Jan 2018
My audio file pcm_f32le has some values above 0 db. I'd like to cut/clip everything above 0. I know that I can use alimiter or loudnorm but the goal is to make values more than 0 to be equal to 0 and keep everything else unchanged. I cannot find appropriate command for this in the documentation.

Quote
19th Nov 2020 13:43 #2
manono

View Profile

View Forum Posts

Private Message
Member

Join Date
Aug 2003
I'd recommend using a real wave editor, one like Audacity. Then add in the Relife VST plugin and more-or-less fix the problem. What you want to do doesn't fix anything.

Quote
19th Nov 2020 13:56 #3
qo4on

View Profile

View Forum Posts

Private Message
Member

Join Date
Jan 2018
I need a command line tool. Also I don't need to restore clipped values. I want to cut everything above 0 and nothing more.

Quote
19th Nov 2020 14:31 #4
ProWo

View Profile

View Forum Posts

Private Message
Member

Join Date
Dec 2019
Try: ffmpeg -i "your.wav" -acodec pcm_s16le -ar 44100 "output.wav" (for music).
ffmpeg -i "your.wav" -acodec pcm_s16le -ar 48000 "output.wav" (for video audiotrack).

Quote
19th Nov 2020 16:10 #5
qo4on

View Profile

View Forum Posts

Private Message
Member

Join Date
Jan 2018
This line adds frequencies above 12k.

[Attachment 55915 - Click to enlarge]

Quote
19th Nov 2020 16:49 #6
Cornucopia

View Profile

View Forum Posts

Private Message
Member

Join Date
Oct 2001

Location
Deep in the Heart of Texas
I want to cut everything above 0 and nothing more.

First, there is NOTHING above 0. Digital signals go up from -XXdB (depending on bitdepth) to 0dB. There is nothing above that. Greater than 0dB exists in the analog world because they utilize a thing called "headroom". Thus, there is a spread above 0dB for consumer gear (usually ~10-14dB) and professional gear (usually ~18-24dB) to accommodate signal peaks while avoiding the onset of gross distortion. With digital, once you hit the 0 ceiling, you clip and have massive distortion (because the clipping creates what amounts to a square wave, which is high in odd harmonic distortion).
BTW, best practices of pro audio is to always set the nominal level at ~-18-24dB or so, to accommodate the digital equivalent of headroom.

Second, your example looks to already be clipping. IOW, it has already removed any signal that would have been above 0dB.
Note that your 2nd pic is a frequency-based plot. Yes, because of the clipping, and the square-waving, you have those added odd harmonics (aka spurious aliasing artifacts, which unfortunately are also correllated with the signal and so are even more noticeable than non-correllated noise). Those are what you are seeing in the frequency area above the normal limits. Those harmonics are lower in level (remember, you're not seeing LEVEL on the X or Y axes of the plot, except by looking at the intensity of the color - notice that the bulk of the intensity is in the lower frequencies).

Third, why on earth would you NOT want to un-clip? The way to restore the signal properly (beyond the preferred method of re-capturing using proper signal levels to begin with), is to un-clip. Un-clipping will result in the overall level of the signal being reduced to accommodate, and then synthesizing the peaks (based on sinusoidal extrapolation). This will do what I believe you are ultimately wanting to do, whether you realize it or not.

Fourth, I don't know of any command-line tools that are worthwhile that feature un-clipping (aka "clip restoration", "peak restoration", etc).

Scott

Quote
20th Nov 2020 03:10 #7
qo4on

View Profile

View Forum Posts

Private Message
Member

Join Date
Jan 2018
I selected clipped area, reduced the volume and got all clipped information back. My file is in float format pcm_f32le.

[Attachment 55920 - Click to enlarge]

I think I can open wav file as a numpy array and then replace all values above 0 with 0.

[Attachment 55921 - Click to enlarge]

But I'm not sure what is the right range for pcm_f32le [-1.0, 1.0) or [-1.0, 1.0] or (-1.0, 1.0)?

Quote
20th Nov 2020 12:23 #8
ItaloFan

View Profile

View Forum Posts

Private Message
Member

Join Date
Jul 2018

Location
Colorado, USA
Originally Posted by Cornucopia

First, there is NOTHING above 0.

You are thinking of integer PCM formats. OP said his audio file is 32-bit floating-point. ±1.0* in that format may be thought of as zero dB because it correlates with the maximum/minimum amplitude possible in integer formats, but there is no actual limit in 32-bit float. It is the format used internally by wave editors and DSPs; it can go very far above or below zero, with very high precision.

Although it might be possible, FFmpeg is primarily a video toolkit. I would use a compressor/limiter effect in a wave editor, or if I had to do it from the command-line, I'd use SoX (-l or the compand effect).

I would not just replace everything with zeroes; that will add audible harmonics because it's pure clipping, no different than just doing a naïve conversion to an integer bit depth. You want to use a limiter because it will e.g. logarithmically squish everything above -3 dB so that it fits into into the 0 to -3 dB range.

* as the OP implies in the last question, one of these extremes (+1.0 or -1.0) might be an estimate; I'd have to look it up or experiment in an editor.

Last edited by ItaloFan; 20th Nov 2020 at 12:36.

Quote
20th Nov 2020 16:09 #9
Cornucopia

View Profile

View Forum Posts

Private Message
Member

Join Date
Oct 2001

Location
Deep in the Heart of Texas
Yes, I didn't catch it the first time, but his last post made that quite clear.

HOWEVER, unless this stuff was captured by some of the latest Sound Devices, or Zoom, or Tentacle Sync gear (all $$), it may CURRENTLY be 32bit float, but it wasn't captured that way. And my guess is that in the (standard PCM) capturing, it looks to have been clipped. Just because something is expressed in 32bit float format does not mean it can't represent clipped material.

***********************

The OP keeps mentioning "0dB" when referring to the spectral (frequency-based) plot - which he highlighted. But there is no gauge of 0dB on that. The cutoff seems to be ~11 or 12 kHz (probably 11.025, which is common for 22.050kHz aka 1/2 of 44.1kHz sampling rate). If you really want to remove items above the 11.025kHz line (which is what I think you think you are trying to do), either resample again to 22.050kHz (which should automatically apply an anti-aliasing filter at the Fs/2 cutoff of 11.025), or just use a brick wall Low-Pass filter at 11.025kHz without resampling.

And again, this kind of stuff is NOT best served using CLI tools.

Scott

Last edited by Cornucopia; 20th Nov 2020 at 16:32.

Quote
20th Nov 2020 16:27 #10
Cornucopia

View Profile

View Forum Posts

Private Message
Member

Join Date
Oct 2001

Location
Deep in the Heart of Texas
Note: if you actually DO have a 32bitFloat file that came from one of those new 32bitFloat devices, here is a link to what the demo'er experienced and how to get seemingly "clipped" signals to act un-clipped...
https://www.thebroadcastbridge.com/content/entry/14755/hands-on-with-32-bit-float-recording

First, the user records a 32-bit float file on a compatible recorder to record a 32-bit float file. When the DAW first reads a 32-bit float file, signals greater than 0 dBFS may first appear clipped since, by default, files are read in with 0 dB of gain applied.

By applying attenuation to the file in the DAW, signals above 0 dBFS can be brought below 0 dBFS, undistorted, and used just like any 24- or 16-bit file.

So, basically, just simple attenuation.

Scott

Quote
20th Nov 2020 21:31 #11
ItaloFan

View Profile

View Forum Posts

Private Message
Member

Join Date
Jul 2018

Location
Colorado, USA
The spectrogram indicates a low-quality lossy source, like maybe a 64 kbps MP3. If he has access to an MP3, a better way to deal with it would be to downward-adjust the global gain field in the MP3 frames, e.g. with a tool designed for ReplayGain adjustments (MP3Gain/AACGain or foobar2000).

Quote
22nd Nov 2020 05:55 #12
qo4on

View Profile

View Forum Posts

Private Message
Member

Join Date
Jan 2018
You are right it is a low-quality mp3. But occasionally there are some 1 frame peaks above 0. When I cut them out with numpy array I see the same artifacts on the spectrogram above 12 kHz. I think the best thing I can do is to use hard limiter.

Quote
22nd Nov 2020 12:45 #13
richardpl

View Profile

View Forum Posts

Private Message
Member

Join Date
Apr 2018

Location
Croatia
With ffmpeg, it is possible to do it with asoftclip=type=hard, or even aeval filter. But note that will give you distortions.
Also with ffmpeg is even possible to declip audio.

Quote
22nd Nov 2020 13:50 #14
ItaloFan

View Profile

View Forum Posts

Private Message
Member

Join Date
Jul 2018

Location
Colorado, USA
Clipping (sudden flattening of wave tops) creates a square wave, which is the sum of infinite harmonics, so yes, when you introduce hard clipping, your spectrogram will show broadband spikes of noise in those places.

As I mentioned, I'd want to use a ReplayGain-applying tool to scale down the volume of the MP3. It could have been made from a heavily clipped source that is now partially clipped due to the nature of MP3 encoding, or it could have been made from an unclipped source but the MP3 was made obnoxiously loud due to its internal volume control (the global gain field in each frame). Either way, if we reduce that volume, we can then apply dynamic range compression as needed to bring the volume up to where you want without introducing any new clipping.

Feel free to PM a link to the file and I'll have a look.

Quote
22nd Nov 2020 16:02 #15
hello_hello

View Profile

View Forum Posts

Private Message
Member

Join Date
Mar 2012
If you're patient, you can adjust the volume of single MP3 frame with MP3DirectCut (it can be a bit "fiddly"). Converting to a wave file too still helps, because MP3DirectCut doesn't display a real waveform, so opening a wave file in another program can tell you where to look in MP3DirectCut. Changes are you'll only be able to reduce the volume of small sections of audio by a few dB without causing audible glitches, but it might be worth a try.

For the record, the MP3 format is capable of encoding peaks above 0dB, at least to some extent, and the same applies to AAC. Unfortunately though, not all codecs accept a 32 bit float input, and in the case of LAME which I think does, it converts to 24 bit before it encodes and clips the peaks that way. I'm fairly certain ffmpeg's version encodes 32 bit float directly and can therefore encode peaks above 0dB, although I kind of remember +6dB to +10dB being about as much as the encoder itself was happy with. It's been a while since I've tried, but I remember QAAC handling peaks well over +10dB.

So I agree with ItaloFan. Try using something like Mp3Gain or foobar2000 to adjust the volume down over-all (it's lossless for MP3/AAC) to see if looks prettier when it's decoded. Failing that, if I need to squish the occasional peak and I'm converting anyway, I mostly use foobar2000's limiter and convert/limit in a single step. The Advanced Limiter seems to do a good job, but it doesn't kick in until the audio samples hit 0dB (I don't think) so the true peaks can still exceed 0dB. My tip of the day.... I add the Amplify DSP to the conversion chain first to increase the volume by a few dB, followed by the limiter, and followed by the Amplify DSP again to reverse the volume boost. For example
Amplify (+3dB) -> Limiter -> Amplify (-3dB)
would limit to -3dB, as the audio is processed in 32 bit float.

These days, the thought of having to convert to wave, import into an editor, run a filter, adjust it, run it again, export, convert.... it's depressing just thinking about it.

Last edited by hello_hello; 22nd Nov 2020 at 16:14.

Avisynth functions Resize8 Mod - Audio Speed/Meter/Wave - FixBlend.zip - Position.zip
Avisynth/VapourSynth functions CropResize - FrostyBorders - CPreview (Cropping Preview)

Quote
22nd Nov 2020 16:07 #16
richardpl

View Profile

View Forum Posts

Private Message
Member

Join Date
Apr 2018

Location
Croatia
Originally Posted by hello_hello

These days, the thought of having to convert to wave, import into an editor, run a filter, adjust it, run it again, export, convert.... it's depressing just thinking about it.

That is definitely the reason to use ffmpeg, all that above mentioned steps can be made with ffmpeg too.

Quote
22nd Nov 2020 19:47 #17
Cornucopia

View Profile

View Forum Posts

Private Message
Member

Join Date
Oct 2001

Location
Deep in the Heart of Texas
Quote
22nd Nov 2020 19:52 #18
hello_hello

View Profile

View Forum Posts

Private Message
Member

Join Date
Mar 2012
Originally Posted by Cornucopia

First, the user records a 32-bit float file on a compatible recorder to record a 32-bit float file. When the DAW first reads a 32-bit float file, signals greater than 0 dBFS may first appear clipped since, by default, files are read in with 0 dB of gain applied.

By applying attenuation to the file in the DAW, signals above 0 dBFS can be brought below 0 dBFS, undistorted, and used just like any 24- or 16-bit file.

So, basically, just simple attenuation.

Scott

It sounds more like he's describing the process of working with the audio you've recorded, rather than anything the Zoom itself does, but I only bring it up again because I happened to be looking at the manual today, and then 3 or 4 YouTube videos with 3 or 4 explanations on how 32 bit mode works.... but get ready to hear about the awesomeness of 32 bit float for the next five years.
There's no going back. The cat's out of the bag.

Anyway, from what I can gather those things have dual A/D converters sensitive to different input gains on each input, sharing the work with the combined output being 32 bit float. And some of the YouTube videos indicate it works quite well. There's no way to adjust the input gain in 32 bit mode, but from what I can tell the device doesn't try to adjust the gain either. One YouTube sceptic did say he was a little impressed when it came to amplifying very quiet sections, in respect to the noise floor, and thanks to the awesomeness of 32 bit float.

Last edited by hello_hello; 22nd Nov 2020 at 19:59.

Avisynth functions Resize8 Mod - Audio Speed/Meter/Wave - FixBlend.zip - Position.zip
Avisynth/VapourSynth functions CropResize - FrostyBorders - CPreview (Cropping Preview)

Quote
22nd Nov 2020 22:29 #19
Cornucopia

View Profile

View Forum Posts

Private Message
Member

Join Date
Oct 2001

Location
Deep in the Heart of Texas
Yeah, 32bit float very cool. Puts normalizing on a whole new level (pun intended), or in another sense makes it moot.

The idea of banking inputs on recorders (e.g. input 1 has mic A at low sensitivity, input 2 has same mic A at high sensitivity) is not new. In fact, I was doing that very thing, AND combining them into a much higher bitdepth consolidated master signal as much as 15 years ago. But then it was 24 or 32 bit int format. This just extends that idea even further.

The thing with mixing these is not a complete panacea, however. The adage of "every channel you mix into the master adds to the noise floor" is still true, but it just helps with the linearity and lack of quantization distortion when you work fully in 32bit float. Which isn't insignificant.

Scott

Quote
23rd Nov 2020 05:46 #20
qo4on

View Profile

View Forum Posts

Private Message
Member

Join Date
Jan 2018
The idea of clipping above 0 was wrong. Hard limiter works better. Thank you everybody.

Quote
23rd Nov 2020 14:46 #21
hello_hello

View Profile

View Forum Posts

Private Message
Member

Join Date
Mar 2012
Originally Posted by Cornucopia

Yeah, 32bit float very cool. Puts normalizing on a whole new level (pun intended), or in another sense makes it moot.

I'm all for working in 32 bit float, but I was a little surprised when some YouTube videos made it sound like the latest thing. It also appears to have more dynamic range than it did ten years ago. I've seen the same number quoted more than once now, so maybe it'll evolve into a fact. I don't think "precision" was mentioned in the video used for the screenshot attached.

[Attachment 55957 - Click to enlarge].

I swear the next time some c%n$ asks me to subscribe to a f%#@ YouTube channel while I'm watching a video my head will explode.....

I'm off on a bit of a tangent here, but perfection should be shared
This guy has taken the art of trolling to such an amazing level..... as I watched I found myself wondering if this could be as close as it gets to the perfect troll.
The Dark Side of the Moon: analog & digital comparison (CD, SACD, Vinyl, Tape)

Last edited by hello_hello; 23rd Nov 2020 at 15:01.

Avisynth functions Resize8 Mod - Audio Speed/Meter/Wave - FixBlend.zip - Position.zip
Avisynth/VapourSynth functions CropResize - FrostyBorders - CPreview (Cropping Preview)

Quote
24th Nov 2020 07:21 #22
richardpl

View Profile

View Forum Posts

Private Message
Member

Join Date
Apr 2018

Location
Croatia
Do not tell him about long doubles.

Quote
18th Nov 2022 10:25 #23
Mr Awesome

View Profile

View Forum Posts

Private Message
Member

Join Date
Nov 2022
Originally Posted by manono

I'd recommend using a real wave editor, one like Audacity. Then add in the Relife VST plugin and more-or-less fix the problem. What you want to do doesn't fix anything.

How do you use that relife plugin in audacity? It says the plugin is incompatible when I move the dll file into the plugins folder....as I also need to unclip a wave file thats seems to have been recorded with the gain knobbed turned all the way up or too much....and decreasing volume with ffmpeg doesnt work:

Code:

-af volume=-10dB:precision=fixed
Last edited by Mr Awesome; 19th Nov 2022 at 20:35.
Quote
18th Nov 2022 13:13 #24
manono

View Profile

View Forum Posts

Private Message
Member

Join Date
Aug 2003
Sorry, don't know. I've installed it a couple of times into different versions of Audacity. I stick the DLL in that plugins folder and restart Audacity and it shows up. Never seen the message you're experiencing. Sorry, not much help. It works great, though.

Quote
19th Nov 2022 02:16 #25
pandy

View Profile

View Forum Posts

Private Message
Member

Join Date
Sep 2008
Floats offer higher dynamic range at a cost of accuracy and in pro audio applications they are usually avoided especially in applications where higher accuracy is mandatory (such as high order IIR filters). Floats are mostly selected by developers due their easiness (floats remove responsibility from developer to deal with signal) not as wise system decision (i.e. engineering choice).

Overall OP problem is but odd... even if signal actual value is equal 0 then energy is signal still can be higher and this will be exposed in reconstruction filter later (in analog domain) - those harmonics visible in spectrum is clear symptom for this.

Proper way to deal with such signal is normalization (preferably to maximum -3dBFS), then dynamics compression (soft limiters may work on similar principle), at the end of process verification for clipping presence is recommended (usually performed by 4 time oversampling and low-pass filter to expose eventual peaks).

btw pushing for 0dBFS level is usually not justified - -3dBFS is loss of half bit in signal accuracy so even with 16 bit samples there is still OK to have non distorted (non clipped) signal with 15.5 bit resolution especially that typical signal added TPDF dither has 2 bits level.

Quote
19th Nov 2022 20:35 #26
Mr Awesome

View Profile

View Forum Posts

Private Message
Member

Join Date
Nov 2022
Originally Posted by manono

Sorry, don't know. I've installed it a couple of times into different versions of Audacity. I stick the DLL in that plugins folder and restart Audacity and it shows up. Never seen the message you're experiencing. Sorry, not much help. It works great, though.

Ok, it would be helpful if you can please at least tell me which version you tried it on and it works with no errors or windows saying its missing this and that? I already tried the latest spyware version 3.2.1 and the latest non-spyware version 2.4.2 and neither worked....

Originally Posted by pandy

Proper way to deal with such signal is normalization (preferably to maximum -3dBFS), then dynamics compression (soft limiters may work on similar principle), at the end of process verification for clipping presence is recommended (usually performed by 4 time oversampling and low-pass filter to expose eventual peaks).

I'm guessing you can't do this directly in audacity without a plugin like relife, which is what that does for you, right?

As for why some people push for 0db, I've been told it could be on purpose as a feature of say a music piece....or to simulate a strong sound force such as in a huge explosion scenes in movies or shows that if realistically recorded with a mic, would sound as such. Those two would be the only reason to push that high, other than trying to make it louder than your volume knob on your speaker can go which still sounds quiet is probably why some people remix it bass boosted and the such....

Quote
22nd Nov 2022 06:42 #27
pandy

View Profile

View Forum Posts

Private Message
Member

Join Date
Sep 2008
Originally Posted by Mr Awesome

I'm guessing you can't do this directly in audacity without a plugin like relife, which is what that does for you, right?

As for why some people push for 0db, I've been told it could be on purpose as a feature of say a music piece....or to simulate a strong sound force such as in a huge explosion scenes in movies or shows that if realistically recorded with a mic, would sound as such. Those two would be the only reason to push that high, other than trying to make it louder than your volume knob on your speaker can go which still sounds quiet is probably why some people remix it bass boosted and the such....

Not sure about Audacity (long time not used this app) especially after it become payware-like. But operations are basic and should be present in every audio editor so perhaps multiple operations (but i bet you can combine them as user script i run as single plug-in like).

Well... good mic's offer usually ability to deal with higher dynamics than loudspeaker is capable. Distorting audio by hard clip you risk damage of your tweeter's as higher harmonics may lead to overheat such small, high tone speakers...

I would recommend to not clip audio and keep -3dBFS as maximum level but of course everyone has right to do free choice on their own bill.

Quote

How to cut a waveform above certain db level with ffmpeg?

Thread Tools

Search Thread

Similar Threads

ffmpeg cut point wrong with 5 reference frames video

ffmpeg how to cut off video and keep subtitles

Is Cut point a bug in ffmpeg?

Cannot cut at I frame with FFMPEG

FFmpeg "ErrorDetectionType Per level 1" ?