Movie music too loud but dialogue too quiet fix needed

Thread

24th Jan 2017 22:56 #1
hokkom

View Profile

View Forum Posts

Private Message
Member

Join Date
May 2008
Hello all,

I have several videos that the movie music is loud but dialogue is too quiet. I normally try normalizing audios to help with this, but I found out that there is a better way of doing this for the soft dialogue parts only using an audio compressor.

What I would like to know is what is the best settings or a good starting place for doing this for audacity. Also is there a better plugin for audacity for doing this?

I'm open to suggestions as well as trying other audio editors if this will do what is needed.

I know this is fairly easy to accomplished with software media players, but I'm using my television media player which doesn't have any setting to accomplish this.

Thank you.

Quote
25th Jan 2017 00:06 #2
hech54

View Profile

View Forum Posts

Private Message
Member

Join Date
Jul 2001

Location
Yank in Europe
The best way is to break down the 5.1 audio into 6 waves and create your own 2.0 audio(I hate 5.1 audio for just this reason).....but that's gonna require several steps obviously.

Quote
25th Jan 2017 05:52 #3
pandy

View Profile

View Forum Posts

Private Message
Member

Join Date
Sep 2008
You can give a chance to dynaudnorm filter - i would use ffmpeg and apply this filter https://ffmpeg.org/ffmpeg-filters.html#dynaudnorm only on Center channel (or create 2 channel downmixed version as suggested by hech54) - you can keep original track and add new track to file (of course if this comply to file size).

Quote
25th Jan 2017 20:05 #4
hokkom

View Profile

View Forum Posts

Private Message
Member

Join Date
May 2008
I have broken down the audio to 2 channels stereo, but the dialogue is really low in sound while the music and explosions are really loud. I wanted to increase the gain of the soft passages while keeping the loud one the same.

Quote
25th Jan 2017 20:33 #5
netmask56

View Profile

View Forum Posts

Private Message
Member

Join Date
Sep 2005

Location
Sydney, Australia
If you could feed the audio into Multitrack view (Adobe Audition) you could process just the centre channel that contains the dialog. Compressing the dialog only would leave the music and effects at a more comfortable level. With a stereo downmix the dialog is spread across both the left and right channels so any compression is going to affect the music and effects equally. The result of that is the overall level of the sound might be louder but the ratio between dialog to M&E will remain much the same.
Some upward expansion on the dialog would lift the low level dialog but leave the normal and loud parts the same. Don't know if Audacity has this ability. You might have to produce 6 wave files from the AC3 ?

SONY 75" Full array 200Hz LED TV, Yamaha A1070 amp, Zidoo UHD3000, BeyonWiz PVR V2 (Enigma2 clone), Chromecast, Windows 11 Professional, QNAP NAS TS851

Quote
25th Jan 2017 22:08 #6
video.baba

View Profile

View Forum Posts

Private Message
Member

Join Date
May 2015
Originally Posted by hokkom

I have broken down the audio to 2 channels stereo, but the dialogue is really low in sound while the music and explosions are really loud. I wanted to increase the gain of the soft passages while keeping the loud one the same.

That's exactly what BOX4 is for. It uses the 'dynaudnorm' filter that pandy mentioned above.

Quote
25th Jan 2017 23:10 #7
hech54

View Profile

View Forum Posts

Private Message
Member

Join Date
Jul 2001

Location
Yank in Europe
Originally Posted by hokkom

I have broken down the audio to 2 channels stereo, but the dialogue is really low in sound while the music and explosions are really loud. I wanted to increase the gain of the soft passages while keeping the loud one the same.

If the original audio is 5.1, going straight to 2.0 is not going to help(obviously). 5.1 to 6 waves is the only way that YOU can control the downmix.

Quote

26th Jan 2017 07:44 #8

Member

Test bellow example - works for me:

Code:

ffmpeg -y -i "%1" -vn -c:a ac3 -b:a 192k -af "pan=stereo|FL < FL+1.414FC+0.5BL+0.5SL+0.25LFE+0.125BR|FR < FR+1.414FC+0.5BR+0.5SR+0.25LFE+0.125BL,firequalizer=gain='if(gte(f,16),0,-INF)+if(lte(f,16000),0,-INF)',dynaudnorm=p=1/sqrt(2):m=100:s=12:g=15,firequalizer=gain='if(gte(f,16),0,-INF)+if(lte(f,16000),0,-INF)',aresample=resampler=soxr:osr=48000:cutoff=0.990:dither_method=none" -f matroska "%~n1_dn.mkv"

If it works for your audio you may try to accommodate audio filter in your script.

Quote

27th Jan 2017 00:17 #9
hokkom

View Profile

View Forum Posts

Private Message
Member

Join Date
May 2008
Thank you all for responding.

I will give the above suggestions a try. I especially like the idea of feeding the audio into Multitrack view as netmask56 have mentioned above. I think this will give me a good idea of what the audio looks like and I will be able to adjust the audio better that way. I like the idea of boosting the center channel, and perhaps the Left and right channels while leaving the other channels alone, before downmixing to stereo.

Quote
27th Jan 2017 00:19 #10
hokkom

View Profile

View Forum Posts

Private Message
Member

Join Date
May 2008
Originally Posted by pandy

Test bellow example - works for me:

Code:

ffmpeg -y -i "%1" -vn -c:a ac3 -b:a 192k -af "pan=stereo|FL < FL+1.414FC+0.5BL+0.5SL+0.25LFE+0.125BR|FR < FR+1.414FC+0.5BR+0.5SR+0.25LFE+0.125BL,firequalizer=gain='if(gte(f,16),0,-INF)+if(lte(f,16000),0,-INF)',dynaudnorm=p=1/sqrt(2):m=100:s=12:g=15,firequalizer=gain='if(gte(f,16),0,-INF)+if(lte(f,16000),0,-INF)',aresample=resampler=soxr:osr=48000:cutoff=0.990:dither_method=none" -f matroska "%~n1_dn.mkv"

If it works for your audio you may try to accommodate audio filter in your script.

Thank you for the ffmpeg script. I'm not really familiar using it at the command line level, but it seems to be a really powerful tools that open up a lot of tweaking and possibilities.
Quote
27th Jan 2017 01:17 #11
hech54

View Profile

View Forum Posts

Private Message
Member

Join Date
Jul 2001

Location
Yank in Europe
I use BeSweet and BeSweetGUI to give me 6 wave files.

Quote
27th Jan 2017 05:38 #12
pandy

View Profile

View Forum Posts

Private Message
Member

Join Date
Sep 2008
Originally Posted by hokkom

Thank you for the ffmpeg script. I'm not really familiar using it at the command line level, but it seems to be a really powerful tools that open up a lot of tweaking and possibilities.

Just create text file with extension bat or cmd and copy provided example - after this you can simply drag and drop file on it (assumption is we talking about Windows OS family), of course you must have ffmpeg executable (to avoid hassle with folders in same place where your script is located).
And yes, ffmpeg is quite powerful so i always highly recommend to spent some time on learning how to use it.

Quote
28th Jan 2017 16:41 #13
hokkom

View Profile

View Forum Posts

Private Message
Member

Join Date
May 2008
Originally Posted by pandy

Originally Posted by hokkom

Thank you for the ffmpeg script. I'm not really familiar using it at the command line level, but it seems to be a really powerful tools that open up a lot of tweaking and possibilities.

Just create text file with extension bat or cmd and copy provided example - after this you can simply drag and drop file on it (assumption is we talking about Windows OS family), of course you must have ffmpeg executable (to avoid hassle with folders in same place where your script is located).
And yes, ffmpeg is quite powerful so i always highly recommend to spent some time on learning how to use it.

Thank you for that suggestion, I didn't know how to work with ffmpeg at the command line level, but your suggestion will make it much easier.

Just to be certain that I got the procedure correct. So the bat and video file should be dropped into the same folder as where ffmpeg is installed? Is this correct?

Quote
29th Jan 2017 04:51 #14
pandy

View Profile

View Forum Posts

Private Message
Member

Join Date
Sep 2008
Originally Posted by hokkom

Thank you for that suggestion, I didn't know how to work with ffmpeg at the command line level, but your suggestion will make it much easier.

Just to be certain that I got the procedure correct. So the bat and video file should be dropped into the same folder as where ffmpeg is installed? Is this correct?

Generally yes, it is way more easier to keep all files in same folder. Alternatively you may try to use some GUI for ffmpeg.
https://github.com/amiaopensource/ffmpeg-amia-wiki/wiki/3%29-Graphical-User-Interface-...s-using-FFmpeg
https://sourceforge.net/projects/ffmpegyag/

Quote
29th Jan 2017 21:30 #15
hokkom

View Profile

View Forum Posts

Private Message
Member

Join Date
May 2008
Originally Posted by pandy

Originally Posted by hokkom

Thank you for that suggestion, I didn't know how to work with ffmpeg at the command line level, but your suggestion will make it much easier.

Just to be certain that I got the procedure correct. So the bat and video file should be dropped into the same folder as where ffmpeg is installed? Is this correct?

Generally yes, it is way more easier to keep all files in same folder. Alternatively you may try to use some GUI for ffmpeg.
https://github.com/amiaopensource/ffmpeg-amia-wiki/wiki/3%29-Graphical-User-Interface-...s-using-FFmpeg
https://sourceforge.net/projects/ffmpegyag/

Thanks for confirming the info and for the gui links. I think I will feel better experimenting with ffmpeg now.

Quote
30th Jan 2017 05:17 #16
awgie

View Profile

View Forum Posts

Private Message
Member

Join Date
Sep 2008

Location
Lanarkshire, Scotland
If you're using Audacity, you can install the ffmpeg import/export library and load the original 5.1 audio and tweak the individual channels. Then you can either export it back to 5.1 audio, or you can save it as a stereo audio file using whichever format is best matched for your final video.

Do or do not. There is no "try." - Yoda

Quote
3rd Feb 2017 21:32 #17
hello_hello

View Profile

View Forum Posts

Private Message
Member

Join Date
Mar 2012
If you want to try it out first, here's the method i use for applying compression on playback. You can load the same WinAMP DSPs directly with Potplayer.
https://forum.videohelp.com/threads/380744-DTS-to-AAC-using-Nero-AAC-encoder-command-li...=1#post2462879

Originally Posted by hech54

Originally Posted by hokkom

I have broken down the audio to 2 channels stereo, but the dialogue is really low in sound while the music and explosions are really loud. I wanted to increase the gain of the soft passages while keeping the loud one the same.

If the original audio is 5.1, going straight to 2.0 is not going to help(obviously). 5.1 to 6 waves is the only way that YOU can control the downmix.

Well.... maybe with the exception of downmix methods that let you control the downmix. The Matrix Mixer DSP I use with foobar2000.

hokkom,
If I was going to compress when downmixing, which I do now and then when converting audio specifically for a video that'll be watched using the TV's media player once, then deleted, (I only keep the uncompressed original) I'd do it the same way I do it on playback on the PC. I've created a foobar2000 conversion preset to downmix to stereo, compress and then encode with QAAC, because QAAC has an option to normalise the over-all volume so the peaks are at maximum.

It'll take a bit of setting up initially (although I can upload my foobar2000 configuration files if it helps) but when it's done, you can simply load a mutichannel file into a playlist, right click and select the conversion preset, and out comes a downmixed, compressed stereo file a few minutes later. Easy.... once it's set up.

Here's some old sample files I uploaded previously. A stereo downmixed version and a few compressed versions, all normalised to the same volume. The idea is to listen to the difference in volume between the speech and the action (gunshots) that follow, or ideally, the lack of difference in volume.

In the zip file:
1. Downmixed to stereo, no compression.
2. Compressed with RockSteady (Wimanp plugin loaded into foobar2000).
3. Compressed with LoudMax (Wimanp plugin loaded into foobar2000). It's not as good as I expected but I just threw that one in. It's probably too compressed. I haven't played with LoudMax much and I'm sure it'll do better.
4. Compressed with foobar2000's EBU R128 Normalizer DSP.

https://forum.videohelp.com/attachments/38927-1476630905/Compression%20Samples.zip

I'll try to add another sample using the Dynamic Audio Normaliser pandy mentioned later, once I get it working properly in foobar2000, and maybe a better LoudMax example.

I guess the advantage of pandy's method is you can do everything in one go. For my foobar2000 method you need to extract the audio yourself, convert it and remux. The latter doesn't bother me though as much of the time I'm re-encoding the video via Avisynth and/or remuxing anyway. AnotherGUI is another GUI I'd recommend trying with ffmpeg too.

Last edited by hello_hello; 3rd Feb 2017 at 21:48.

Quote
3rd Feb 2017 21:43 #18
netmask56

View Profile

View Forum Posts

Private Message
Member

Join Date
Sep 2005

Location
Sydney, Australia
If you compress the dynamic range of the total sound track ie Lt+Ct+Rt+LRt+RRt then you just make matters worse. You end up with a louder sound track but the ratio of desired sound to undesired sound much closer. You really need to compress only the dialog and then lift it a tad.
It's such a pity there isn't a control on the average A/V amp that gives the user some control over the balance between the centre channel and the rest. My Yamaha has a dialog lift and height control, though subtle does help.

SONY 75" Full array 200Hz LED TV, Yamaha A1070 amp, Zidoo UHD3000, BeyonWiz PVR V2 (Enigma2 clone), Chromecast, Windows 11 Professional, QNAP NAS TS851

Quote
4th Feb 2017 00:06 #19
awgie

View Profile

View Forum Posts

Private Message
Member

Join Date
Sep 2008

Location
Lanarkshire, Scotland
Originally Posted by netmask56

It's such a pity there isn't a control on the average A/V amp that gives the user some control over the balance between the centre channel and the rest. My Yamaha has a dialog lift and height control, though subtle does help.

You're implying that there isn't such a control on the average amp? I buy the least expensive unit I can get while still getting a disc player (I've never paid more than $200 for a system), and every surround system I've ever used has had the option to manually adjust the levels of each speaker individually. It's pretty much an essential feature for any surround system, since the proper balance depends entirely on the room and speaker placement.

But the ability to control the speaker levels on an amp is entirely irrelevant to the OP's problems with mixing 5.1 channels down to stereo using computer software.

Do or do not. There is no "try." - Yoda

Quote
4th Feb 2017 00:30 #20
netmask56

View Profile

View Forum Posts

Private Message
Member

Join Date
Sep 2005

Location
Sydney, Australia
What you are describing is the global settings for the balance between speakers in a typical setup and once set up ideally should be left alone - but that is not the way to control individual listening of an individual Blu ray or DVD etc title. Mixing down to stereo of course you really need to have access, either software emulated or hardware of a multi-track mixer to be able to accurately control the mix down.

In laypersons terms even on a stereo amp having a centre channel "volume knob" would be a boon to rebalance 5 or 7 channel material. No all that hard to implement at the manufacturing stage but unlikely to happen.

SONY 75" Full array 200Hz LED TV, Yamaha A1070 amp, Zidoo UHD3000, BeyonWiz PVR V2 (Enigma2 clone), Chromecast, Windows 11 Professional, QNAP NAS TS851

Quote
4th Feb 2017 00:46 #21
hokkom

View Profile

View Forum Posts

Private Message
Member

Join Date
May 2008
Thanks guys.

I have been tweaking the audio with various plugins, and it's actually pretty good emphasizing speech while trying to keep the music and explosions the same as much as possible.

Quote
4th Feb 2017 03:55 #22
hello_hello

View Profile

View Forum Posts

Private Message
Member

Join Date
Mar 2012
Originally Posted by netmask56

If you compress the dynamic range of the total sound track ie Lt+Ct+Rt+LRt+RRt then you just make matters worse. You end up with a louder sound track but the ratio of desired sound to undesired sound much closer. You really need to compress only the dialog and then lift it a tad.

I'd have to disagree completely. The problem generally isn't that dialogue is too dynamic, it's that everything else is. If the dialogue isn't all that dynamic, compressing it won't help much. Might as well just turn it up a bit if you can.

It's not even a case of undesired vs desired sound, it's a case of undesired dynamic range vs desired dynamic range.

I do Lt+Ct+Rt+LRt+RRt then compress all the time and it definitely doesn't make things worse. Try the samples in the zip file I posted. It's less than 10MB. In the uncompressed (1st) sample there's normal volume speech followed by loud gunshots and sirens etc. In the Rocksteady (2nd) sample, which is the compression I use on playback, the gunshots and sirens aren't any louder but the dialogue at the beginning is. That's the object of the exercise. You still want to hear everything. You just don't want to be straining to hear something one minute and have your ears bleed the next.

Quote
4th Feb 2017 23:59 #23
davexnet

View Profile

View Forum Posts

Private Message
Member

Join Date
Mar 2008

Location
United States
It's been a while, but for something quick and dirty I used to use the mixer in FFdshow.
Something like avisynth/directshowsource/virtualdub ... save WAV

Quote
5th Feb 2017 13:47 #24
transporterfan

View Profile

View Forum Posts

Private Message
Member

Join Date
Feb 2011
Just as quick if it's being downmixed to stereo is create the stereo file, load it into OcenAudio, play with the 31 band graphic equalizer. Job done.

Quote
5th Feb 2017 17:14 #25
Ealdric-1379

View Profile

View Forum Posts

Private Message
Member

Join Date
Jun 2006

Location
United States
A related question is: Why do the video creators do this in the first place?

Quote
5th Feb 2017 17:27 #26
JVRaines

View Profile

View Forum Posts

Private Message
Member

Join Date
Aug 2010

Location
San Francisco, California
It's called "art." For some reason, they think you want to pay complete attention to their movie and feel like you are there, in the scene, with the helicopters and the tanks and the gunshots and the John Williams orchestra blaring its little heart out.

Quote
5th Feb 2017 17:45 #27
netmask56

View Profile

View Forum Posts

Private Message
Member

Join Date
Sep 2005

Location
Sydney, Australia
A great deal of the problem is due to the different environment that the final mix is done - movies are mixed in purpose designed studios with acoustics that hopefully come close to the cinema experience. This is very different to the home environment where the listener/viewer is contending with many extra external noises, like passing traffic, air traffic, kids screaming, preparing meals etc.
Ideally a different mix down for domestic conditions ought to be done but that all adds to production cost. When I worked as a TV sound mixer, time of broadcast was a factor as to how you mixed. If it was scheduled for play between 1700 and 2000 dialog was king over everything else. Doing film mixes was a totally different ball game - monitoring at much higher levels, artistic considerations and realism took over. Really A/V manufacturers should cater for this by allowing the user easy control over the balance of the audio over and above the normal global settings for speaker balance for the system and different speaker types.

SONY 75" Full array 200Hz LED TV, Yamaha A1070 amp, Zidoo UHD3000, BeyonWiz PVR V2 (Enigma2 clone), Chromecast, Windows 11 Professional, QNAP NAS TS851

Quote
5th Feb 2017 18:11 #28
awgie

View Profile

View Forum Posts

Private Message
Member

Join Date
Sep 2008

Location
Lanarkshire, Scotland
Originally Posted by netmask56

Really A/V manufacturers should cater for this by allowing the user easy control over the balance of the audio over and above the normal global settings for speaker balance for the system and different speaker types.

It's not nearly as simple on the A/V hardware end as it would be if the audio were remixed in the studio for a home theatre environment. If the studio always put only the dialogue in the center channel, and nowhere else, it would be simple, because the listener could simply boost that channel.

But unfortunately, the reality of it is much different. Dialogue ends up in all 5 channels (maybe even the LFE channel, too, if you're watching a movie starring Michael Clarke Duncan). The end user doesn't get a separate 5.1 soundtrack with just the dialogue that they can make louder or softer to suit their taste.

No matter how good the A/V hardware is, for the end user to do it at home means a compromise. Even your Yamaha system, with its "Dialogue Level" adjustment, is based on a typical set of frequencies and characteristics normally found in dialogue. But if the actual dialogue ventures outside the range of what is typical (such as someone with an especially deep or high-pitched voice), then you begin to lose the desired effect, since some of the dialogue will end up not being boosted. And any of the non-dialogue sounds that fall within that range will be boosted, even though you don't want them to be.

But regardless of whether it's done in the studio or on the hardware end, as you said, it all adds to the cost. And adding to the cost is prohibitive to sales, so it's not going to happen.

Do or do not. There is no "try." - Yoda

Quote
6th Feb 2017 23:10 #29
hello_hello

View Profile

View Forum Posts

Private Message
Member

Join Date
Mar 2012
I was bored so I created some new samples using different compression methods. They all compress reasonably well. First I downmixed to a stereo 32 bit float wave file like this, with Matrix Mixer's "Normalise Matrix" option enabled to also reduce the overall volume enough to prevent clipping when downmixing. I never include the LFE channel when downmixing, but doubly-so when compressing as it can interfere with the compression too much.

From there I used the following steps:
- Scanned the output file and converted it to flac while adjusting the volume to 83dB in ReplayGain speak (EBU R128 Scanning).
- Used the flac file to convert to 32 bit wave files while applying the various compression methods.
- Scanned the wave files and converted to AAC while adjusting the volume to 83dB in ReplayGain speak (EBU R128 Scanning).

That all seemed to work fine except for using the LoudMax compressor. It seems to skew the ReplayGain result for some reason. I left it as it is, and the ReplaqyGain volume is 83dB like all the rest, but it sounds 3 or 4 dB quieter than the others to me. I'll have to investigate that further.

1 - Source FLAC file at 83dB in ReplayGain speak (EBU R128 Scanning).
2 - Compressed with the Dynamic Audio Normalizer with -f 150 in the command line, then adjusted to 83dB (ReplayGain).
3 - Compressed with the foobar2000 EBU R128 Compressor DSP (R128Norm), then adjusted to 83dB (ReplayGain). It has no settings to configure.
4 - Compressed with the VST Version of the LoudMax plugin, threshold set to -18dB, then adjusted to 83dB (ReplayGain).
5 - Compressed with the WinAmp RockSteady plugin, settings in the screenshot here, then adjusted to 83dB (ReplayGain)

Nothing exciting to report in the end, aside from what seems like a LoudMax/ReplayGain/R128 scanning anomaly. They all compress. Which you prefer might be personal preference and the settings used. I still think they improve the dialogue volume and none of them make the problem worse.

Attached Files

Source.flac (17.55 MB, 594 views)

DynAudioNorm.m4a (4.82 MB, 518 views)

EBU R128.m4a (4.84 MB, 524 views)

LoudMax.m4a (4.77 MB, 501 views)

RockSteady.m4a (4.83 MB, 553 views)
Last edited by hello_hello; 7th Feb 2017 at 17:02.
Quote
18th Apr 2017 02:37 #30
rowjekto

View Profile

View Forum Posts

Private Message
Member

Join Date
Apr 2017
Originally Posted by hokkom

What I would like to know is what is the best settings or a good starting place for doing this for audacity.

Here is a suggestion I've found for Audacity's Compressor settings:

- Threshold: -30.0 dB
- Noise Floor: -50.0 dB
- Ratio: 4:1
- Attack Time: 0.3 sec.
- Release Time: 3.0 sec.
- "Make-up gain for 0dB after compressing": enabled

The result is OK, but the loud parts are still a bit too loud imo.
Any suggestions for improvement?

Quote

Movie music too loud but dialogue too quiet fix needed

Thread Tools

Similar Threads

Long, royalty-free calming piano music needed

Event Pan/Crop Dialogue Box Tab, revert to Dialogue Box?

Dialogue Volume in LCD TV is low while action/music volume is high

free, basic video editor needed to fix light

Video and remote controllable music for a drummer ADVICE NEEDED