There were other questions and answers on this site suggesting that to create an echo or delay effect, you only need to add each incoming sample to a stored sample from a fixed time in the past. So I have the following Java class:
    public class DelayAMod extends AudioMod {
        private int delay = 500;
        private float decay = 0.1f;
        private boolean feedback = false;
        private int delaySamples;
        private short[] samples;
        private int rrPointer;

        @Override
        public void init() {
            this.setDelay(this.delay);
            this.samples = new short[44100]; // one second of buffer at 44.1 kHz
            this.rrPointer = 0;
        }

        public void setDecay(final float decay) {
            this.decay = Math.max(0.0f, Math.min(decay, 0.99f));
        }

        public void setDelay(final int msDelay) {
            this.delay = msDelay;
            // number of samples in the delay period at 44.1 kHz
            this.delaySamples = 44100 * this.delay / 1000;
            System.out.println("Delay samples:" + this.delaySamples);
        }

        @Override
        public short process(short sample) {
            System.out.println("Got:" + sample);
            if (this.feedback) {
                // mix the delayed sample in and store the wet result,
                // so each echo feeds the next and fades by the decay factor
                sample += (short) (this.samples[this.rrPointer] * this.decay);
                this.samples[this.rrPointer] = sample;
            } else {
                // store the dry input; only a single echo is produced
                short echo = (short) (this.samples[this.rrPointer] * this.decay);
                this.samples[this.rrPointer] = sample;
                sample += echo;
            }
            this.rrPointer = (this.rrPointer + 1) % this.delaySamples;
            return sample;
        }
    }
It takes one 16-bit sample at a time from the input stream, looks up the corresponding earlier sample, and combines the two. However, the output is nothing but awful, noisy static, especially once the decay is raised to a level that would actually produce a noticeable echo. Lowering the decay to 0.01 barely lets the original sound through, but at that point there is no echo at all.
Key troubleshooting facts:
- The audio stream sounds great if this processing step is skipped.
- The audio stream sounds normal if the decay is 0 (nothing is added).
- Stored samples are indeed stored and retrieved in the correct order and at the correct positions.
- Stored samples are attenuated by the decay factor and correctly added to the incoming samples.
- All the numbers, from the call to process() to the returned sample, are exactly what I expect from this algorithm, and remain so even outside this class.
The problem seems to come down to simply adding signed shorts together, and the resulting waveform is an absolute disaster. I have seen this exact method implemented in various places (C#, C++, even on microcontrollers), so why does it fall apart here?
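For what it's worth, the usual hazard in summing signed shorts is overflow: the sum of a sample and its decayed echo can exceed Short.MAX_VALUE and wrap around to a large negative value, which does sound like static. A minimal sketch of overflow-safe mixing (the mix method and its signature are mine, not part of the class above):

    // Widen to int before adding so the sum itself cannot wrap,
    // then clamp to the 16-bit range instead of overflowing.
    public static short mix(short dry, short delayed, float decay) {
        int mixed = dry + Math.round(delayed * decay);
        if (mixed > Short.MAX_VALUE) mixed = Short.MAX_VALUE;
        if (mixed < Short.MIN_VALUE) mixed = Short.MIN_VALUE;
        return (short) mixed;
    }

Clamping distorts only on genuinely over-range peaks instead of flipping sign, so if wrap-around were the cause, the result would at worst sound like mild clipping rather than constant static.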
EDIT: It turns out I was approaching this all wrong. I do not know whether FFmpeg/avconv or some other factor is responsible, but I am not working with a normal PCM stream. After plotting the waveform, running a failed tone-generator test, and analyzing the result, I concluded that this is some variant of differential pulse-code modulation: each value encodes the change from one sample to the next, and halving the supposed "volume" factor on a pure sine wave actually reduces the step size while leaving the volume the same. (Applying a volume multiplier to a non-sine signal produces the same static as this echo algorithm.) Since this and other DSP algorithms are designed for linear pulse-code modulation, I will need some way to get a proper LPCM stream.
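If the stream really is a plain delta encoding, the fix would be to integrate the steps back into linear PCM before any DSP runs. A hypothetical sketch, assuming each 16-bit value is simply the difference from the previous sample (the real stream may use a more elaborate scheme such as ADPCM, in which case this will not be enough):

    public class DpcmDecoder {
        private int accumulator = 0; // running reconstruction of the LPCM value

        // Convert one delta into the next linear PCM sample.
        public short decode(short delta) {
            accumulator += delta; // integrate the step
            // clamp so a corrupt delta cannot wrap to the opposite sign
            if (accumulator > Short.MAX_VALUE) accumulator = Short.MAX_VALUE;
            if (accumulator < Short.MIN_VALUE) accumulator = Short.MIN_VALUE;
            return (short) accumulator;
        }
    }

Decoded this way, the samples are ordinary amplitudes again, and an echo class like the one above should be able to add them without turning the waveform into noise.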