I hope you know that in the niquist theorem you cannot expect very useful results from what you are trying to achieve.
That is, in addition , you only focus on low frequencies . In this case, you can use the low pass filter first. It is almost impossible to understand voices with olny frequencies below 500 Hz . It is commonly said that speech requires 3 kHz , which provides a sampling frequency of 6000 .
For an example of what you should expect, try something similar to:
ffmpeg -i tst.mp3 -ar 1000 tst.wav
using, for example, some vocals and listen to the result . However, you can probably reach an acceptable compromise using, for example, a sample rate of 3000.
An alternative would be to do some compression on the fly, as @manishg suggested. Since smartphones can do real-time video compression these days, this should be fully feasible with iPhone Hard and Software. But this is a completely different thing than lowering the sampling rate .
source share