This is probably a very stupid question, but I could not find details anywhere.
So, I have a sound recording (wav file), the length of which is 3 seconds. This is my sample and should be classified as [class_A] or [class_B].
Following some mutation in MFCC, I divided the sample into frames (more precisely, 291 frames), and I got MFCC from each frame.
Now I have 291 feature vectors, the length of each vector is 13.
My question is: how exactly do you use these vectors with a classifier (e.g. k-NN)? I have 291 vectors that represent 1 pattern. I know how to work with 1 vector for 1 sample, but I do not know what to do if I have 291 of them. I could not find an explanation.
source share