Frequency and step are not the same thing: frequency is a physical quantity, step is a psychological perception - they are similar, but there are important differences that may or may not matter to you, depending on the type of instrument you are trying to measure step.
You need to read a little about the various algorithms for determining the pitch (and the meaning of the tone itself), decide which algorithm you want to use, and only then proceed with its implementation. See this Wikipedia page for a good overview of pitch and pitch (note that you can use FFT for methods based on autocorrelation and frequency domain).
Regarding the use of FFT for identifying peaks in the spectrum and associated frequencies, there are many questions and answers related to this, see, for example: How to get the frequencies of each value in the FFT?
source share