This project addresses automatic recognition of the textual content of speech signals. It is a complex task with wide application in various technological and social areas. Solving it depends on choosing the right features to extract from speech and the right structure for the classification method. Mel spectrograms were selected as the input features for training the SNN models because of their efficiency.
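To make the feature choice concrete, the following is a minimal sketch of how a log-Mel spectrogram can be computed from a raw waveform using NumPy only. It is not the project's actual pipeline: the sampling rate (16 kHz), FFT size (512), hop length (160 samples), number of Mel bands (40), the Hann window, and the HTK-style Mel formula are all illustrative assumptions, and the function names are hypothetical.

```python
import numpy as np

def hz_to_mel(hz):
    # HTK-style Mel scale (an assumption; other variants exist)
    return 2595.0 * np.log10(1.0 + np.asarray(hz, dtype=float) / 700.0)

def mel_to_hz(mel):
    # Inverse of hz_to_mel
    return 700.0 * (10.0 ** (np.asarray(mel, dtype=float) / 2595.0) - 1.0)

def mel_filterbank(sr, n_fft, n_mels):
    # n_mels + 2 edge frequencies, equally spaced on the Mel scale up to Nyquist
    hz_points = mel_to_hz(np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2.0), n_mels + 2))
    fft_freqs = np.fft.rfftfreq(n_fft, d=1.0 / sr)
    fb = np.zeros((n_mels, fft_freqs.size))
    for m in range(n_mels):
        left, center, right = hz_points[m], hz_points[m + 1], hz_points[m + 2]
        rising = (fft_freqs - left) / (center - left)
        falling = (right - fft_freqs) / (right - center)
        # Triangular filter: zero outside [left, right], peak of 1 at center
        fb[m] = np.clip(np.minimum(rising, falling), 0.0, None)
    return fb

def mel_spectrogram(signal, sr=16000, n_fft=512, hop=160, n_mels=40):
    # All default parameter values are illustrative assumptions
    signal = np.asarray(signal, dtype=float)
    if signal.size < n_fft:
        signal = np.pad(signal, (0, n_fft - signal.size))
    # Slice the signal into overlapping frames and apply a Hann window
    n_frames = 1 + (signal.size - n_fft) // hop
    idx = np.arange(n_fft)[None, :] + hop * np.arange(n_frames)[:, None]
    frames = signal[idx] * np.hanning(n_fft)
    # Power spectrum per frame, then projection onto the Mel filterbank
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2
    mel_power = power @ mel_filterbank(sr, n_fft, n_mels).T
    # Log compression in dB; small offset avoids log(0)
    return 10.0 * np.log10(mel_power + 1e-10).T  # shape: (n_mels, n_frames)

# Usage: one second of a 440 Hz tone sampled at 16 kHz
t = np.arange(16000) / 16000.0
spec = mel_spectrogram(np.sin(2 * np.pi * 440.0 * t))
print(spec.shape)  # (40, 97)
```

The resulting matrix has one row per Mel band and one column per time frame, which is the kind of compact time-frequency representation that can be passed to a classifier. In practice a library routine such as `librosa.feature.melspectrogram` would likely be used instead of hand-written code.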