Automatic speech quality assessment aims to quantify subjective human perception of speech through computational models to reduce the need for labor-consuming manual evaluations. While models based on Deep learning have achieved progress in predicting mean opinion scores (MOS) to asses