The realm of connected devices has seen significant advancements with the integration of voice interaction. This technology empowers users to control their devices seamlessly through spoken commands. At the core of this transformation lies text-to-speech (TTS) synthesis, a crucial component that converts textual information into audible speech. As