I tested Pocket TTS from Kyutai. It’s fun to use for generating French sentences: the accent is strong, but it’s still perfectly understandable, even though French doesn’t seem to be in the training dataset.
Beyond that, the model is quite impressive, especially given that it has only 100M parameters.