This introduction is mainly based on the appendix of the paper "WaveNet: A Generative Model for Raw Audio". The link is as follows: blogs.com/BaroC/p/4283380.html.
On the neural-network side, the output is typically a 256-way softmax classifier, whose classes correspond to 256 quantized values of the audio sample. Both WaveNet and WaveRNN generate audio this way.
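As a rough sketch of where those 256 classes come from: WaveNet uses μ-law companding to map each raw sample to one of 256 integer codes, which then serve as the softmax targets. The snippet below is a minimal illustration of that standard transform (not code taken from the paper's appendix); function names and the 440 Hz test tone are my own choices.

```python
import numpy as np

def mu_law_encode(audio, quantization_channels=256):
    """Map raw audio in [-1, 1] to integer codes in [0, 255] via mu-law companding."""
    mu = quantization_channels - 1
    # Compress the dynamic range, then quantize uniformly.
    magnitude = np.log1p(mu * np.abs(audio)) / np.log1p(mu)
    signal = np.sign(audio) * magnitude                      # still in [-1, 1]
    return ((signal + 1) / 2 * mu + 0.5).astype(np.int32)    # integers 0..255

def mu_law_decode(codes, quantization_channels=256):
    """Invert the companding: integer codes back to waveform samples in [-1, 1]."""
    mu = quantization_channels - 1
    signal = 2 * (codes.astype(np.float32) / mu) - 1
    return np.sign(signal) * np.expm1(np.abs(signal) * np.log1p(mu)) / mu

# Example: one second of a 440 Hz tone sampled at 16 kHz.
t = np.linspace(0, 1, 16000, endpoint=False)
wave = 0.5 * np.sin(2 * np.pi * 440 * t)
codes = mu_law_encode(wave)        # 256-class targets for the softmax
recovered = mu_law_decode(codes)   # approximate reconstruction of the waveform
```

At generation time the model predicts a distribution over these 256 codes for the next sample, a code is drawn from it, and decoding it back yields the waveform value.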
The following are some materials I used to study speech synthesis. Stanford CS224S is highly recommended, although the logic of its handouts is not very clear, so they take repeated reading to understand.
The UCSB digital speech processing course covers the basics of audio signal processing; I suggest taking a look. The link is as follows: /view/68fbf1a4f61fb7360b4c658b.html