When training deep networks, scheduling the learning rate matters a great deal. Exponential learning-rate decay is the most widely used schedule; its learning rate evolves as shown in the figure below:
The red line shows standard exponential decay. The blue line is step decay, which holds the learning rate constant for a stretch before dropping it. The advantages of these decay schedules are fast convergence and simplicity.
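As a sketch of what these two schedules compute, the functions below map a step index to a learning rate. The initial rate, decay factor gamma, and step size are hypothetical values chosen only for illustration, not taken from the text:

```python
def exponential_lr(lr0: float, t: int, gamma: float = 0.95) -> float:
    """Exponential decay: lr_t = lr0 * gamma ** t."""
    return lr0 * gamma ** t

def step_lr(lr0: float, t: int, step_size: int = 10, gamma: float = 0.5) -> float:
    """Step decay: hold the rate for `step_size` steps, then scale it by gamma."""
    return lr0 * gamma ** (t // step_size)

# Example: starting from lr0 = 0.1, compare both schedules over 30 steps.
for t in range(0, 30, 10):
    print(t, exponential_lr(0.1, t), step_lr(0.1, t))
```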
Loshchilov and Hutter proposed the cosine annealing strategy. Its simplified version reduces the learning rate from the initial value down to zero along a cosine curve. Assuming the total number of batches is $T$, the learning rate at batch $t$ is given by:

$$\eta_t = \frac{1}{2}\,\eta_0\left(1 + \cos\frac{t\pi}{T}\right)$$

where $\eta_0$ is the initial learning rate.
As the figure shows, cosine decay lowers the learning rate slowly at the start, almost linearly through the middle, and slowly again at the end.
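Translated directly into code, the simplified schedule looks like the sketch below, where lr0 and T stand in for the initial learning rate and the total number of batches:

```python
import math

def cosine_lr(lr0: float, t: int, T: int) -> float:
    """Simplified cosine annealing: lr_t = 0.5 * lr0 * (1 + cos(pi * t / T))."""
    return 0.5 * lr0 * (1.0 + math.cos(math.pi * t / T))

# The rate starts at lr0 (t = 0), passes lr0 / 2 at the midpoint (t = T / 2),
# and reaches 0 at the final batch (t = T).
T = 100
for t in (0, 25, 50, 75, 100):
    print(t, cosine_lr(0.1, t, T))
```

PyTorch ships an equivalent schedule as torch.optim.lr_scheduler.CosineAnnealingLR, whose eta_min argument sets a floor other than zero if the rate should not decay all the way to 0.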