Current location - Education and Training Encyclopedia - Graduation thesis - Zip compression principle and implementation
Zip compression principle and implementation
Principle of file compression

Most of the things we do with computers are processing files. Each file will occupy a certain amount of disk space. We hope that some files, especially those that are temporarily unused but important and cannot be deleted (such as backup files, which are a bit like chicken ribs), will occupy as little disk space as possible. But the storage formats of many files are loose, which wastes some valuable computer storage resources. At this time, we can use compression tools to solve this problem. By compressing the original file, you can save it with less disk space and decompress it when necessary, which greatly saves disk space. When you want to copy many small files, compression can improve the execution efficiency. If there are many small files, it will take a lot of time for the operating system to perform frequent file location operations. It is very convenient to copy these small files if they are compressed into one compressed file first. Because the information processed by computer is expressed in the form of binary numbers, compression software marks the same string in binary information with special characters to achieve the purpose of compression. To help you understand file compression, please imagine a picture of blue sky and white clouds in your mind. For thousands of monotonous blue pixels, instead of defining a long list of colors "blue, blue, blue ……" one by one, it is better to tell the computer that "storing117 blue pixels from this location" is more concise and can greatly save storage space. This is a very simple example of image compression. In fact, in the final analysis, all computer files are stored in the form of "1" and "0". Just like blue pixels, through reasonable mathematical calculation formula, the volume of files can be greatly reduced, and the effect of "lossless and dense data" can be achieved. Generally speaking, compression can be divided into lossy compression and lossless compression. If the loss of individual data will not have much impact, it is a good idea to ignore them, which is lossy compression. Lossy compression is widely used in animation, sound and image files, and the typical examples are mpeg, mp3 and jpg. But in more cases, the compressed data must be accurate, so people have designed lossless compression formats, such as common zip and rar. Compression software is naturally a tool to compress data by using compression principle. The file generated after compression is called archive, and its volume is only a fraction or even smaller. Of course, the compressed package is already another file format. If you want to use the data in it, you must first restore the data with compression software. This process is called decompression. Common compression software includes winzip, winrar, etc.