The difference between Python crawler and test
The basic process of a crawler

Initiate a request

Send a request to the target site through an HTTP library. The request can carry additional information such as headers. Then wait for the server to respond.
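
As a minimal sketch of this step, the code below uses Python's standard urllib.request module to build a request with extra headers. The function names build_request and fetch, the URL, and the User-Agent value are hypothetical and chosen only for illustration. Any HTTP library could be used instead.

```python
import urllib.request


def build_request(url, user_agent="example-crawler/0.1"):
    """Build a GET request that carries additional header information."""
    # The header values here are illustrative assumptions, not required ones.
    headers = {"User-Agent": user_agent, "Accept": "text/html"}
    return urllib.request.Request(url, headers=headers)


def fetch(request, timeout=10):
    """Send the request, wait for the server to respond, return the raw bytes."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read()
```

Calling fetch(build_request("https://example.com/")) would send the request over the network and block until the server answers or the timeout expires.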

Get the response content

If the server responds normally, you get a response whose body is the content of the requested page. It may be HTML, a JSON string, binary data (such as images or video), or another type.
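
One way to tell these content types apart is to look at the status code and the Content-Type header of the response. The sketch below assumes that a status of 200 means a normal response and that the server sets Content-Type accurately. The function name classify_response and its return labels are hypothetical.

```python
def classify_response(status, content_type):
    """Return a rough label for the kind of content a response carries."""
    if status != 200:
        # Assumption: anything other than 200 is treated as not normal.
        return "error"
    # Drop parameters such as "; charset=utf-8" before comparing.
    media_type = content_type.split(";")[0].strip().lower()
    if media_type == "text/html":
        return "html"
    if media_type == "application/json":
        return "json"
    if media_type.startswith(("image/", "video/")):
        return "binary"
    return "other"
```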

Parse content

The content may be HTML, which can be parsed with regular expressions or an HTML parsing library; JSON, which can be converted directly into a JSON object; or binary data, which can be saved or processed further.
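
The sketch below shows each parsing approach using only the Python standard library: html.parser as the page parsing library, re for a regular expression, and json for JSON strings. The class and function names (LinkCollector, parse_links, parse_title, parse_json) are hypothetical, and the title regular expression only handles a simple title tag without attributes.

```python
import json
import re
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect the href value of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def parse_links(html_text):
    """Parse HTML with a parsing library and return all link targets."""
    parser = LinkCollector()
    parser.feed(html_text)
    return parser.links


def parse_title(html_text):
    """Extract the page title with a regular expression, or None if absent."""
    match = re.search(r"<title>(.*?)</title>", html_text, re.IGNORECASE | re.DOTALL)
    return match.group(1).strip() if match else None


def parse_json(text):
    """Convert a JSON string directly into Python objects."""
    return json.loads(text)
```

Regular expressions are convenient for small, predictable fragments, while a parser is usually the safer choice for whole pages with nested or irregular markup.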

Save data

Data can be saved in many forms: as plain text, in a database, or as a file in a specific format.
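
The three forms of saving could look like the sketch below, which writes plain text, a CSV file as an example of a specific format, and a SQLite database as an example of a database. The function names, the url and title columns, and the pages table are hypothetical choices for illustration.

```python
import csv
import sqlite3


def save_text(path, text):
    """Save content as a plain text file."""
    with open(path, "w", encoding="utf-8") as f:
        f.write(text)


def save_csv(path, rows):
    """Save (url, title) rows as a CSV file with a header row."""
    with open(path, "w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f)
        writer.writerow(["url", "title"])
        writer.writerows(rows)


def save_sqlite(db_path, rows):
    """Save (url, title) rows to a SQLite database, replacing duplicate URLs."""
    conn = sqlite3.connect(db_path)
    try:
        conn.execute(
            "CREATE TABLE IF NOT EXISTS pages (url TEXT PRIMARY KEY, title TEXT)"
        )
        conn.executemany(
            "INSERT OR REPLACE INTO pages (url, title) VALUES (?, ?)", rows
        )
        conn.commit()
    finally:
        conn.close()
```

Binary data such as images would be written with open(path, "wb") instead of a text mode.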