Only 4%-20% of the pages in the Internet are visible, and 96% of the pages are hidden in the depths. This point is mentioned in what is the difference between deep net and dark net. In fact, the main reason is that when searching a web page, search engines such as Google will check the first file named robots.txt without this file, which means that only the information in this web page can be indexed without password protection.
1、Pipl
Robots file set by Pipl can interact with many deep web databases, so it can index deep resources such as publications, court records and personal data.
2. My life
There are about 225 million public pages in my life, which contain a lot of personal information, such as email address, family information, telephone number, home address and so on. Even the place where you have lived can be found, and there is more than 18 information about American citizens.
used at the end of a sentence
Yippy mainly uses other search engines to get the result information, but in particular, it will not leave any web browsing records, including checking emails or contract terms.
Surfwax
Surfwax has many other functions, not simple direct search. Among them, the focus word function can independently set the search range, identify other related contents, and display the time required for retrieval, thus providing the best search results more appropriately.
5. Classic machine
This is the front end of an Internet archive with information of 100T, which can only be accessed through URL. But the loop machine allows the public to upload data, but most of the data are retrieved by reptiles, including 654.38+05 billion pieces of crawling information.
6. Google Academic
This is a web page that allows access to academic documents, publications and other academic materials. You can search by keywords, or you can equip yourself with Google Academic, and you can automatically access journals and databases when searching directly.