Current location - Education and Training Encyclopedia - Graduation thesis - Java web crawler
Java web crawler
1, Java programming guide for network robots, easy to understand, a bit outdated, but suitable for novices.

2, write a web crawler yourself, you can look at the basics, the writing is a bit messy, a lot of content is unclear, and a lot of code is copied. . .

3, search engine-principle, technology and system, Peking University Skynet as a case, very good and powerful, a little academic flavor.

4, Web data mining Liu Bing, Liu Bing's book, strongly recommended.

5, search engine: information retrieval practice, a good book, strongly recommended.

There are also some papers. Find it yourself.

In the case, you can study part of the code of Nutch crawler and write it clearly.

With the above, it should be an introduction ~