An important fact is that by using various tools, such as MapReduce, Pig and Hive, data can use them based on their built-in functions and actual needs. As for analyzing a large amount of data in Hadoop, Anoop pointed out that generally speaking, in the world of big data /Hadoop, some problems may not be complicated and the solutions are simple, but the challenge lies in the amount of data. In this case, different solutions are needed to solve the problem.
Some analysis tasks are counting the number of cleared IDs from log files, converting stored data within a specific date range, and ranking netizens. All these tasks can be solved by various tools and technologies in Hadoop, such as MapReduce, Hive, Pig, Giraph and Mahout. These tools can flexibly extend their functions with the help of custom routines.