Current location - Education and Training Encyclopedia - Education and training - What are the commonly used big data tools?
What are the commonly used big data tools?
Science and Technology Rubik's Cube is a big data model platform and a tool platform for data analysis and mining based on service bus and distributed cloud computing. It uses a distributed file system to store data and supports the processing of massive data. Adopt a variety of data acquisition technologies to support the collection of structured data and unstructured data. Through the graphical model building tool, it supports process model configuration. Through the third-party plug-in technology, other tools and services can be easily integrated into the platform. Data analysis and judgment platform is a process of collecting massive information, establishing data model, mining and analyzing data, and finally forming knowledge service actual combat and decision-making. The platform mainly includes data acquisition part, model configuration part, model execution part and achievement display part.

Bee's network information radar is a product that collects network information directionally. It can collect and update website data set by users, achieve flexible network data collection goals, and provide a basis for Internet data analysis.

Untouchi technology pump station is a data extraction tool of big data platform, which realizes the function of importing data from db to hdfs. With the help of Hadoop, it can provide efficient cluster distributed parallel processing capability, and can extract db data to hdfs file system in parallel in batches through database partition, field partition and paging, which effectively solves the problems of excessive workload and long extraction time in traditional extraction of big data and provides a transmission pipeline for big data warehouses.

The technology cloud computing data center is based on advanced Chinese data processing and massive data support, supplemented by manual services in all links, making the data center run safely and efficiently. According to the different links of cloud computing data center, we have specially equipped system management and maintenance personnel, data processing and compiling personnel, data collection and maintenance personnel, platform system administrators, institutional administrators, public opinion monitoring and analysts to meet the needs of each link. For users, we provide government-oriented and enterprise-oriented solutions.

Science and technology microscope is a big data text mining tool, which refers to extracting valuable information and knowledge from text data by computer processing technology.

Including text classification, text clustering, information extraction, entity recognition, keyword indexing, abstract and so on. Based on Hadoop

The text mining software of MapReduce can realize the mining and analysis of massive texts. An important application field of CKM is intelligent comparison,

Widely used in patent novelty retrieval, scientific and technological novelty retrieval, document copy retrieval, copyright protection, manuscript traceability and other fields.

The scientific and technological data cube to be discovered is a visual relationship mining tool for big data, and its presentation methods include relationship diagram, time axis, analysis diagram, list and other expressions, providing users with all-round information presentation methods.