Current location - Education and Training Encyclopedia - Education and training - Does java high concurrency have anything to do with big data?
Does java high concurrency have anything to do with big data?
The wave of big data has swept the IT world. Nowadays, not only the Internet, but also various shopping and financial platforms are paying more and more attention to big data. So what is the big data we have been talking about? Why do you always mention Java when it comes to big data? What is the relationship between Java and big data? Lan Ou Guangzhou Java Training Institute tells you that Java is not big data!

1 About Java

Java is a programming language, and there are hundreds of programming languages to meet the same requirements. Java is just a tool for big data.

2. About big data

Big data is an industry, and there are many tools to choose from for the same demand. In a narrow sense, from a technical point of view, there are various frameworks such as Hadoop, spark, storm and flink. As far as this kind of technology ecology is concerned, there are various middleware such as flume, kafka and sqoop. Most of these frameworks and tools are written in Java, but they provide various services such as Java, scala, Python and R.

So the internship of big data needs Java, but Java is not big data.

Big data is just a representation or feature of the development of the Internet at this stage. There is no need to myth it or keep it in awe. Under the background of technological innovation represented by cloud computing, these data that were originally difficult to collect and use began to be easily used. Through continuous innovation in all walks of life, big data will gradually create more value for mankind.

The industry (first defined by IBM) divides the characteristics of big data into four "V" (quantity, type, value and speed), or there are four levels of characteristics: First, the amount of data is huge. The starting unit of big data measurement is at least P( 1000 t), E (1 100t) or Z (1 100t); Second, there are many data types. Such as blogs, videos, pictures, geographic information and so on. Third, the value density is low and the commercial value is high. Fourth, the processing speed is fast. This last point is also fundamentally different from the traditional data mining technology.