1. About Java
Java is a programming language, the realization of the same needs of hundreds of programming languages can be completed, Java for big data, is a tool.
2. About Big Data
Big Data is an industry, to achieve the same demand for a variety of tools to choose from, a little narrower to the technical point of view, all kinds of frameworks have Hadoop, spark, storm, flink, etc., in terms of this kind of technology ecosystem, there are a variety of middleware such as flume, kafka, sqoop, etc., most of these frameworks and tools are the same. etc. Most of these frameworks and tools are written in Java, but provide APIs for programming in various languages such as Java, scala, Python, R, etc.
So, the internship in big data requires the use of Java, but Java is not big data.
Big Data is the development of the Internet to the present stage of an appearance or characteristics only, there is no need to myth it or to keep in awe of it, in the cloud computing as the representative of the technological innovation of the curtain of the backdrop, these originally difficult to collect and use the data began to be easy to be utilized through the continuous innovation of all walks of life, Big Data will gradually create more value for mankind.
The industry (IBM's earliest definition) characterizes big data as four "V" (Volume, Variety, Value, Velocity), or four levels of characteristics: First, the volume of data is huge. For example, web logs, videos, images, geolocation information, and so on. Third, low value density and high commercial value. Fourth, fast processing speed. This last point is also fundamentally different from traditional data mining techniques.