Language needed for big data
Java
java can be said to be the most basic programming language for big data, according to my experience over the years, I have contacted a large part of the big data development from Jave Web development to transfer over (of course, it is not absolute, I've even seen the product to transfer to the development of big data, contrary to the sky).
One is because the nature of big data is nothing more than the calculation of massive amounts of data, query and storage, background development is very easy to access the application of large data access scenarios
Two is the ability of the java language, a natural advantage, because many big data components are developed using java like HDFS, Yarn, Hbase, MR, Zookeeper and so on, you want to use java as a tool for the development of big data, but it is not the only thing that you can do. Zookeeper and so on, want to learn in depth, fill in the production environment stepped on a variety of pits, you must first learn java and then go to chew the source code.
Speaking of gnawing on the source code by the way, the beginning of the time will certainly be very difficult, the need for components themselves and the development of language have a more in-depth understanding, practice makes perfect slowly, and so you have passed this stage, accustomed to looking at the source code to solve problems when you will find that the source code is really fragrant.
Scala
scala and java are very similar to the jvm running language, in the development process can be seamlessly called each other. scala in the field of big data most of the influence is from the community of the star Spark and kafka, these two things should be known to everyone (later I will have an article multi-dimensional introduction to them), their strong development directly led to the development of a large number of big data. Their strong growth has directly led to the popularity of Scala in this area.
Python and Shell
Shell should not need much introduction is very common, belonging to the program ape necessary general skills. python is more used in the field of data mining and write some complex and shell difficult to achieve the daily script.