1, Hadoop. Hadoop is a software framework capable of distributed processing of large amounts of data.
2, HPCC High Performance Computing and Communications.
3, Storm. Storm is free open source software, a distributed, fault-tolerant real-time computing system.
4, Apache Drill. In order to help business users to find more effective, accelerate the Hadoop data query method, the Apache Software Foundation recently launched an open source project called "Drill".
5, RapidMiner. RapidMiner is the world's leading data mining solution to a very large extent with advanced technology. It data mining task involves a wide range of data , including a variety of data art , can simplify the design and evaluation of the data mining process .
6, Pentaho BI. Pentaho BI platform is different from traditional BI products, it is a process-centered, solution-oriented (Solution) framework.