Current location - Loan Platform Complete Network - Big data management - What is Big Data? How to mine
What is Big Data? How to mine
Data mining (English: Data mining), also translated as information prospecting, data mining. It is a step in Knowledge-Discovery in Databases (English: Knowledge-Discovery in Databases, abbreviation: KDD). Data mining generally refers to the process of searching for information hidden in a large amount of data through algorithms. Data mining is usually associated with computer science and achieves these goals through a number of methods such as statistics, online analytical processing, intelligence retrieval, machine learning, expert systems (relying on past rules of thumb), and pattern recognition.

The tools used to analyze big data come in two main ecosystems, open source and commercial.

Open source big data ecosystem:

1, Hadoop HDFS, HadoopMapReduce, HBase, Hive gradually born, early Hadoop ecosystem gradually formed.

2. Hypertable is another category. It exists outside the Hadoop ecosystem, but there have been some users.

3. NoSQL, membase, MongoDb

Commercial big data ecosystem:

1. All-in-one database/data warehouse: IBM PureData(Netezza), OracleExadata, SAP Hana, etc..

2, data warehouse: TeradataAsterData, EMC GreenPlum, HPVertica and so on.

3, data marts: QlikView, Tableau, and the domestic REU-BDS Big Data

.