How to check the correctness of millions of generated records for data analytics
For millions of rows, MySQL is the obvious choice, whether the workload leans toward OLTP or OLAP.

For hundreds of millions of rows, an OLTP-oriented workload can stay on MySQL; for an OLAP-oriented workload, the choice depends on the scenario.

Real-time computing scenarios: these emphasize low latency and are common where results are needed as the data arrives; you can choose Storm.

Batch computing scenarios: these emphasize batch processing and are common in data mining and analysis; you can choose Hadoop.

Real-time query scenarios: these emphasize fast query response. The usual approach is to convert the database data into index files and query them through a search engine; you can choose Solr or Elasticsearch.

Enterprise ODS/EDW/data mart scenarios: these emphasize real-time analysis of big data on top of a relational database and are common in business data integration; you can choose Greenplum.
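
The selection rules above can be written down as a small lookup. This is only a sketch of the reasoning in this answer, not a sizing tool: the function name `suggest_engine`, the scenario labels, and the cutoff of 100 million rows are all assumptions made for illustration (the text itself only distinguishes "millions" from "hundreds of millions").

```python
from typing import Optional

# Hypothetical cutoff: the answer only says "millions" vs "hundreds of millions".
LARGE_ROW_COUNT = 100_000_000

# Scenario labels are illustrative names chosen for this sketch.
OLAP_ENGINES = {
    "realtime_compute": "Storm",
    "batch_compute": "Hadoop",
    "realtime_query": "Solr/Elasticsearch",
    "warehouse": "Greenplum",  # ODS/EDW/data mart
}


def suggest_engine(row_count: int, workload: str,
                   scenario: Optional[str] = None) -> str:
    """Return the system this answer suggests for a given size and workload."""
    if workload not in ("OLTP", "OLAP"):
        raise ValueError("workload must be 'OLTP' or 'OLAP'")
    # Millions of rows, or OLTP at any of the sizes discussed: stay on MySQL.
    if row_count < LARGE_ROW_COUNT or workload == "OLTP":
        return "MySQL"
    # Hundreds of millions of rows with an OLAP focus: depends on the scenario.
    if scenario not in OLAP_ENGINES:
        raise ValueError(f"unknown OLAP scenario: {scenario!r}")
    return OLAP_ENGINES[scenario]
```

For example, `suggest_engine(300_000_000, "OLAP", "batch_compute")` returns `"Hadoop"`, while the same row count with `"OLTP"` stays on `"MySQL"`.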

Database systems are generally divided into two types:

One is the OLTP type, which serves front-end applications: each operation is relatively simple, but throughput and concurrency are high.

The other is the OLAP type, which is compute-heavy and runs statistical analysis over large data sets.
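
To make the difference between the two types concrete, here is a minimal sketch using Python's built-in `sqlite3` module as a stand-in for a real database. The `orders` table, its columns, and the generated rows are hypothetical; only the shape of the two queries matters.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE orders (id INTEGER PRIMARY KEY, region TEXT, amount REAL)"
)
# Generate 1000 hypothetical rows: odd ids go to "north", even ids to "south".
rows = [(i, "north" if i % 2 else "south", float(i)) for i in range(1, 1001)]
conn.executemany("INSERT INTO orders VALUES (?, ?, ?)", rows)

# OLTP-style work: a short transaction that touches one row by primary key.
with conn:
    conn.execute("UPDATE orders SET amount = amount + 1 WHERE id = ?", (42,))
oltp_row = conn.execute(
    "SELECT amount FROM orders WHERE id = ?", (42,)
).fetchone()

# OLAP-style work: one query that scans and aggregates the whole table.
olap_rows = conn.execute(
    "SELECT region, COUNT(*), SUM(amount) FROM orders "
    "GROUP BY region ORDER BY region"
).fetchall()

print(oltp_row)
print(olap_rows)
```

The OLTP statement is cheap per call but would be issued many times concurrently; the OLAP statement is issued rarely but reads every row, which is why the two workloads tend to favor different systems as the data grows.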