Current location - Loan Platform Complete Network - Big data management - What is the difference between hbase and hive, and what scenarios do they apply to?
What is the difference between hbase and hive, and what scenarios do they apply to?
The main difference between hbase and hive is: they are different for its internal data storage and management, hbase its main feature is modeled after the bigtable column potential storage, for large data storage, querying than traditional databases have a huge advantage, while the hive its generation of the main response to the problem of data warehousing, which exists in the hdfs on the file directory structure mapped into a table. The main concern is the statistics of the data and other aspects.

Suitable scenarios:

hbase: suitable for large-scale data storage, its role can be compared to the role of traditional databases, the main concern of the data access.

hive: suitable for big data management, statistics, processing, its role is analogous to the role of traditional data warehouses, the main concern of the data processing.

Summary: to cope with big data, if you favor the data storage query hbase is undoubtedly more suitable, and you are concerned about the results of the processing of big data query, for example, you query when there are similar to count, sum and other function operations hive can meet your needs, generally some of the project is lost in the hive inside the data processing, and then the results will be imported to mysql and other databases or hbase query, as for the choice of mysql and hbase Comparison of the amount of data tends to be after your processing