Big data platforms can handle very large and varied data sources.
Data sources for big data platforms usually fall into three groups. File sources are loaded directly into Hive tables with Hive's LOAD DATA statement. Relational databases are extracted into Hive, HDFS, or HBase with Sqoop. Message queues such as Kafka are consumed in real time and feed real-time computation for scenarios that need up-to-the-moment results.
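The three ingestion paths above can be sketched as a small routing function. This is a minimal illustration, not a real pipeline: the Source class, the ingestion_plan function, and every path, JDBC URL, table, and topic name are hypothetical, and the function only builds the command text rather than running Hive, Sqoop, or Kafka.

```python
from dataclasses import dataclass


@dataclass
class Source:
    """A hypothetical description of one data source."""
    kind: str        # "file", "rdbms", or "queue"
    location: str    # HDFS path, JDBC URL, or Kafka topic (all made up here)
    table: str = ""  # target or source table name, unused for queues


def ingestion_plan(src: Source) -> str:
    """Return the command or action that would ingest this source."""
    if src.kind == "file":
        # Files already on HDFS are moved into a Hive table with LOAD DATA.
        return f"LOAD DATA INPATH '{src.location}' INTO TABLE {src.table}"
    if src.kind == "rdbms":
        # Relational tables are pulled over JDBC with sqoop import.
        return (f"sqoop import --connect {src.location} "
                f"--table {src.table} --hive-import")
    if src.kind == "queue":
        # Queues are not batch-loaded; a streaming job subscribes to the topic.
        return f"stream: subscribe to Kafka topic '{src.location}'"
    raise ValueError(f"unknown source kind: {src.kind}")


if __name__ == "__main__":
    for s in (
        Source("file", "/data/in/orders.csv", "orders"),
        Source("rdbms", "jdbc:mysql://db.example.com/shop", "customers"),
        Source("queue", "click-events"),
    ):
        print(ingestion_plan(s))
```

The split mirrors the text: the first two kinds are batch loads that end in a table, while the queue is a continuous subscription with no single load command.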
Cloud computing provides the storage and computing platform for this massive and diverse data. When data from different sources is managed, processed, analyzed, and optimized, and the results are fed back into the applications described above, it can create tremendous economic and social value. Big data has the potential to catalyze social change.
Structure:
Big data includes structured, semi-structured, and unstructured data, and unstructured data is increasingly the largest part. According to IDC, 80% of enterprise data is unstructured, and this data is growing by 60% every year, which means it compounds exponentially.
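To make the 60% figure concrete, the short sketch below compounds it year over year. Only the growth rate comes from the text; the starting volume of 1.0 (in arbitrary units) and the function names are assumptions chosen for illustration.

```python
import math

ANNUAL_GROWTH = 0.60  # 60% per year, the rate quoted above


def volume_after(start: float, years: int, rate: float = ANNUAL_GROWTH) -> float:
    """Compound growth: each year's volume is (1 + rate) times the last."""
    return start * (1.0 + rate) ** years


def doubling_time(rate: float = ANNUAL_GROWTH) -> float:
    """Years needed for the volume to double at a constant annual rate."""
    return math.log(2.0) / math.log(1.0 + rate)


if __name__ == "__main__":
    for year in range(6):
        print(year, round(volume_after(1.0, year), 3))
    print("doubling time (years):", round(doubling_time(), 2))
```

At this rate the volume doubles in about a year and a half and grows more than tenfold in five years.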
Big data is simply a feature of the Internet at its present stage of development; there is no need to mythologize it or hold it in awe. Against the backdrop of technological innovation represented by cloud computing, data that once seemed difficult to collect and use is becoming easy to exploit. Through continuous innovation across all industries, big data will gradually create more value for humanity.