1, the definition of big data. Big data, also known as the huge amount of information, refers to the amount of data involved is so large in scale that it can not be achieved through the human brain or even mainstream software tools, within a reasonable time to capture, manage, process, and organize into information to help business decision-making for more positive purposes.
2, the collection of big data. Science and technology and the development of the Internet, promoting the advent of the era of big data, all walks of life every day in the production of a huge number of data fragments, data measurement unit has developed from Byte, KB, MB, GB, TB to PB, EB, ZB, YB and even BB, NB, DB to measure. The collection of data in the era of big data is also no longer a technical problem, just the face of such a large amount of data, how can we find its inner law.
3, the characteristics of big data. The amount of data, data types, the requirement of real-time, the value of the data contained in the big. In all walks of life there is big data, but a lot of information and advice is complex, we need to search, processing, analysis, summarize, summarize the deep law.
4, big data mining and processing. Inevitably, big data cannot be projected and estimated with the human brain, or processed with a single computer, but must use distributed computing architecture, relying on cloud computing distributed processing, distributed database, cloud storage and virtualization technology, therefore, the mining and processing of big data must be used in cloud technology.
5, the application of big data. Big data can be applied to all walks of life to analyze and organize the huge amount of data that people have collected to achieve the effective use of information. To give an example of this profession, for example, in the dairy cow gene level to find the main effect of genes related to milk production, we can first of all the whole genome of the cow scanning, although we get all the phenotypic information and genetic information, but because of the huge amount of data, which requires the use of big data technology, to analyze the comparison and mining of the main effect of genes. There are many more examples.
6. The significance and prospect of big data. In a nutshell, big data is a large amount of, dynamic, and can be sustained data, through the use of new systems, new tools, new models of mining, so as to obtain insights and new value. Previously, in the face of huge data, we may be blinded by a leaf, visible, and therefore can not understand the true nature of things, so that in the scientific work of the wrong inference, and the advent of the era of big data, all the truth will be displayed in front of us.