Characterization of Big Data
To understand big data, we first need to know what it actually is. Big data refers to collections of data that cannot be captured, managed, and processed within an acceptable timeframe using conventional software tools. These massive, fast-growing, and diverse information assets require new processing models in order to deliver stronger decision-making, insight discovery, and process optimization capabilities.

The four characteristics of big data:

First, large volume

The first characteristic of big data is that it is "big". In the early MP3 era, a small MP3 file of a few MB could satisfy many people's needs. Over time, however, storage units have grown from GB to TB, and now to the PB and even EB levels. Only when the volume of data reaches the PB level or above can it be called big data. With the rapid development of information technology, data has begun to grow explosively. Social networks, mobile networks, and all kinds of smart devices have become sources of data. Taobao's nearly 400 million members generate about 20 TB of commodity transaction data every day. There is an urgent need for intelligent algorithms, powerful data processing platforms, and new data processing technologies that can compute statistics on, analyze, make predictions from, and process data on this scale in real time.
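To make the scale concrete, the short sketch below converts between the storage units mentioned above and estimates how long a source producing 20 TB per day would take to reach 1 PB. It assumes decimal units (each unit is 1000 times the previous one) and a constant daily rate; the function names `to_bytes` and `days_to_reach` are illustrative only.

```python
# Decimal (SI) storage units; each is assumed to be 1000 times the previous one.
UNITS = ["B", "KB", "MB", "GB", "TB", "PB", "EB"]


def to_bytes(value, unit):
    """Convert an integer value in the given decimal unit to bytes."""
    return value * 1000 ** UNITS.index(unit)


def days_to_reach(target_bytes, daily_bytes):
    """Whole days needed to accumulate at least target_bytes at a constant daily rate."""
    return -(-target_bytes // daily_bytes)  # ceiling division on integers


daily = to_bytes(20, "TB")   # daily volume quoted in the text
one_pb = to_bytes(1, "PB")   # PB threshold mentioned in the text

print(days_to_reach(one_pb, daily))  # 50
```

Under these assumptions, a single source at that rate crosses the PB threshold in under two months, which illustrates why storage and processing need to scale together.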

Second, high speed

High speed means that algorithms process the data very quickly. Under the so-called "1-second rule", high-value information can be obtained rapidly from many types of data, which is a fundamental difference from traditional data mining technology. The data must also be processed in a timely manner, because it is very uneconomical to spend large amounts of capital storing historical data that is of little use. For this reason, big data places very strict requirements on processing speed: a large share of server resources is devoted to processing and computing on data, and many platforms need real-time analysis. Data is generated all the time, and whoever processes it faster has the advantage.

Third, variety

A single piece of data on its own has no value. The wide range of data sources determines the diversity of forms that big data takes, and any form of data can be useful. The most widespread application is the recommendation system, as used by Taobao, NetEase Cloud Music, Toutiao (Today's Headlines), and others: these platforms analyze users' log data in order to recommend what each user is likely to enjoy. Log data is clearly structured. Other data, such as pictures, audio, and video, has no obvious structure and only weak causal relationships, so it needs to be labeled manually.
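As a rough illustration of how structured log data can drive recommendations, the sketch below counts which categories each user interacts with most often. The log records, field layout, and the function name `top_categories` are all hypothetical, and real platforms use far more sophisticated models than simple frequency counts.

```python
from collections import Counter, defaultdict

# Hypothetical log records: (user_id, item_category).
logs = [
    ("u1", "jazz"), ("u1", "jazz"), ("u1", "rock"),
    ("u2", "news"), ("u2", "sports"), ("u2", "news"),
]


def top_categories(records, n=1):
    """Return each user's n most frequently seen categories."""
    counts = defaultdict(Counter)
    for user, category in records:
        counts[user][category] += 1
    return {
        user: [category for category, _ in counter.most_common(n)]
        for user, counter in counts.items()
    }


recommendations = top_categories(logs)
print(recommendations)
```

This works only because each log record already has named fields. Pictures, audio, and video would first need labels before a similar count could be made.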

Fourth, value

Value is the core characteristic of big data. Only a small percentage of the data generated in the real world is valuable. If you had more than 1 PB of online data on all young people aged 20 to 35 in the country, it would naturally have commercial value: by analyzing it, you could learn these people's interests, which in turn could guide the direction of product development. Likewise, with data on millions of patients across the country, the occurrence of diseases could be predicted through analysis. These are examples of the value of big data. Big data is applied widely, in agriculture, finance, health care, and other fields, ultimately helping to improve social governance, raise productivity, and advance scientific research.

In summary, the main characteristics of big data are large volume, high speed, variety, and value. They reflect the huge role big data plays in the development of today's society, in the progress of science, in people's lives, and in other areas, and they also point to the broad prospects for big data in the future.