Big Data
The so-called "4 Vs" of big data refer to the following four characteristics:
1. Volume. The first characteristic of big data is that it is big. In the early MP3 era, an MP3 player with megabyte-level storage met the needs of many people; since then, common storage units have grown from GB to TB, and now to the PB and even EB level. With the rapid development of information technology, data has begun to grow explosively. Social networks (microblogs, Twitter, Facebook), mobile networks, and all kinds of smart devices and service tools have become sources of data. Taobao's nearly 400 million members generate about 20 terabytes of commodity transaction data every day; Facebook's approximately 1 billion users generate more than 300 terabytes of log data every day. Counting, analyzing, predicting from, and processing data at this scale in real time urgently requires intelligent algorithms, powerful data processing platforms, and new data processing technologies.
2. Variety. The wide range of data sources determines the diversity of forms that big data takes. Data in any form can be put to use. The most widespread application is the recommender system: platforms such as Taobao, NetEase Cloud Music, and Toutiao (Today's Headlines) analyze users' log data in order to recommend items that each user is likely to enjoy. Log data is clearly structured. Other data, such as pictures, audio, and video, has no obvious structure; the cause-and-effect relationships in such data are weak, so it needs to be labeled manually.
3. Velocity. Big data is generated very quickly and is transmitted mainly over the Internet. Almost everyone depends on the Internet in daily life, which means that individuals supply a large amount of information to big data systems every day. This data needs to be processed promptly, because spending large amounts of capital to store historical data of little use is uneconomical. A platform might keep only the data from the past few days or the past month and clean up anything older in a timely manner; otherwise the cost becomes too great. For this reason, big data places very strict requirements on processing speed: a large share of server resources goes to processing and computing on data, and many platforms need to perform real-time analysis. Data is being generated all the time, and whoever processes it faster has the advantage.
4. Value. This is the core characteristic of big data. Only a small percentage of the data generated in the real world has value. Compared with traditional small data, the greatest value of big data lies in extracting, from large amounts of irrelevant data of many types, the data that is useful for predicting future trends and patterns, and then analyzing it in depth with machine learning, artificial intelligence, or data mining methods. The new laws and knowledge discovered this way are applied in fields such as agriculture, finance, and medical care, ultimately improving social governance, raising production efficiency, and advancing scientific research.
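The trade-off between volume and storage cost described in point 3 can be made concrete with a small sketch. The following Python snippet is only an illustration under stated assumptions: the 20 TB per day figure is the Taobao number quoted in point 1, while the 30-day retention window, the record layout, and the function names `steady_state_storage_tb` and `prune` are hypothetical choices made for this example, not a description of how any real platform works.

```python
from datetime import datetime, timedelta

# Daily volume quoted in the text for Taobao transaction data (terabytes).
DAILY_TB = 20
# Hypothetical retention window; the text only says "a few days or a month".
RETENTION_DAYS = 30


def steady_state_storage_tb(daily_tb, retention_days):
    """Storage needed once the window is full: daily volume times days kept."""
    return daily_tb * retention_days


def prune(records, now, retention_days):
    """Keep only records whose timestamp falls inside the retention window."""
    cutoff = now - timedelta(days=retention_days)
    return [r for r in records if r["ts"] >= cutoff]


# Hypothetical records with made-up timestamps, for illustration only.
now = datetime(2024, 1, 31)
records = [
    {"id": 1, "ts": datetime(2023, 12, 1)},   # older than 30 days: dropped
    {"id": 2, "ts": datetime(2024, 1, 15)},   # inside the window: kept
    {"id": 3, "ts": datetime(2024, 1, 30)},   # inside the window: kept
]

kept = prune(records, now, RETENTION_DAYS)
print([r["id"] for r in kept])                             # [2, 3]
print(steady_state_storage_tb(DAILY_TB, RETENTION_DAYS))   # 600
```

Under these assumptions, keeping one month of data at 20 TB per day already means holding about 600 TB at any time, which suggests why older data is cleaned up rather than stored indefinitely.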
In the era of big data, everyone enjoys the convenience that big data brings. You can shop without leaving home; you can head out in an emergency without having to wait on the street for a cab; and you only need to move your fingers to find out what is going on in the world. Although big data raises personal privacy issues, on the whole it is steadily improving our lives and making them more convenient.