Current location - Quotes Website - Excellent quotations - Data is like water
Data is like water

In the past two years, I have been indulged in various big data trends and big data-related projects. In the past few days, I have calmed down and reflected on the various projects under the cloak of "big data" that have succeeded, failed, impressed me, or scorned. After thinking about it, I only have four things left in my mind. Words: Data is like water. . . . . .

We might as well imagine the entire Internet as a city. Then, each data center is undoubtedly the high-rise buildings in the city; the optical fiber network connecting the data centers is naturally the various pipeline channels crisscrossing the city. So, what is data? The most appropriate metaphor is naturally water flowing in a pipe. It is difficult to imagine a city without water. Likewise, the value of data to the Internet and various enterprises is the same. However, for a vibrant city, its water resources must also be vibrant, flowing and circulating. Similarly, the real meaning of data lies in its "flow".

If I tell you, there is such a city with sufficient water. But water is stored in each bathtub tank. How would you feel if it wasn't flowing in water pipes and sewers? You will definitely find it incredible, and you may even think: Damn! Wouldn't it stink if these data were left like this? . . . . . The same is true for data. Data that does not flow is meaningless in the Internet era! What we need to do now is to open up the data and let the data flow organically on the Internet, instead of mechanically storing the data in bathtub buckets and waiting for it to stink. So: Data is like water, you can just "use" the data, and there is no need to waste too much resources and energy on "installing" the data.

So, how to "use" data? In fact, it is the same as using water. How do you use water to wash your hands, bathe, wash dishes, and drink? Isn't it just a simple matter of faucets, showerheads, and water purifiers of various sizes? Then, corresponding volumes of water are obtained from these devices, boiled to make tea and coffee, etc. In fact, the same goes for "using" data. We don't need to "store" a large amount of data. We only need to install various "faucets" and "water purifiers" (data cleaning, ETL tools) suitable for our business purposes. Through this type of equipment, you can obtain data that meets your needs, and then "boil" the data (data mining, business intelligence) to meet your higher-level needs, such as drinking, etc. Of course, after use, don’t forget to let the used and screened data continue to “flow” out and be recycled, just like you defecate or pour sewage into the sewer.

I think this is the correct way for companies to use data to generate value in the era of big data. That is to say, be a user of big data rather than a private collector of big data. "Flowing" and "circulating" data are the real big data in the Internet era. Once the data really "flows" like this, the value of the data will come out. Data centers that play the role of "water source" will also operate faster and more efficiently, and the "big city" of the Internet will also become more and more efficient. Charming and energetic. Unfortunately, the slogan of big data is loud now, but most companies are just doing their own "bathtubs" and "buckets" in most projects, blindly hoarding data. At most, they can install a "water meter" in their own homes and check their own home storage from time to time. How much water was consumed, and which bucket contained the same water as a week ago. . . . . . As everyone knows, running water does not rot. Without the help of faucets and water purifiers, the few boxes and buckets of water you have saved will soon become ugly and worthless. At that time, you may not have the energy or the place to pour it out if you want to. Therefore, I write so much just to appeal to two things: 1. When CEOs and CTOs start big data projects, you can first consider installing "faucets" and "water purifiers" for your own companies, without having to consider whether to install them first. Install a "bigger bathtub." 2. For data that is idle and ready to be discarded in the enterprise, you might as well open it up and let it "flow". Maybe they will generate new value after such a cycle. I don’t know if these metaphors are appropriate, but I just have some ideas that I don’t like. If you think it's nonsense, just laugh it off. Don't argue with me. I actually don't know much about anything.