What is Big Data? How did it become a challenge for top MNCs like Google and Facebook?

Sriramadasu Prasanth Kumar
3 min read · Sep 17, 2020

This article looks at the growing demand for big data in today’s digital world.

What is Big Data?

Big data is a term that describes the large volume of data, both structured and unstructured, that inundates a business on a day-to-day basis. But it’s not the amount of data that’s important; what matters is what organizations do with it.

Have you ever thought about:

How do Google, Facebook and Instagram store such huge amounts of data?

How do they retrieve that data within seconds?

Where does big data come from?

Big data may include text, videos, music, files and anything else, whether it is useful or not.

Put simply, big data is data in very large amounts.

When we search for anything on Google, it returns millions of records related to that search. Those records are data.

The bulk of big data generated comes from three primary sources: social data, machine data and transactional data.

Increasing market trends of Big Data

We want to store every piece of data we produce, and data usage keeps increasing day by day. That steady growth is what gives rise to big data.

Challenges faced by top MNCs due to Big Data

1. Dealing with data growth

2. Generating insights in a timely manner

3. Recruiting and retaining big data talent

4. Integrating disparate data sources

5. Validating data

6. Securing big data

7. Organizational resistance

3 Vs of Big Data:

  • Volume
  • Velocity
  • Variety

1. VOLUME

Within the social media space, for example, Volume refers to the amount of data generated through websites, portals and online applications. Especially for B2C companies, Volume encompasses all the available data that needs to be assessed for relevance. Consider the following: Facebook has 2 billion users, YouTube 1 billion users, Twitter 350 million users and Instagram 700 million users. Every day, these users contribute billions of images, posts, videos, tweets and so on. You can imagine the enormous amount, or Volume, of data that is generated every minute and every hour.

2. VELOCITY

Velocity refers to the speed at which data is generated. Staying with our social media example, every day 900 million photos are uploaded to Facebook, 500 million tweets are posted on Twitter, 0.4 million hours of video are uploaded to YouTube and 3.5 billion searches are performed on Google. This is like a nuclear data explosion. Big data technology helps a company contain this explosion: it accepts the incoming flow of data and processes it quickly enough that no bottlenecks form.
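
To get a feel for these numbers, the daily figures quoted above can be converted into average per-second rates. The short Python sketch below does only that arithmetic. It assumes the events are spread evenly over the day, which real traffic is not, and the names `daily_events` and `per_second` are made up for this illustration.

```python
# Daily figures as quoted in the article; they are not live statistics.
SECONDS_PER_DAY = 24 * 60 * 60  # 86,400 seconds

daily_events = {
    "Facebook photos uploaded": 900_000_000,
    "Twitter tweets posted": 500_000_000,
    "YouTube video hours uploaded": 400_000,
    "Google searches": 3_500_000_000,
}


def per_second(daily_count: int) -> float:
    """Average events per second, assuming an even spread over one day."""
    return daily_count / SECONDS_PER_DAY


# Convert every daily total into an average per-second rate.
rates = {name: per_second(count) for name, count in daily_events.items()}

for name, rate in rates.items():
    print(f"{name}: about {rate:,.1f} per second")
```

On these figures, a system would have to accept roughly 10,400 photos and 40,500 searches every second on average, and peak load would be higher than the average.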

3. VARIETY

Variety in Big Data refers to all the structured and unstructured data that can be generated either by humans or by machines. The most commonly added data are texts, tweets, pictures and videos, most of which are unstructured or only loosely structured. Other unstructured data such as emails, voicemails, handwritten text, ECG readings and audio recordings are also important elements of Variety. Variety is all about the ability to classify the incoming data into various categories.
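
As a rough illustration of classifying incoming data into categories, here is a minimal Python sketch that sorts files into structured, semi-structured and unstructured groups by their extension. The extension-to-category mapping, the function names and the sample file names are all assumptions made up for this example. Real pipelines usually inspect the content itself, not just the file name.

```python
import os
from collections import defaultdict

# Hypothetical mapping from file extension to a broad data category.
CATEGORY_BY_EXTENSION = {
    ".csv": "structured",
    ".json": "semi-structured",
    ".xml": "semi-structured",
    ".txt": "unstructured",
    ".eml": "unstructured",
    ".jpg": "unstructured",
    ".mp3": "unstructured",
    ".mp4": "unstructured",
}


def classify(filename: str) -> str:
    """Return the category for a file name, or 'unknown' if it is unmapped."""
    extension = os.path.splitext(filename)[1].lower()
    return CATEGORY_BY_EXTENSION.get(extension, "unknown")


def group_by_category(filenames):
    """Group file names under the category that classify() assigns them."""
    groups = defaultdict(list)
    for name in filenames:
        groups[classify(name)].append(name)
    return dict(groups)


# Made-up sample of incoming files.
incoming = ["orders.csv", "tweet_123.json", "voicemail.mp3", "scan.JPG", "notes"]
grouped = group_by_category(incoming)
print(grouped)
```

Anything the mapping does not recognise lands in an "unknown" group, so new kinds of data show up for review and are not silently dropped.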
