Skip to main content

What is Big Data ?

What is Big Data ?
It is now time to answer an important question – What is Big Data?

Big data, as defined by Wikipedia, is this:
“Big data is a broad term for data sets so large or complex that traditional data processing applications are inadequate. Challenges include analysis, capture, data curation, search, sharingstorage,transfervisualizationquerying and information privacy. The term often refers simply to the use of predictive analytics or certain other advanced methods to extract value from data, and seldom to a particular size of data set.”
In simple terms, Big Data is data that has the 3 characteristics that we mentioned in the last section –
• It is big – typically in terabytes or even petabytes
• It is varied – it could be a traditional database, it could be video data, log data, text data or even voice data
• It keeps increasing as new data keeps flowing in
This kind of data is becoming common place in many fields including Science, public administration and business.
The ability to harness such data for better decision making is therefore in great demand in today’s world.

Where is Big Data Used ?
Big Data is most prevalent in consumer-centric industries that typically generate large volumes of data. Examples of
such industries are –
• Consumer products such as Proctor & Gamble
• Credit card and Insurance such as Capital One and Progressive Insurance
• E-commerce companies such as Amazon, Netflix and Flipkart
• Travel and leisure such as United Airlines and Caesars Casino
• Public utilities such as electricity companies
Big Data is also becoming increasingly important in industries such as –
• Telecom
• Media and Entertainment
• Education
• And healthcare
Within each of these industries, Big Data can be applied to various functions such as –
• Marketing – for example social media analysis to understand customer pulse
• Supply chain – for example better inventory management through GPS data analysis
• Finance – for example for fraud control
• Manufacturing – for example, to link manufacturing operations with the supply chain for better optimization
In this section, you have seen industries and functions where Big Data is making a significant impact.
Now let us get an overview of some of the technologies that are driving the Big Data revolution.

Comments

Popular posts from this blog

Let's Understand Ten Machine Learning Algorithms

Ten Machine Learning Algorithms to Learn Machine Learning Practitioners have different personalities. While some of them are “I am an expert in X and X can train on any type of data”, where X = some algorithm, some others are “Right tool for the right job people”. A lot of them also subscribe to “Jack of all trades. Master of one” strategy, where they have one area of deep expertise and know slightly about different fields of Machine Learning. That said, no one can deny the fact that as practicing Data Scientists, we will have to know basics of some common machine learning algorithms, which would help us engage with a new-domain problem we come across. This is a whirlwind tour of common machine learning algorithms and quick resources about them which can help you get started on them. 1. Principal Component Analysis(PCA)/SVD PCA is an unsupervised method to understand global properties of a dataset consisting of vectors. Covariance Matrix of data points is analyzed here to un

gRPC with Java : Build Fast & Scalable Modern API & Microservices using Protocol Buffers

gRPC Java Master Class : Build Fast & Scalable Modern API for your Microservice using gRPC Protocol Buffers gRPC is a revolutionary and modern way to define and write APIs for your microservices. The days of REST, JSON and Swagger are over! Now writing an API is easy, simple, fast and efficient. gRPC is created by Google and Square, is an official CNCF project (like Docker and Kubernetes) and is now used by the biggest tech companies such as Netflix, CoreOS, CockRoachDB, and so on! gRPC is very popular and has over 15,000 stars on GitHub (2 times what Kafka has!). I am convinced that gRPC is the FUTURE for writing API for microservices so I want to give you a chance to learn about it TODAY. Amongst the advantage of gRPC: 1) All your APIs and messages are simply defined using Protocol Buffers 2) All your server and client code for any programming language gets generated automatically for free! Saves you hours of programming 3) Data is compact and serialised 4) API

GraphQL - A Short Intro

Why GraphQL is the future of APIs Since the beginning of the web, developing APIs has been a difficult task for developers. The way we develop our APIs must evolve with time so that we can always build good, intuitive and well-designed APIs. In the last few years, GraphQL has been growing in popularity among developers. A lot of companies have started adopting this technology to build their APIs. GraphQL is a query language developed by Facebook in 2012 and released publicly in 2015. It has been gaining a lot of traction. It has been adopted by a lot of big companies such as Spotify, Facebook, GitHub, NYTimes, Netflix, Walmart, and so on. In this series of tutorials, we’re going to examine GraphQL, understand what it is, and see what features make this query language so intuitive and easy to use. So, let’s get started by examining the problems with REST, and how GraphQL solves them. We will also find out why companies have been building their APIs with GraphQL, and why