What is Big Data?

It is now time to answer an important question – What is Big Data?

Big data, as defined by Wikipedia, is this:
“Big data is a broad term for data sets so large or complex that traditional data processing applications are inadequate. Challenges include analysis, capture, data curation, search, sharing, storage, transfer, visualization, querying and information privacy. The term often refers simply to the use of predictive analytics or certain other advanced methods to extract value from data, and seldom to a particular size of data set.”
In simple terms, Big Data is data that has the three characteristics that we mentioned in the last section –
• It is big – typically in terabytes or even petabytes
• It is varied – it could be a traditional database, it could be video data, log data, text data or even voice data
• It keeps increasing as new data keeps flowing in
This kind of data is becoming commonplace in many fields, including science, public administration and business.
The ability to harness such data for better decision making is therefore in great demand in today’s world.
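As a rough illustration of the third characteristic, the Python sketch below keeps a summary up to date as new values arrive, without ever holding the whole data set in memory. The function name `running_mean` and the sample readings are hypothetical and chosen only for this example. Real Big Data systems distribute this kind of work across many machines.

```python
from typing import Iterable, Iterator


def running_mean(readings: Iterable[float]) -> Iterator[float]:
    """Yield the mean of all readings seen so far, one value per reading."""
    count = 0
    total = 0.0
    for value in readings:
        # Only two numbers are kept in memory, however long the stream is.
        count += 1
        total += value
        yield total / count


if __name__ == "__main__":
    # Hypothetical sensor readings; a real feed would keep producing values.
    stream = iter([10.0, 20.0, 30.0, 40.0])
    for mean in running_mean(stream):
        print(mean)
```

Because the function consumes any iterable one item at a time, the same code works whether the source is a small list, a large file read line by line, or a live feed.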

Where is Big Data Used?
Big Data is most prevalent in consumer-centric industries that typically generate large volumes of data. Examples of such industries are –
• Consumer products companies such as Procter & Gamble
• Credit card and insurance companies such as Capital One and Progressive Insurance
• E-commerce companies such as Amazon, Netflix and Flipkart
• Travel and leisure such as United Airlines and Caesars Casino
• Public utilities such as electricity companies
Big Data is also becoming increasingly important in industries such as –
• Telecom
• Media and Entertainment
• Education
• Healthcare
Within each of these industries, Big Data can be applied to various functions such as –
• Marketing – for example, social media analysis to understand the customer pulse
• Supply chain – for example, better inventory management through GPS data analysis
• Finance – for example, fraud control
• Manufacturing – for example, linking manufacturing operations with the supply chain for better optimization
In this section, you have seen industries and functions where Big Data is making a significant impact.
Now let us get an overview of some of the technologies that are driving the Big Data revolution.
