UrbanPro
true

Learn Big Data from the Best Tutors

  • Affordable fees
  • 1-1 or Group class
  • Flexible Timings
  • Verified Tutors

Search in

Big Data for Beginners

Silvia Priya
29/03/2019 0 0

Hello Big Data Enthusiast,

Many of you would have heard about this term "Big Data" getting buzzed out everywhere and wondering what it could be.

Ok, let's sort out things with an example.

Imagine you have a machine with a capacity of  8 GB storage, and you want to store a data of size 12 GB from a client and perform some analytics on it. So, think of the possible ways in which you can  store the desired data.

1. Extend your HD capacity to around 15GB or beyond for a succesfull storage.

2. Hire a cloud serivce and upload the data in cloud for analysis, but if the client don't want to upload the data into cloud due to multiple reasons, then this option will be ruled out.

3. Upload the data into a distributed file system, after analysing it pro's and con's.

True, you can follow any one of the above mentioned cases.

This data of 12GB size which is beyond the machine storage capacity is actually called as BIG DATA.

This BIG DATA can be in any format/type like structured,unstructured and semistructured.

Structured - RDBMS data(table data with proper rows,columns,keys etc..)

Unstructured - Images, Pictures ,Videos etc.

Semistructured - Files of format HTML,XML etc.

 

A data can be big data to you and not necessarily be a big data to other person.

Seems confusing..., giong back to the previous example, if you have have machine of 20 GB storage capacity, you can conveniently store our 12GB and it is not a big data for you at first place.

 

Hope now have you climbed up a little, on the mountain of Big data!!..

Big data can also be defined in other way if it satisifies the below criterias.

 

 

Volume  

If the size of the data you are planning to analyse, is much bigger than the capacity of your machine, then call it as a big data.

Velocity

If the rate or speed of the data entering into your machine increases exponentially with respect to time , then call it as a big data.

Variety

The data could be in any formats like Structured,Unstructured,Semi-structured as we have seen previously.

Veracity

The data that we are going to process can contain some uncertain information or incorrect data.

Value

The data should makes some sense to the business, that is we should be able to make some analysis out of the data.

 

Now cooking up all of the above information, you can keep your first step into learning Big data.

We will see more insights about this on our next lesson.

Thank you!!

0 Dislike
Follow 1

Please Enter a comment

Submit

Other Lessons for You

Lesson: Hive Queries
Lesson: Hive Queries This lesson will cover the following topics: Simple selects ? selecting columns Simple selects – selecting rows Creating new columns Hive Functions In SQL, of which...
C

Data Filtering In Excel
Filtering data in MS Excel refers to displaying only the rows that meet certain conditions (The other rows gets hidden). Using the store data, if you are interested in seeing data where Shoe Size is 36,...

Power View
Power View is now a feature of Microsoft Excel 2013, and is part of the Microsoft SQL Server 2012 Reporting Services add-in for Microsoft SharePoint Server 2010 and 2013 Enterprise Editions. Power View...

HDFS Commands - Data Engineering
HDFS commands : HDFS commands will interact with namenode to show results some commands like cat ,tail will interact with datanode to show results HDFS help commands: hadoop fs -help ls list commands...

5 Tips For Improving Your Documentation Immediately.
Tip 1) Quit it with the Passive Voice The passive voice is a plague on effective documentation. It reduces its clarity, its consistency, and the efficiency and tightness of the writing. The passive voice...
X

Looking for Big Data Classes?

The best tutors for Big Data Classes are on UrbanPro

  • Select the best Tutor
  • Book & Attend a Free Demo
  • Pay and start Learning

Learn Big Data with the Best Tutors

The best Tutors for Big Data Classes are on UrbanPro

This website uses cookies

We use cookies to improve user experience. Choose what cookies you allow us to use. You can read more about our Cookie Policy in our Privacy Policy

Accept All
Decline All

UrbanPro.com is India's largest network of most trusted tutors and institutes. Over 55 lakh students rely on UrbanPro.com, to fulfill their learning requirements across 1,000+ categories. Using UrbanPro.com, parents, and students can compare multiple Tutors and Institutes and choose the one that best suits their requirements. More than 7.5 lakh verified Tutors and Institutes are helping millions of students every day and growing their tutoring business on UrbanPro.com. Whether you are looking for a tutor to learn mathematics, a German language trainer to brush up your German language skills or an institute to upgrade your IT skills, we have got the best selection of Tutors and Training Institutes for you. Read more