Why does Hadoop use a default block size of 128 MB and a default replication factor of 3?


2 Answers

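The block size is not dictated by the data itself; it is a configurable HDFS setting, and 128 MB is only the default. A large block keeps the number of blocks per file small, and with it the amount of metadata the NameNode has to track. The replication factor can also be customised: the default is 3 in order to avoid data loss, and you can increase or decrease it depending on the size of your cluster.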
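
To make the arithmetic concrete, here is a minimal Python sketch of how a file maps onto blocks and replicas under the defaults. The function name `hdfs_footprint` and its return keys are hypothetical, chosen for illustration only; the sketch assumes the final block occupies only the bytes it actually holds, so raw storage is simply file size times replication factor.

```python
MIB = 1024 * 1024

def hdfs_footprint(file_size_bytes, block_size_bytes=128 * MIB, replication=3):
    """Estimate block count and raw storage for one file (illustrative helper)."""
    if file_size_bytes < 0 or block_size_bytes <= 0 or replication < 1:
        raise ValueError("sizes must be non-negative and replication at least 1")
    # Ceiling division with integers: a partly filled last block still counts as a block.
    blocks = (file_size_bytes + block_size_bytes - 1) // block_size_bytes
    return {
        "blocks": blocks,
        "block_replicas": blocks * replication,
        # Assumption: the last block is not padded, so raw usage is size * replication.
        "raw_bytes": file_size_bytes * replication,
    }

# A 1 GiB file with the defaults: 8 blocks, 24 block replicas across the cluster.
print(hdfs_footprint(1024 * MIB))
# A 300 MiB file: two full blocks plus one 44 MiB block, so 3 blocks.
print(hdfs_footprint(300 * MIB))
```

Raising the block size lowers the block count for the same file, while raising the replication factor multiplies the raw storage used.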

HDFS uses replication to provide fault tolerance. Keeping several replicas of each piece of data guards against data loss under unfavorable conditions such as a node crash or a hardware failure.
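
As a rough illustration of why extra replicas help, the Python sketch below uses a deliberately simplified model: each node holding a replica is assumed to fail independently with the same probability, so a block is lost only if every replica fails. Real clusters do not behave this neatly (failures can be correlated, and lost replicas are re-created over time), and the function name and the 1% figure are assumptions made up for the example.

```python
def all_replicas_lost_probability(node_failure_prob, replication=3):
    """Chance that every replica of a block is lost, assuming independent failures."""
    if not 0.0 <= node_failure_prob <= 1.0:
        raise ValueError("node_failure_prob must be between 0 and 1")
    if replication < 1:
        raise ValueError("replication must be at least 1")
    # Independent events: multiply the per-node failure probability once per replica.
    return node_failure_prob ** replication

# Hypothetical 1% chance that a given node fails in some time window.
for replicas in (1, 2, 3):
    print(replicas, all_replicas_lost_probability(0.01, replicas))
```

Under this model each additional replica cuts the loss probability by the per-node failure rate, at the cost of one more full copy of the data.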
