What is the need for correlation in data science?

2 Answers

Big Data Architect

Correlation measures how two observed variables are related to each other. It is used in many different ways in data science. In univariate analysis, correlation with the target helps identify which features are more predictive for a classification or regression task. It is also used to identify multicollinearity in the feature set; multicollinearity makes a model's coefficients unstable and can reduce its accuracy. Correlation can also point to possible causal relationships between variables, although correlation alone does not prove causation. There are many extensions as well, such as canonical correlation analysis (CCA).
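To make the two uses above concrete, here is a minimal sketch in plain Python. It assumes Pearson correlation as the measure (the answer does not say which one is meant), and the helper names `pearson`, `rank_features`, and `collinear_pairs`, the 0.9 threshold, and the toy housing data are all made up for illustration.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


def rank_features(features, target):
    """Rank features by absolute correlation with the target, strongest first."""
    scores = {name: pearson(values, target) for name, values in features.items()}
    return sorted(scores.items(), key=lambda kv: abs(kv[1]), reverse=True)


def collinear_pairs(features, threshold=0.9):
    """Return feature pairs whose absolute correlation meets the threshold.

    The 0.9 default is an arbitrary illustrative cut-off, not a standard.
    """
    names = sorted(features)
    pairs = []
    for i, a in enumerate(names):
        for b in names[i + 1:]:
            r = pearson(features[a], features[b])
            if abs(r) >= threshold:
                pairs.append((a, b, r))
    return pairs


# Hypothetical toy data: two area columns carry the same information.
features = {
    "area_sqft": [500, 750, 1000, 1250, 1500],
    "area_sqm": [46.5, 69.7, 92.9, 116.1, 139.4],
    "age_years": [30, 5, 18, 2, 25],
}
price = [100, 160, 195, 260, 290]

for name, r in rank_features(features, price):
    print(f"{name}: r = {r:+.3f}")

for a, b, r in collinear_pairs(features):
    print(f"possible multicollinearity: {a} vs {b} (r = {r:+.3f})")
```

In practice you would more likely use a library routine such as pandas `DataFrame.corr()`; the hand-written version is only meant to show what is being computed. Keep in mind that Pearson correlation only captures linear relationships.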

Correlation is needed to find the relationships between two or more separate sets of data that are related to one another. A simple example is an order management process: from order details through fulfilment and invoicing, the order id acts as the correlation id. Similarly, the customer id is used to correlate separate and independent orders placed by the same customer.
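The order example can be sketched roughly like this in Python. The record layouts, the sample values, and the `correlate_by` helper are hypothetical; the only idea taken from the answer is that a shared id (order id or customer id) ties together records from otherwise separate data sets.

```python
from collections import defaultdict

# Hypothetical records from three separate systems.
orders = [
    {"order_id": "O-1", "customer_id": "C-1", "item": "laptop"},
    {"order_id": "O-2", "customer_id": "C-2", "item": "phone"},
    {"order_id": "O-3", "customer_id": "C-1", "item": "mouse"},
]
fulfilments = [
    {"order_id": "O-1", "status": "shipped"},
    {"order_id": "O-2", "status": "packed"},
]
invoices = [
    {"order_id": "O-1", "amount": 900.0},
    {"order_id": "O-3", "amount": 25.0},
]


def correlate_by(key, **sources):
    """Group records from several named sources by a shared correlation key."""
    grouped = defaultdict(lambda: {name: [] for name in sources})
    for name, records in sources.items():
        for record in records:
            grouped[record[key]][name].append(record)
    return dict(grouped)


# The order id correlates one order across details, fulfilment and invoicing.
by_order = correlate_by(
    "order_id", orders=orders, fulfilments=fulfilments, invoices=invoices
)

# The customer id correlates independent orders placed by the same customer.
by_customer = correlate_by("customer_id", orders=orders)

print(by_order["O-1"])
print([o["order_id"] for o in by_customer["C-1"]["orders"]])
```

Note that this is correlation in the record-matching sense (like a join key), which is different from the statistical correlation described in the other answer.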
