
Outlier

Nitish Vig
01 Jul

Outliers
* An outlier is an observation that lies far from the other observations in the dataset.
* An outlier may indicate an experimental error, or it may reflect genuine variability in the measurement.
* Outliers are different from noise. Noise is random error or variance in a measured variable, and it should be removed before outlier detection.
* To judge the effect of an outlier, we can fit the model with and without it and compare the scores.
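
The lesson does not name a detection method, so the following is only a minimal sketch of one common rule of thumb, the interquartile-range (IQR) rule, for finding observations that lie far from the rest. The function name `iqr_outliers`, the multiplier `k=1.5`, and the sample readings are assumptions made for illustration.

```python
import statistics

def iqr_outliers(values, k=1.5):
    """Return the values lying outside [Q1 - k*IQR, Q3 + k*IQR]."""
    # quantiles(n=4) returns the three quartile cut points Q1, Q2, Q3.
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < lower or v > upper]

# Hypothetical sample: nine readings near 10 and one distant observation.
readings = [9.8, 10.1, 10.0, 9.9, 10.2, 10.3, 9.7, 10.0, 10.1, 25.0]
print(iqr_outliers(readings))
```

Whether a flagged value is an error or genuine variability still has to be decided by looking at how it was measured.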

 

Categories of Outliers

a) Global Outlier / Point Anomalies
* A global outlier is a data point that deviates significantly from the rest of the dataset.
* A particular type of global outlier is the influential point. It is an outlier whose presence noticeably changes the fitted model, for example the slope of a regression line.
* We can evaluate an outlier by comparing the R² score and the regression line with and without it.
* Removing an outlier may or may not increase R². Each case has to be studied individually, with the purpose of the analysis in mind.
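
As a rough illustration of the comparison described above, the sketch below fits a least-squares line to a small made-up dataset with and without its last point and reports R² and the slope in both cases. The helper names `fit_line` and `r2_score` and the data are hypothetical. In this particular dataset removing the point raises R², which will not hold for every dataset.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = slope * x + intercept."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

def r2_score(xs, ys):
    """Coefficient of determination of the least-squares line on (xs, ys)."""
    slope, intercept = fit_line(xs, ys)
    mean_y = sum(ys) / len(ys)
    ss_res = sum((y - (slope * x + intercept)) ** 2 for x, y in zip(xs, ys))
    ss_tot = sum((y - mean_y) ** 2 for y in ys)
    return 1 - ss_res / ss_tot

# Hypothetical data: y is roughly 2x, except for the last point.
xs = [1, 2, 3, 4, 5, 6, 7, 8]
ys = [2.1, 3.9, 6.2, 7.8, 10.1, 12.0, 13.9, 40.0]

r2_with = r2_score(xs, ys)
r2_without = r2_score(xs[:-1], ys[:-1])
slope_with, _ = fit_line(xs, ys)
slope_without, _ = fit_line(xs[:-1], ys[:-1])
print(f"R2 with outlier:    {r2_with:.3f}, slope {slope_with:.2f}")
print(f"R2 without outlier: {r2_without:.3f}, slope {slope_without:.2f}")
```

The change in slope is what marks the last point as influential here: the fitted line moves noticeably when it is dropped.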

b) Contextual / Conditional Outlier
* A contextual outlier is a data point that deviates from the rest of the data only within a specific context.
* The same value may follow the normal trend in another context. For example, a temperature of 28 °C may be normal in summer but unusual in winter.
* We have to study it and remove it carefully.
* Two kinds of attributes are considered:
    Contextual attributes: define the context, e.g., time, location, etc.
    Behavioural attributes: the values evaluated within that context, e.g., temperature, calories taken
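
One way to sketch this idea is to group the records by their contextual attribute and score each behavioural value only against its own group. The example below does this with a simple z-score per group. The function name `contextual_outliers`, the threshold of 2.0, and the season and temperature values are assumptions for illustration, and a z-score on such small groups is only a crude check.

```python
import statistics
from collections import defaultdict

def contextual_outliers(records, threshold=2.0):
    """Flag (context, value) pairs whose value is far from the mean of its own context."""
    by_context = defaultdict(list)
    for context, value in records:
        by_context[context].append(value)

    flagged = []
    for context, value in records:
        group = by_context[context]
        mean = statistics.mean(group)
        stdev = statistics.pstdev(group)
        # Skip groups with no spread to avoid dividing by zero.
        if stdev > 0 and abs(value - mean) / stdev > threshold:
            flagged.append((context, value))
    return flagged

# Hypothetical readings: season is the contextual attribute,
# temperature in °C is the behavioural attribute.
records = (
    [("winter", t) for t in [2, 3, 1, 4, 2, 3, 2, 28]]
    + [("summer", t) for t in [27, 29, 28, 30, 28, 29, 27, 28]]
)
print(contextual_outliers(records))
```

Only the winter reading of 28 is flagged. The same value appears several times among the summer readings without being flagged, because there it matches its context.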

c) Collective Outlier
* A collective outlier is a set of data points that, taken together, deviates significantly from the rest of the data, even if the individual data points are not outliers.
* Such groups have to be studied carefully. Removing them may degrade the system, so the focus should be on improving the system rather than simply deleting the points.
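
A minimal sketch of the idea, under strong simplifying assumptions: slide a fixed-size window over a sequence and flag windows whose mean is unusually far from the overall mean, even though every single value is within the usual range. The function name `collective_outlier_windows`, the window size, the factor `k`, and the signal are hypothetical, and the limit treats points as independent, which real time series often are not.

```python
import statistics

def collective_outlier_windows(series, window=5, k=2.0):
    """Return start indices of windows whose mean is far from the overall mean."""
    overall_mean = statistics.mean(series)
    overall_std = statistics.pstdev(series)
    # Rough standard error of a window mean, treating points as independent.
    limit = k * overall_std / window ** 0.5
    starts = []
    for start in range(len(series) - window + 1):
        window_mean = statistics.mean(series[start:start + window])
        if abs(window_mean - overall_mean) > limit:
            starts.append(start)
    return starts

# Hypothetical signal: cycles through 10, 11, 9, with a run of six 11s in the middle.
series = [10, 11, 9] * 4 + [11] * 6 + [10, 11, 9] * 4
print(collective_outlier_windows(series))
```

Every value in this signal is 9, 10, or 11, so no single point stands out. Only the windows that lie entirely inside the run of 11s are flagged, which is what makes the run a collective outlier rather than a set of point outliers.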
