How can you handle categorical data with a large number of categories?


1 Answer


Good teacher teaching online Class 9 and Class 10 CBSE

Reduce the number of categories first, then apply one-hot encoding. You don't need to abandon one-hot encoding entirely. Look for meaningful ways to map your values onto a smaller set of broader categories; for example, hundreds of city names could be mapped to a handful of regions. Then one-hot encode the newly mapped categories.
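
As a rough illustration of that idea, here is a minimal sketch using pandas. The city data, the `CITY_TO_REGION` mapping, and the `to_region` helper are all made up for this example; in practice the grouping should come from what is meaningful in your own data.

```python
import pandas as pd

# Hypothetical example data: a column with many distinct city names.
df = pd.DataFrame(
    {"city": ["Delhi", "Mumbai", "Pune", "Kochi", "Chennai", "Agra", "Delhi"]}
)

# Hypothetical hand-written mapping from each city to a broader region.
CITY_TO_REGION = {
    "Delhi": "North", "Agra": "North",
    "Mumbai": "West", "Pune": "West",
    "Kochi": "South", "Chennai": "South",
}


def to_region(city_series, mapping, other_label="Other"):
    """Map fine-grained categories to broader groups.

    Values missing from the mapping fall into `other_label`.
    """
    return city_series.map(lambda city: mapping.get(city, other_label))


# Step 1: collapse the many city values into a few regions.
df["region"] = to_region(df["city"], CITY_TO_REGION)

# Step 2: one-hot encode the smaller set of categories.
encoded = pd.get_dummies(df["region"], prefix="region")
print(encoded)
```

Here six distinct cities collapse into three regions, so the encoded table has three columns instead of six. Any city not covered by the mapping lands in a single "Other" bucket rather than creating a new column.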




