When choosing one feature from \(X_1, \ldots, X_n\) while building a Decision Tree, which of the following criteria is the most appropriate to maximize? (Here, \(H()\) means entropy, and \(P()\) means probability)

(a) \(P(Y | X_j)\)

(b) \(P(Y) - P(Y | X_j)\)

(c) \(H(Y) - H(Y | X_j)\)

(d) \(H(Y | X_j)\)

(e) \(H(Y) - P(Y)\)

1 Answer

Best answer
The most appropriate criterion to maximize when choosing a feature in a decision tree is \((c) \ H(Y) - H(Y | X_j)\).

Explanation:

\(H(Y)\) represents the entropy of the target variable \(Y\), measuring its uncertainty or randomness. \(H(Y | X_j)\) represents the conditional entropy of \(Y\) given a specific feature \(X_j\), indicating how much uncertainty remains about \(Y\) after knowing the value of \(X_j\).

Information Gain:

The difference between these two entropies, \(H(Y) - H(Y | X_j)\), is called the information gain associated with feature \(X_j\). It quantifies the reduction in uncertainty about \(Y\) achieved by knowing the value of \(X_j\).

Goal of Decision Trees:

Decision trees aim to create splits that reduce uncertainty about the target variable as much as possible. Therefore, maximizing the information gain, which means maximizing \(H(Y) - H(Y | X_j)\), is the most appropriate criterion for feature selection.
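To make the idea concrete, here is a minimal sketch (not from the original answer) that computes \(H(Y)\), \(H(Y \mid X)\), and the information gain for a toy categorical dataset. The function names and the example data are illustrative assumptions, not part of any particular library:

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy H(Y) of a sequence of labels, in bits."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def conditional_entropy(feature, labels):
    """H(Y | X): entropy of Y within each group of X, weighted by group size."""
    n = len(labels)
    groups = {}
    for x, y in zip(feature, labels):
        groups.setdefault(x, []).append(y)
    return sum(len(ys) / n * entropy(ys) for ys in groups.values())

def information_gain(feature, labels):
    """IG(Y; X) = H(Y) - H(Y | X): reduction in uncertainty about Y from X."""
    return entropy(labels) - conditional_entropy(feature, labels)

# Toy dataset: X splits Y perfectly, so all uncertainty is removed
# and the gain equals H(Y) = 1 bit.
X = ["a", "a", "b", "b"]
Y = [0, 0, 1, 1]
print(information_gain(X, Y))  # 1.0
```

A feature that carries no information about \(Y\) (e.g. `X = ["a", "b", "a", "b"]` with the same `Y`) yields a gain of 0, which is why a decision tree learner would never select it for a split.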
