How do you calculate Gini impurity in a decision tree?

Take the class counts at the node, divide each by the total to get proportions pₖ, square each proportion, add the squares, and subtract from one: G = 1 − Σ pₖ². A node with counts [6, 4] gives proportions [0.6, 0.4], so G = 1 − (0.36 + 0.16) = 0.48.

What is the difference between Gini impurity and entropy?

Both measure how mixed a node is. Gini impurity uses 1 − Σ pₖ²; Shannon entropy uses −Σ pₖ log₂ pₖ (in bits). Gini is slightly cheaper to compute and is scikit-learn's default. In practice they almost always select the same split, so the choice rarely changes the final tree.

What is the maximum value of Gini impurity?

For K classes the maximum is 1 − 1/K, reached when every class is equally frequent. For 2 classes that is 0.5, for 3 classes about 0.667, for 4 classes 0.75. The minimum is always 0, for a pure node where every sample belongs to one class.

What is Gini gain and how does CART choose a split?

Gini gain is the impurity decrease from a split: ΔG = G_parent − Σ (Nⱼ/N)·Gⱼ, the parent's Gini minus the sample-weighted average of the children's Gini. CART evaluates every candidate split and picks the one with the largest Gini gain, then repeats on each child until a stopping rule is hit.

Is Gini impurity the same as the Gini coefficient in economics?

No. They share Corrado Gini's name but measure different things. The Gini coefficient measures income or wealth inequality across a population (the Lorenz-curve statistic). Gini impurity measures class mixedness at a decision-tree node. This tool computes the machine-learning impurity, not the economics coefficient.

Can Gini impurity handle more than two classes?

Yes. The formula G = 1 − Σ pₖ² works for any number of classes K ≥ 2. Enter one number per class, for example 3, 5, 2 for three classes. Single-node mode accepts any K; the split mode here handles the standard binary parent-to-two-children case typed inline.

Does this calculator match scikit-learn exactly?

Yes. It uses the same definition as DecisionTreeClassifier(criterion="gini") — G = 1 − Σ pₖ² per node and the sample-weighted impurity decrease for a split. Results agree to floating-point precision; only display rounding (your chosen decimal places) differs from a raw NumPy print.

What does a Gini impurity of 0 mean?

Zero means the node is pure — every sample belongs to a single class, so there is nothing left to separate. A decision tree stops splitting a branch once it reaches (or gets close enough to) a pure node, because no further split can reduce the impurity below zero.

Machine Learning · Decision Trees

Gini Impurity Calculator

Compute the Gini impurity of a decision-tree node from its class counts or proportions, with the full 1 − Σ pₖ² working, a Shannon-entropy comparison, and the Gini gain of a candidate split. Matches scikit-learn. No signup, runs in your browser.

By Induwara Ashinsana— Executive Director, Ryzera TechnologiesUpdated Jun 10, 2026

Gini impuritynode & split

Matches scikit-learn

Input as

Mode

Class counts (one number per class)

Comma- or space-separated, e.g. 6, 4 for 6 of class A and 4 of class B.

Decimal places

Examples

Gini impurity

0.4800

Shannon entropy

0.9710 bits

Max impurity (1 − 1/2)

0.5000

2 classes

Σ pₖ²

0.5200

N = 10

G = 1 − (0.6000² + 0.4000²) = 1 − 0.5200 = 0.4800

Cross-checked against the sum form Σ pₖ(1 − pₖ) = 0.4800.

Per-class working

Class	Count nₖ	Proportion pₖ	pₖ²
A	6	0.6000	0.3600
B	4	0.4000	0.1600
Σ pₖ²			0.5200
G = 1 − Σ pₖ²			0.4800

Gini vs entropy: the two criteria nearly always pick the same split. Entropy is shown for comparison — see the ML metrics tools for the Shannon entropy and cross-entropy calculators.

Formula G = 1 − Σ pₖ²and the weighted impurity-decrease (Gini gain) follow scikit-learn's Decision Trees mathematical formulation. Last verified 2026-06-10. Full sources are listed below the calculator.

How it works

Gini impurity measures how mixed the class labels are at a node in a classification tree. It is the default splitting criterion in scikit-learn's DecisionTreeClassifier and the diversity index introduced by Breiman, Friedman, Olshen and Stone in Classification and Regression Trees (CART, 1984).

Let a node hold counts n₁, …, n_K over K classes, with total N = Σ nₖ. The calculation is four steps:

Class proportions. For each class, pₖ = nₖ / N. These are the fractions of samples in the node belonging to each class.
Square each proportion to get pₖ², then add them: Σ pₖ². This sum is the probability that two samples drawn at random from the node share a class.
Gini impurity. G = 1 − Σ pₖ². Equivalently G = Σ pₖ(1 − pₖ) — the probability that two random draws differ. The calculator computes both forms and shows they agree, as a built-in cross-check.
Range. G lies in [0, 1 − 1/K]. It is 0 for a pure node (one class only) and reaches its maximum 1 − 1/K when all classes are equally frequent.

For comparison the tool also reports Shannon entropy, H = −Σ pₖ log₂ pₖ bits, using the convention 0·log₂0 = 0 so pure classes do not produce NaN. Entropy is the alternative splitting criterion (scikit-learn's criterion="entropy"); in practice Gini and entropy nearly always choose the same split.

In split mode, the tool evaluates a candidate split of a parent node into two children. It computes each child's Gini Gⱼ, the sample weights wⱼ = Nⱼ / N, the weighted child impurity Σ wⱼ·Gⱼ, and the Gini gain ΔG = G_parent − Σ wⱼ·Gⱼ. This impurity decrease is exactly what the CART algorithm maximises when it chooses which feature and threshold to split on: the larger the Gini gain, the better the split separates the classes.

Worked examples

Maximally mixed binary node — counts [50, 50]

N = 100, proportions p = [0.5, 0.5]
Squares: 0.5² = 0.25 and 0.5² = 0.25
Σ pₖ² = 0.25 + 0.25 = 0.50
G = 1 − 0.50 = 0.5 (the maximum 1 − 1/2 for two classes)
Entropy = −(0.5·log₂0.5 + 0.5·log₂0.5) = 1 bit

Skewed node — counts [3, 1]

N = 4, proportions p = [0.75, 0.25]
Squares: 0.75² = 0.5625 and 0.25² = 0.0625
Σ pₖ² = 0.5625 + 0.0625 = 0.625
G = 1 − 0.625 = 0.375
Entropy = −(0.75·log₂0.75 + 0.25·log₂0.25) ≈ 0.8113 bits

Split mode (Gini gain) — parent [6, 4] → [4, 0] | [2, 4]

Parent: G = 1 − (0.6² + 0.4²) = 1 − 0.52 = 0.48
Left child [4, 0]: pure, G = 0, weight = 4/10 = 0.4
Right child [2, 4]: p = [1/3, 2/3], G = 1 − (0.1111 + 0.4444) = 0.4444, weight = 6/10 = 0.6
Weighted child impurity = 0.4·0 + 0.6·0.4444 = 0.2667
Gini gain ΔG = 0.48 − 0.2667 = 0.2133

Frequently asked questions

Sources & references

The Gini impurity and Gini-gain formulas were last cross-checked against the scikit-learn documentation on 2026-06-10. These are standard, uncontested textbook formulas with no rates or policy that drift.

Related tools

LiveAI

Information Gain Calc

Compute the information gain and gain ratio of a decision-tree split from class counts, with every intermediate entropy step shown — parent entropy, per-child entropy, weighted child entropy, split information. Matches Quinlan's ID3/C4.5, runs in the browser.

Open tool

LiveAI

ROUGE Score Calculator

Calculate ROUGE-1, ROUGE-2, ROUGE-L and ROUGE-Lsum precision, recall and F1 between a generated summary and one or more references, entirely in your browser. Shows matched n-grams and the longest common subsequence. Matches Google rouge-score, no signup.

Open tool

LiveAI

Mutual Information Calc

Compute the mutual information I(X;Y) between two discrete variables from a count or joint-probability table, in bits or nats, with the full information-theory breakdown — marginal entropies H(X), H(Y), joint entropy H(X,Y), conditional entropies, normalized MI, and every per-cell term. Runs in your browser, matched to scikit-learn.

Open tool

Rate this tool

Be the first to rate

Comments & feedback

Spotted a bug or want an improvement? Tell us — our team reviews every comment, and good ideas get built. Comments are public and anonymous.

Found a bug, edge case, or want to suggest an improvement?

Email me at [email protected] — most fixes ship within 24 hours.