What is a good silhouette score?

As a rule of thumb from Kaufman and Rousseeuw (1990): above 0.70 is strong structure, 0.51–0.70 is reasonable, 0.26–0.50 is weak and may be artificial, and 0.25 or below shows no substantial structure. A negative score means points are, on average, closer to a neighbouring cluster than their own — the labelling is probably wrong. So 0.55 is a reasonable result; 0.71+ is what you want.
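The bands above can be written as a small lookup. This is only a sketch: the function name `interpret_silhouette` is hypothetical, and because the quoted ranges are given to two decimals, treating the cut points as strict thresholds at 0.70, 0.50 and 0.25 is an assumption.

```python
def interpret_silhouette(score: float) -> str:
    """Map an overall silhouette score to the Kaufman & Rousseeuw (1990) bands."""
    if not -1.0 <= score <= 1.0:
        raise ValueError("silhouette scores lie between -1 and 1")
    # Assumption: values strictly above each cut point fall in the higher band.
    if score > 0.70:
        return "strong structure"
    if score > 0.50:
        return "reasonable structure"
    if score > 0.25:
        return "weak structure, possibly artificial"
    return "no substantial structure"


for value in (0.9448, 0.55, 0.30, 0.10):
    print(value, "->", interpret_silhouette(value))
```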

How is the silhouette score calculated?

For each point i, compute a(i), the mean distance to the other points in its own cluster, and b(i), the smallest mean distance to any other cluster. The point's silhouette is s(i) = (b(i) − a(i)) / max(a(i), b(i)), which lies between −1 and 1. The overall silhouette score is the average of s(i) over every point. A point alone in its cluster gets s(i) = 0.
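The definition above translates almost line for line into code. The sketch below uses plain Python with Euclidean distance. It is not the code this page runs, and the names `silhouette_values` and `mean_silhouette` are made up for illustration. Returning 0 when a(i) and b(i) are both zero (exact duplicate points) is an assumption.

```python
import math
from collections import defaultdict


def silhouette_values(points, labels):
    """Return s(i) for every point, using Euclidean distance."""
    if len(points) != len(labels):
        raise ValueError("need exactly one label per point")
    clusters = defaultdict(list)  # label -> indices of its members
    for index, label in enumerate(labels):
        clusters[label].append(index)
    if len(clusters) < 2:
        raise ValueError("silhouette needs at least two clusters")

    values = []
    for i, label in enumerate(labels):
        own = clusters[label]
        if len(own) == 1:
            values.append(0.0)  # a point alone in its cluster gets s(i) = 0
            continue
        # a(i): mean distance to the other members of the same cluster
        a = sum(math.dist(points[i], points[j]) for j in own if j != i) / (len(own) - 1)
        # b(i): smallest mean distance to the members of any other cluster
        b = min(
            sum(math.dist(points[i], points[j]) for j in members) / len(members)
            for other, members in clusters.items()
            if other != label
        )
        denom = max(a, b)
        # Assumption: duplicates with a = b = 0 are scored as 0
        values.append((b - a) / denom if denom > 0 else 0.0)
    return values


def mean_silhouette(points, labels):
    """Overall score: the plain average of s(i) over every point."""
    values = silhouette_values(points, labels)
    return sum(values) / len(values)


points = [(1.0,), (2.0,), (10.0,), (11.0,)]
labels = [0, 0, 1, 1]
print(round(mean_silhouette(points, labels), 4))  # prints 0.8885
```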

What does a negative silhouette score mean?

A negative s(i) for a single point means it is, on average, closer to a neighbouring cluster than to the cluster it was assigned to, which suggests a misassignment. A negative overall score means the negative values outweigh the positive ones across the dataset, so the clustering as a whole is poor: the labels do not match the natural grouping of the data. This calculator highlights every negative row so you can spot the offending points.

What is the difference between the silhouette score and the silhouette coefficient?

They usually mean the same thing — the average of the per-point silhouette values s(i). In scikit-learn, silhouette_score returns that overall mean, while silhouette_samples returns the individual s(i) values. Some texts call a single point's s(i) its silhouette coefficient and reserve silhouette score for the average. This tool gives you both: the overall score and the full per-sample table.

Can the silhouette score choose the number of clusters (k)?

Yes — it is one of the standard ways. Run your clustering for several values of k, compute the silhouette score for each labelling, and pick the k with the highest score. Because the score needs no ground-truth labels, it works as an internal validity index. This tool scores one labelling at a time, so paste each k's result in turn and compare the numbers.
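As a rough illustration of that comparison, the sketch below scores two candidate labellings of the same four points and keeps the k with the higher score. The labellings are hypothetical stand-ins for what a clustering run might return at each k, and `mean_silhouette` is an illustrative plain-Python helper, not this tool's code.

```python
import math


def mean_silhouette(points, labels):
    """Plain-Python mean silhouette with Euclidean distance (illustrative only)."""
    groups = {}
    for index, label in enumerate(labels):
        groups.setdefault(label, []).append(index)
    total = 0.0
    for i, label in enumerate(labels):
        own = groups[label]
        if len(own) == 1:
            continue  # a single-point cluster contributes s(i) = 0
        a = sum(math.dist(points[i], points[j]) for j in own if j != i) / (len(own) - 1)
        b = min(
            sum(math.dist(points[i], points[j]) for j in idx) / len(idx)
            for other, idx in groups.items()
            if other != label
        )
        if max(a, b) > 0:
            total += (b - a) / max(a, b)
    return total / len(points)


points = [(1.0,), (2.0,), (10.0,), (11.0,)]
# Hypothetical labellings, as if produced by a clustering run at each k.
candidates = {
    2: [0, 0, 1, 1],
    3: [0, 0, 1, 2],
}
scores = {k: mean_silhouette(points, labels) for k, labels in candidates.items()}
best_k = max(scores, key=scores.get)
print(scores)
print("best k:", best_k)  # prints best k: 2
```

Here k = 3 scores lower because splitting 10 and 11 apart leaves two single-point clusters, each contributing 0 to the average.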

Does this match scikit-learn's silhouette_score?

Yes. It uses the same definition — the mean of (b − a) / max(a, b) over all points — with Euclidean distance by default and the same convention that a single-point cluster contributes s(i) = 0. For the same points, labels, and metric, the number here equals sklearn.metrics.silhouette_score. The tool also re-derives the score from the per-cluster means as an independent cross-check.

What distance metrics does it support?

Euclidean (L2) and Manhattan (L1). Euclidean is the default and matches scikit-learn's default metric. Manhattan sums the absolute coordinate differences, whereas Euclidean takes the square root of the summed squared differences; Manhattan is sometimes preferred for high-dimensional or sparse data. Cosine and precomputed-distance metrics are not in this version; for those, use scikit-learn directly with metric set accordingly.
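For reference, the two metrics differ only in how the coordinate differences are combined. A minimal sketch, with illustrative function names:

```python
import math


def euclidean(p, q):
    """L2: square root of the summed squared coordinate differences."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(p, q)))


def manhattan(p, q):
    """L1: sum of the absolute coordinate differences."""
    return sum(abs(x - y) for x, y in zip(p, q))


p, q = (0.0, 0.0), (3.0, 4.0)
print(euclidean(p, q))  # prints 5.0
print(manhattan(p, q))  # prints 7.0
```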

Is there a limit on how many points I can paste?

The tool scores up to 2,000 points at once. Silhouette uses every pairwise distance, so the work grows with the square of the number of points; 2,000 keeps it instant in the browser. For larger datasets, scikit-learn's sample_size option estimates the score from a random subset — a sensible approach above a few thousand points.
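To make the quadratic growth concrete, the sketch below counts the distinct point pairs for a given input size and draws a random subset of points with their labels. Both helpers are hypothetical illustrations, not scikit-learn's implementation of sample_size. Note that a random subset can drop a small cluster entirely or leave it with a single point.

```python
import random


def pair_count(n: int) -> int:
    """Number of distinct point pairs among n points."""
    return n * (n - 1) // 2


def subsample(points, labels, size, seed=0):
    """Pick `size` points (with their labels) uniformly at random, without replacement."""
    if size >= len(points):
        return list(points), list(labels)
    chosen = sorted(random.Random(seed).sample(range(len(points)), size))
    return [points[i] for i in chosen], [labels[i] for i in chosen]


print(pair_count(2000))   # prints 1999000
print(pair_count(20000))  # prints 199990000
```

Ten times as many points means roughly a hundred times as many distances, which is why estimating the score from a subset becomes attractive for larger datasets.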

Does this calculator send my data anywhere?

No. Parsing your points, computing every distance, and averaging the silhouettes all happen in your browser with plain JavaScript. Nothing is uploaded, logged, or stored, so you can paste real feature vectors without concern. The page keeps working offline once it has loaded.

Silhouette Score Calculator

Paste your data points and cluster labels to get the silhouette score, the per-cluster means, and the full a(i)/b(i)/s(i) working for every point — with misassigned points flagged. Matches scikit-learn silhouette_score. No signup, nothing uploaded.

By Induwara Ashinsana, Executive Director, Ryzera Technologies · Updated Jun 11, 2026

Silhouette score calculator

Data points

One point per line. Coordinates separated by commas or spaces.

Cluster labels

One label per point, same order. Numbers or names (e.g. 0, 0, 1).

Silhouette score

0.9448

Strong structure

Clusters are dense and clearly separated. This labelling holds up well.

Cross-check. The mean of all per-point s(i) is 0.9448; computed independently as the size-weighted average of the per-cluster means it is 0.9448. The two aggregations reconcile, as they must.

Per-cluster mean silhouette

Cluster	Size	Mean s(i)	Reading
0	2	0.9289	Strong structure
1	2	0.9606	Strong structure

Per-sample silhouette

Point	Cluster	a(i)	b(i)	Nearest	s(i)
#1	0	0.5099	7.0725	1	0.9279
#2	0	0.5099	7.2801	1	0.9300
#3	1	0.2828	7.3157	0	0.9613
#4	1	0.2828	7.0370	0	0.9598

Worked formula for one point

Point #1 = [5, 2] · cluster 0 · euclidean (l2) distance

mean dist → cluster 0 = 0.5099 = a(i)

mean dist → cluster 1 = 7.0725 = b(i) (nearest)

s(i) = (b − a) / max(a, b) = (7.0725 − 0.5099) / 7.0725 = 0.9279
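The same arithmetic can be repeated for every row of the per-sample table. The sketch below takes the a(i) and b(i) values as displayed (already rounded to four decimals) and recomputes s(i); the helper name `silhouette_from_ab` is illustrative. In general, recomputing from rounded inputs could differ from the tool's figure in the last digit.

```python
# (point, a(i), b(i)) copied from the per-sample table
rows = [
    ("#1", 0.5099, 7.0725),
    ("#2", 0.5099, 7.2801),
    ("#3", 0.2828, 7.3157),
    ("#4", 0.2828, 7.0370),
]


def silhouette_from_ab(a, b):
    """s(i) = (b - a) / max(a, b) for one point."""
    return (b - a) / max(a, b)


for name, a, b in rows:
    print(name, f"{silhouette_from_ab(a, b):.4f}")
# prints 0.9279, 0.9300, 0.9613 and 0.9598, matching the table
```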

Method: a(i) = mean intra-cluster distance, b(i) = nearest other-cluster mean, s(i) = (b − a) / max(a, b); score = mean s(i). Matches scikit-learn silhouette_score. Sources: scikit-learn docs, Rousseeuw (1987), Kaufman & Rousseeuw (1990). Nothing leaves this page.

How it works

The silhouette score measures how well each point fits its assigned cluster compared with the nearest neighbouring cluster. It needs no ground-truth labels, which makes it an internal validity index: a standard way to judge a k-means, hierarchical, or DBSCAN result, and to compare different choices of k. It was introduced by Peter Rousseeuw in 1987 and is what scikit-learn returns from silhouette_score.

For a point i in cluster C, the tool computes three quantities:

a(i) — cohesion. The mean distance from i to every other point in its own cluster: a(i) = (1 / (|C| − 1)) · Σ d(i, j) over j in C, j ≠ i. Lower is better — the point is close to its cluster-mates.
b(i) — separation. For every other cluster C′, take the mean distance from i to all of its points, then keep the smallest: b(i) = min over C′ of (1 / |C′|) · Σ d(i, j). That nearest rival cluster is the one the point would most plausibly belong to instead.
s(i) — the silhouette. s(i) = (b(i) − a(i)) / max(a(i), b(i)), which falls between −1 and 1. Near 1 means the point is far from other clusters and snug in its own; near 0 means it sits on a boundary; negative means it is closer to a neighbour and probably misassigned. A point alone in its cluster gets s(i) = 0 by convention.

The overall score is the plain average of s(i) across all N points. Distance d(·,·) is Euclidean, √(Σ (xₖ − yₖ)²), by default, matching scikit-learn, or Manhattan, Σ |xₖ − yₖ|, when selected.

The computation is O(N²) in the number of points, which stays instant in the browser for the inputs of up to 2,000 points this tool accepts. As a consistency check, the tool also recomputes the score a second way, as the size-weighted average of the per-cluster mean silhouettes, and confirms the two routes agree.

To read the result, common practice (Kaufman & Rousseeuw, 1990) treats above 0.70 as strong structure, 0.51–0.70 as reasonable, 0.26–0.50 as weak or artificial, and 0.25 or below as no substantial structure.
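The two aggregation routes mentioned above can be compared directly. In this sketch the per-point values are the rounded s(i) figures from the sample output on this page, and the function name `cross_check` is hypothetical.

```python
import math
from collections import defaultdict


def cross_check(values, labels):
    """Return (plain mean of s(i), size-weighted mean of per-cluster means)."""
    plain_mean = sum(values) / len(values)
    by_cluster = defaultdict(list)
    for value, label in zip(values, labels):
        by_cluster[label].append(value)
    weighted = sum(
        len(members) * (sum(members) / len(members))  # cluster size x cluster mean
        for members in by_cluster.values()
    ) / len(values)
    return plain_mean, weighted


values = [0.9279, 0.9300, 0.9613, 0.9598]  # s(i) for points #1 to #4
labels = [0, 0, 1, 1]
plain, weighted = cross_check(values, labels)
print(math.isclose(plain, weighted))  # prints True
```

The two numbers agree because weighting each cluster mean by its size restores the original sum of s(i); a mismatch would point to a bookkeeping error rather than a property of the data.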

Worked examples

Well-separated clusters (1-D) — points 1, 2, 10, 11; labels 0, 0, 1, 1

Point 1 (cl 0): a = |1−2| = 1; b = mean(9, 10) = 9.5; s = 8.5/9.5 = 0.8947
Point 2 (cl 0): a = 1; b = mean(8, 9) = 8.5; s = 7.5/8.5 = 0.8824
Point 10 (cl 1): a = 1; b = 8.5; s = 0.8824
Point 11 (cl 1): a = 1; b = 9.5; s = 0.8947
Score = (0.8947 + 0.8824 + 0.8824 + 0.8947)/4 = 0.8885 → strong structure

One misassigned point (2-D) — A = {(0,0),(0,1)}, B = {(0,2),(20,20)}

(0,0) A: a = 1; b = mean(2, √800=28.2843) = 15.1421; s = 0.9340
(0,1) A: a = 1; b = mean(1, √761=27.5862) = 14.2931; s = 0.9300
(0,2) B: a = √724 = 26.9072; b = mean(2, 1) = 1.5; s = −0.9443 (flagged)
(20,20) B: a = 26.9072; b = mean(28.2843, 27.5862) = 27.9353; s = 0.0368
Score = (0.9340 + 0.9300 − 0.9443 + 0.0368)/4 = 0.2391 → no substantial structure (0.25 or below)
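The flagging step in this example amounts to collecting every point whose s(i) is negative. A minimal sketch for these four points, assuming Euclidean distance and that every cluster has at least two members (true here, so the single-point rule is not needed):

```python
import math

points = [(0, 0), (0, 1), (0, 2), (20, 20)]
labels = ["A", "A", "B", "B"]


def mean_distance(p, indices):
    """Mean Euclidean distance from p to the points at the given indices."""
    return sum(math.dist(p, points[j]) for j in indices) / len(indices)


members = {}
for index, label in enumerate(labels):
    members.setdefault(label, []).append(index)

flagged = []
for i, (p, label) in enumerate(zip(points, labels)):
    a = mean_distance(p, [j for j in members[label] if j != i])
    b = min(mean_distance(p, idx) for other, idx in members.items() if other != label)
    s = (b - a) / max(a, b)
    if s < 0:  # closer, on average, to a neighbouring cluster
        flagged.append((p, round(s, 4)))

print(flagged)  # prints [((0, 2), -0.9443)]
```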

Single-point cluster (s = 0 convention) — points 0, 0.5, 10; labels 0, 0, 1

Point 0 (cl 0): a = 0.5; b = dist to 10 = 10; s = 9.5/10 = 0.9500
Point 0.5 (cl 0): a = 0.5; b = 9.5; s = 9/9.5 = 0.9474
Point 10 (cl 1): only member of its cluster ⇒ s = 0 (scikit-learn rule)
Score = (0.9500 + 0.9474 + 0)/3 = 0.6325 → reasonable structure

Frequently asked questions

Sources & references

The formulas on this page were last cross-checked against these sources on 2026-06-11. The silhouette score is a stable mathematical definition, so this tool needs no rate or schedule updates — only the worked examples are periodically re-reconciled against scikit-learn.
