Question 1

What is a good Matthews correlation coefficient value?

Accepted Answer

MCC runs from −1 to +1. As a rough guide: above 0.7 is strong, 0.4–0.7 is moderate, 0.2–0.4 is weak, and anything near 0 is no better than random guessing. What counts as 'good' depends on the problem — a 0.4 on a hard, imbalanced medical task can be more valuable than a 0.9 on an easy one.
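The rough bands above can be sketched as a small lookup. This is only an illustration: the function name `describe_mcc` is hypothetical, the handling of values exactly on a boundary (0.7, 0.4, 0.2) is an assumption, and so is the choice to treat anything between −0.2 and 0.2 as "near zero", since the guide does not name a cut-off there.

```python
def describe_mcc(value: float) -> str:
    """Map an MCC value to the rough verbal bands from the answer above."""
    if not -1.0 <= value <= 1.0:
        raise ValueError("MCC must lie in [-1, 1]")
    if value > 0.7:
        return "strong"
    if value >= 0.4:   # assumption: boundary values fall in the higher band
        return "moderate"
    if value >= 0.2:
        return "weak"
    if value > -0.2:   # assumption: symmetric 'near zero' window
        return "near zero (about as good as random guessing)"
    return "negative (anti-correlated with the labels)"


for v in (0.85, 0.55, 0.3, 0.05, -0.6):
    print(v, "->", describe_mcc(v))
```

The labels describe the number only; whether a given band is good enough still depends on the problem.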

Question 2

How do you calculate MCC from a confusion matrix?

Accepted Answer

Take the four cells of the confusion matrix: TP, TN, FP and FN. The numerator is (TP·TN) − (FP·FN). The denominator is the square root of (TP+FP)(TP+FN)(TN+FP)(TN+FN). MCC is the numerator divided by the denominator. If any of those four sums is zero, the denominator is zero and MCC is strictly undefined; by convention it is reported as 0.
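As a minimal sketch of that recipe in plain Python (the function name `mcc_from_counts` and the example counts are made up for illustration):

```python
import math


def mcc_from_counts(tp: int, tn: int, fp: int, fn: int) -> float:
    """Binary MCC computed from the four confusion-matrix cells."""
    numerator = tp * tn - fp * fn
    # Product of the four marginal sums: predicted positives, actual positives,
    # actual negatives, predicted negatives.
    denominator_sq = (tp + fp) * (tp + fn) * (tn + fp) * (tn + fn)
    if denominator_sq == 0:
        # Zero denominator: follow the convention of reporting 0.
        return 0.0
    return numerator / math.sqrt(denominator_sq)


# Hypothetical counts, chosen only to exercise the formula.
example = mcc_from_counts(tp=90, tn=80, fp=20, fn=10)
print(round(example, 3))  # 7000 / sqrt(99_000_000), roughly 0.704
```

The zero check is done on the product before taking the square root, so the undefined case never reaches a division.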

Question 3

Why is MCC better than accuracy for imbalanced datasets?

Accepted Answer

Accuracy counts correct predictions, so on a 95%-negative dataset a model that always predicts 'negative' scores 95% while learning nothing. MCC uses all four cells of the confusion matrix and only rises when the model gets both classes right, so the same lazy model scores 0 — an honest verdict. This is the central argument in Chicco & Jurman (2020).
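The 95%-negative scenario can be checked with a few lines of arithmetic. This sketch assumes a hypothetical dataset of 1,000 samples (950 negative, 50 positive) and a model that always predicts 'negative'; the helper names are illustrative.

```python
import math


def accuracy(tp: int, tn: int, fp: int, fn: int) -> float:
    return (tp + tn) / (tp + tn + fp + fn)


def mcc(tp: int, tn: int, fp: int, fn: int) -> float:
    denominator_sq = (tp + fp) * (tp + fn) * (tn + fp) * (tn + fn)
    if denominator_sq == 0:
        return 0.0  # convention for the undefined case
    return (tp * tn - fp * fn) / math.sqrt(denominator_sq)


# Always-negative model: every positive is missed, every negative is "right".
lazy = dict(tp=0, tn=950, fp=0, fn=50)

lazy_accuracy = accuracy(**lazy)
lazy_mcc = mcc(**lazy)
print(lazy_accuracy)  # 0.95
print(lazy_mcc)       # 0.0
```

Here the MCC of 0 comes from the zero-denominator convention, because the model never predicts a positive and TP+FP is 0.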

Question 4

Can the Matthews correlation coefficient be negative?

Accepted Answer

Yes. A negative MCC means predictions disagree with the truth more often than chance — the classifier is anti-correlated with the labels. A value near −1 usually signals that the positive and negative labels are swapped somewhere in your pipeline. Flipping the predictions would turn a strong negative MCC into a strong positive one.
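A short sketch of the flipping argument, using made-up counts: inverting every prediction turns each TP into an FN and each FP into a TN (and vice versa), which negates the numerator and leaves the denominator unchanged.

```python
import math


def mcc(tp: int, tn: int, fp: int, fn: int) -> float:
    denominator_sq = (tp + fp) * (tp + fn) * (tn + fp) * (tn + fn)
    if denominator_sq == 0:
        return 0.0  # convention for the undefined case
    return (tp * tn - fp * fn) / math.sqrt(denominator_sq)


# Hypothetical classifier that is mostly wrong on both classes.
tp, fn, fp, tn = 5, 45, 40, 10
original = mcc(tp=tp, tn=tn, fp=fp, fn=fn)

# Flip every prediction: old FN become TP, old TN become FP, and so on.
flipped = mcc(tp=fn, tn=fp, fp=tn, fn=tp)

print(round(original, 3), round(flipped, 3))
```

With these counts the original score is about −0.70 and the flipped score is about +0.70.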

Question 5

What does an MCC of 0 mean?

Accepted Answer

Zero means the predictions carry no usable information about the true labels — the classifier is no better than random guessing. It also appears in the degenerate case where the model predicts a single class for every sample: the denominator is zero, so MCC is reported as 0 by the scikit-learn convention.
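To illustrate the first case (no usable information, but a non-zero denominator), here is a sketch with hypothetical counts where the model predicts 'positive' for half of the positives and half of the negatives, so its output is independent of the truth.

```python
import math


def mcc(tp: int, tn: int, fp: int, fn: int) -> float:
    denominator_sq = (tp + fp) * (tp + fn) * (tn + fp) * (tn + fn)
    if denominator_sq == 0:
        return 0.0  # convention for the undefined case
    return (tp * tn - fp * fn) / math.sqrt(denominator_sq)


# 20 actual positives, 80 actual negatives; half of each predicted positive.
uninformative = mcc(tp=10, fn=10, fp=40, tn=40)
print(uninformative)  # numerator is 10*40 - 40*10 = 0
```

Unlike the single-class case, the zero here comes from the numerator: TP·TN and FP·FN cancel exactly.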

Question 6

What is the difference between MCC and the F1 score?

Accepted Answer

F1 is the harmonic mean of precision and recall and ignores true negatives entirely, so it can look good even when the negative class is handled poorly. MCC uses TP, TN, FP and FN together and is symmetric: swapping which class you call 'positive' leaves its value unchanged, whereas F1 can change substantially. On imbalanced data MCC is generally the more honest single number.
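The contrast can be made concrete with a sketch on hypothetical counts: 90 positives and 10 negatives, where the model handles the negative class badly. The helper names are illustrative.

```python
import math


def f1(tp: int, fp: int, fn: int) -> float:
    # Harmonic mean of precision and recall; TN does not appear at all.
    return 2 * tp / (2 * tp + fp + fn)


def mcc(tp: int, tn: int, fp: int, fn: int) -> float:
    denominator_sq = (tp + fp) * (tp + fn) * (tn + fp) * (tn + fn)
    if denominator_sq == 0:
        return 0.0  # convention for the undefined case
    return (tp * tn - fp * fn) / math.sqrt(denominator_sq)


tp, fn, fp, tn = 85, 5, 9, 1  # only 1 of 10 negatives is recognised

f1_score = f1(tp, fp, fn)
mcc_score = mcc(tp, tn, fp, fn)

# Relabel the classes: the old negatives are now called 'positive'.
f1_swapped = f1(tp=tn, fp=fn, fn=fp)
mcc_swapped = mcc(tp=tn, tn=tp, fp=fn, fn=fp)

print(round(f1_score, 3), round(mcc_score, 3))
print(round(f1_swapped, 3), round(mcc_swapped, 3))
```

With these counts F1 is above 0.9 while MCC stays below 0.1; after relabelling, F1 collapses but MCC is identical.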

Question 7

Does this tool work for multiclass classification?

Accepted Answer

This version is binary only (two classes). scikit-learn does generalise MCC to K classes using the full K×K contingency matrix, but that is out of scope here to keep the inputs simple. For multiclass work, pass the true and predicted label arrays to scikit-learn's matthews_corrcoef, which handles the K-class case directly.
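For readers who only have a K×K matrix to hand, the following is a plain-Python sketch of the commonly cited K-class form of MCC. It assumes rows are true classes and columns are predicted classes; the function name `multiclass_mcc` is hypothetical, and this is not scikit-learn's own code.

```python
import math


def multiclass_mcc(matrix: list[list[int]]) -> float:
    """K-class MCC from a K x K matrix (rows = true, columns = predicted)."""
    k = len(matrix)
    true_totals = [sum(row) for row in matrix]
    pred_totals = [sum(matrix[i][j] for i in range(k)) for j in range(k)]
    correct = sum(matrix[i][i] for i in range(k))  # diagonal
    total = sum(true_totals)

    numerator = correct * total - sum(
        p * t for p, t in zip(pred_totals, true_totals)
    )
    denominator_sq = (total**2 - sum(p * p for p in pred_totals)) * (
        total**2 - sum(t * t for t in true_totals)
    )
    if denominator_sq == 0:
        return 0.0  # same zero-denominator convention as the binary case
    return numerator / math.sqrt(denominator_sq)


# With K = 2 this reduces to the binary formula (here TN=80, FP=20, FN=10, TP=90).
binary_check = multiclass_mcc([[80, 20], [10, 90]])
# A perfect 3-class prediction puts everything on the diagonal.
perfect = multiclass_mcc([[5, 0, 0], [0, 7, 0], [0, 0, 3]])
print(round(binary_check, 3), perfect)
```

When the raw labels are available, calling the library function on them is the simpler route.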

Question 8

Is my data sent to a server?

Accepted Answer

No. The whole calculation runs in your browser with plain arithmetic — nothing is uploaded, stored, or logged. You can disconnect from the network and it still works, which makes it safe for confidential model-evaluation data.

Matthews Correlation Coefficient (MCC) Calculator
