What is the Levenshtein distance between two words?

It is the minimum number of single-character edits — insertions, deletions, or substitutions — needed to change one word into the other. For example, the distance between "kitten" and "sitting" is 3: substitute k→s, substitute e→i, then insert g. Each edit counts as 1.

How is edit distance calculated step by step?

Build a table where cell d[i][j] is the distance between the first i characters of string A and the first j of string B. Fill the first row and column with 0,1,2,…, then for every other cell take the minimum of the cell above plus 1 (deletion), the cell left plus 1 (insertion), and the diagonal plus 0 or 1 (match or substitution). The bottom-right cell is the answer.

What is the difference between Levenshtein distance and Hamming distance?

Hamming distance only counts positions where two equal-length strings differ, so it allows substitutions only. Levenshtein distance also allows insertions and deletions, so it works on strings of different lengths, and on equal-length strings it is never larger than the Hamming distance. Levenshtein is the more general edit metric.
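To make the contrast concrete, here is a minimal Python sketch (not the tool's own code). The function names `hamming` and `levenshtein` are illustrative, and the recursive, memoized form is chosen only because it mirrors the definition; it is suitable for short strings, since Python's recursion limit would be hit on long ones.

```python
from functools import lru_cache


def hamming(a: str, b: str) -> int:
    """Count positions where two equal-length strings differ."""
    if len(a) != len(b):
        raise ValueError("Hamming distance needs equal-length strings")
    return sum(x != y for x, y in zip(a, b))


def levenshtein(a: str, b: str) -> int:
    """Edit distance with unit-cost insertion, deletion, and substitution."""

    @lru_cache(maxsize=None)
    def dist(i: int, j: int) -> int:
        # Distance between the first i chars of a and the first j chars of b.
        if i == 0:
            return j  # build b's prefix with j insertions
        if j == 0:
            return i  # erase a's prefix with i deletions
        cost = 0 if a[i - 1] == b[j - 1] else 1
        return min(
            dist(i - 1, j) + 1,         # deletion
            dist(i, j - 1) + 1,         # insertion
            dist(i - 1, j - 1) + cost,  # substitution / match
        )

    return dist(len(a), len(b))


# A one-character shift changes every position but needs only two edits.
print(hamming("abcdef", "bcdefa"))      # 6
print(levenshtein("abcdef", "bcdefa"))  # 2
```

The shifted pair shows why the two metrics diverge: Hamming sees six mismatched positions, while Levenshtein deletes the leading "a" and inserts it at the end.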

How do you convert Levenshtein distance to a similarity percentage?

A common normalization is (1 − distance / max(lenA, lenB)) × 100. For "kitten" vs "sitting", that is (1 − 3/7) × 100 ≈ 57.1%. This similarity figure is a presentation convenience, not part of the formal Levenshtein definition, so different tools may normalize differently.
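The normalization above can be sketched in a few lines of Python. The helper name `similarity_percent` is hypothetical; the 100% result for two empty strings follows the convention this page states for the calculator, and other tools may choose differently.

```python
def similarity_percent(distance: int, len_a: int, len_b: int) -> float:
    """Normalize an edit distance to a 0-100 similarity score."""
    longest = max(len_a, len_b)
    if longest == 0:
        # Two empty strings are treated as identical (page convention).
        return 100.0
    return (1 - distance / longest) * 100


print(round(similarity_percent(3, 6, 7), 1))  # 57.1  (kitten vs sitting)
```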

Is Levenshtein distance the same as the Wagner–Fischer algorithm?

They are related but not identical. Levenshtein (1966) defined the distance metric. Wagner & Fischer (1974) gave the standard O(m×n) dynamic-programming algorithm that computes it, including the backtracking step that recovers the actual edit operations. This calculator implements the Wagner–Fischer method.

Why does the tool sometimes show a different edit script than I expected?

When several edit scripts share the same minimum distance, there is more than one optimal answer. This tool returns one of them using a fixed tie-break order — match, then substitution, then deletion, then insertion. The distance is always the same; only which specific minimal path is shown can differ between tools.

Do case and whitespace affect the distance?

By default the comparison is case-sensitive and counts whitespace, so "Hello" and "hello" have distance 1. Toggle "case-insensitive" to fold letter case before comparing, and toggle "ignore whitespace" to strip all spaces, tabs, and newlines first. Both toggles are applied before the matrix is built.
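A rough Python sketch of the two toggles follows. The function name `preprocess` and its parameters are illustrative, and the exact rules are assumptions: here case folding is plain lowercasing, and "whitespace" means whatever Python's `str.split()` treats as whitespace, which may differ from what the tool strips.

```python
def preprocess(text: str, case_insensitive: bool = False,
               ignore_whitespace: bool = False) -> str:
    """Apply the optional toggles before any distance is computed."""
    if case_insensitive:
        text = text.lower()
    if ignore_whitespace:
        # split() with no argument breaks on runs of spaces, tabs, newlines.
        text = "".join(text.split())
    return text


print(preprocess("Hello World", case_insensitive=True, ignore_whitespace=True))
# helloworld
```

With both toggles off the string passes through unchanged, which is why "Hello" and "hello" still differ by one substitution in the default mode.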

Where is Levenshtein distance used in real software?

Spell-checkers suggest words within a small edit distance of a misspelling; fuzzy search and record deduplication match entries below a distance threshold; DNA-alignment and text-diff tools use the same alignment idea; and speech-recognition evaluation applies the same distance to sequences of words to compute word error rate. A threshold of 1 or 2 is common for typo tolerance.
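As an illustration of threshold matching, the Python sketch below ranks candidate words for a typo. The names `within_distance` and `suggest`, the sample vocabulary, and the default limit of 2 are all hypothetical; real spell-checkers typically use indexed dictionaries rather than a linear scan.

```python
from typing import Iterable, List, Optional


def within_distance(a: str, b: str, limit: int) -> Optional[int]:
    """Return the edit distance if it is <= limit, otherwise None."""
    if abs(len(a) - len(b)) > limit:
        return None  # the length gap alone already needs more edits
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        if min(curr) > limit:
            return None  # the final cell can never drop below a row minimum
        prev = curr
    return prev[-1] if prev[-1] <= limit else None


def suggest(word: str, vocabulary: Iterable[str], limit: int = 2) -> List[str]:
    """List vocabulary entries within `limit` edits, closest first."""
    scored = []
    for candidate in vocabulary:
        d = within_distance(word, candidate, limit)
        if d is not None:
            scored.append((d, candidate))
    scored.sort()
    return [candidate for _, candidate in scored]


print(suggest("helo", ["hello", "help", "halo", "world", "he"]))
# ['halo', 'hello', 'help', 'he']
```

The early exit is a design choice for threshold searches: once every cell in a row exceeds the limit, the remaining rows cannot bring the answer back under it, so the comparison can stop.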


Levenshtein Distance Calculator

Find the edit distance between two strings — the fewest single-character insertions, deletions, and substitutions to turn one into the other. See the full dynamic-programming matrix and the step-by-step edit script. Runs entirely in your browser, no signup.

By Induwara Ashinsana, Executive Director, Ryzera Technologies · Updated Jun 10, 2026

Edit distance: 3

Cross-checked ✓

String A: 6 / 2,000 characters
String B: 7 / 2,000 characters

Similarity: 57.1% (1 − distance / max(len))
String lengths: 6 · 7 (len A · len B, after toggles)
Matched chars: 4

Operation breakdown: 2 substitutions, 1 insertion, 0 deletions, 4 matches

Aligned edit script

sub 'k' → 's'
keep 'i'
keep 't'
keep 't'
sub 'e' → 'i'
keep 'n'
ins 'g'

Dynamic-programming matrix

	ε	s	i	t	t	i	n	g
ε	0	1	2	3	4	5	6	7
k	1	1	2	3	4	5	6	7
i	2	2	1	2	3	4	5	6
t	3	3	2	1	2	3	4	5
t	4	4	3	2	1	2	3	4
e	5	5	4	3	2	2	3	4
n	6	6	5	4	3	3	2	3

Sources cited: Wagner & Fischer (1974), The String-to-String Correction Problem; Levenshtein (1966); NIST DADS. Unit cost per insertion, deletion, and substitution. The full list with links is in the references section below.

How it works

The calculator uses the Wagner–Fischer dynamic-programming algorithm, the standard method for computing the Levenshtein (edit) distance defined by Vladimir Levenshtein in 1966. Given two strings A of length m and B of length n, it fills a table of size (m+1) × (n+1) where each cell d[i][j] holds the edit distance between the first i characters of A and the first j characters of B.

Apply the pre-processing toggles first: lowercase both strings if the comparison is case-insensitive, and strip whitespace if that option is on.
Initialise the edges: d[i][0] = i (turning a prefix into an empty string costs i deletions) and d[0][j] = j (building from empty costs j insertions).

Fill each remaining cell with the recurrence below, where the substitution cost is 0 when the characters match and 1 when they differ:

cost = (A[i-1] == B[j-1]) ? 0 : 1
d[i][j] = min(
  d[i-1][j]   + 1,   // deletion
  d[i][j-1]   + 1,   // insertion
  d[i-1][j-1] + cost // substitution / match
)

The Levenshtein distance is the bottom-right cell, d[m][n].
Backtrack from d[m][n] to d[0][0], at each step choosing the predecessor that produced the minimum, to recover one optimal edit script and the per-operation counts. Ties are broken in a fixed, documented order: match, substitution, deletion, insertion.
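The steps above can be sketched in Python as follows. This is an illustrative reading of the description, not the calculator's source: the names `edit_matrix` and `edit_script` and the tuple format of the operations are assumptions, and the tie-break is applied in the stated order (match, substitution, deletion, insertion) while walking backwards from the bottom-right cell.

```python
from typing import List, Tuple


def edit_matrix(a: str, b: str) -> List[List[int]]:
    """Fill the (m+1) x (n+1) table of prefix-to-prefix distances."""
    m, n = len(a), len(b)
    d = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(m + 1):
        d[i][0] = i  # i deletions to reach the empty string
    for j in range(n + 1):
        d[0][j] = j  # j insertions starting from the empty string
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            cost = 0 if a[i - 1] == b[j - 1] else 1
            d[i][j] = min(d[i - 1][j] + 1,          # deletion
                          d[i][j - 1] + 1,          # insertion
                          d[i - 1][j - 1] + cost)   # substitution / match
    return d


def edit_script(a: str, b: str) -> Tuple[int, List[Tuple[str, str, str]]]:
    """Return the distance and one optimal script as (op, from, to) tuples."""
    d = edit_matrix(a, b)
    i, j = len(a), len(b)
    ops = []
    while i > 0 or j > 0:
        diagonal = i > 0 and j > 0
        if diagonal and a[i - 1] == b[j - 1] and d[i][j] == d[i - 1][j - 1]:
            ops.append(("keep", a[i - 1], b[j - 1]))
            i, j = i - 1, j - 1
        elif diagonal and a[i - 1] != b[j - 1] and d[i][j] == d[i - 1][j - 1] + 1:
            ops.append(("sub", a[i - 1], b[j - 1]))
            i, j = i - 1, j - 1
        elif i > 0 and d[i][j] == d[i - 1][j] + 1:
            ops.append(("del", a[i - 1], ""))
            i -= 1
        else:
            ops.append(("ins", "", b[j - 1]))
            j -= 1
    ops.reverse()  # backtracking collects the operations last-to-first
    return d[len(a)][len(b)], ops


distance, script = edit_script("kitten", "sitting")
print(distance)  # 3
for op in script:
    print(op)
```

For "kitten" and "sitting" this walk yields sub k→s, three keeps, sub e→i, keep n, and a final insertion of g, the same script the page lists for that pair.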

The algorithm runs in O(m × n) time. The similarity percentage shown alongside the distance is a presentation convenience, computed as (1 − distance / max(m, n)) × 100 and defined as 100% when both strings are empty; it is not part of the formal Levenshtein definition. Every result is independently cross-checked against a second, space-optimized two-row implementation of the same recurrence — if the two ever disagreed, the badge in the tool would flag it.
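A cross-check of this kind might look like the Python sketch below. It is a guess at the general shape, not the tool's code: `distance_full`, `distance_two_rows`, and `cross_checked_distance` are hypothetical names, and swapping the strings so the shorter one forms the columns is an optional extra that relies on the distance being symmetric.

```python
from typing import Tuple


def distance_full(a: str, b: str) -> int:
    """Full-table version: O(m * n) time and space."""
    m, n = len(a), len(b)
    d = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(m + 1):
        d[i][0] = i
    for j in range(n + 1):
        d[0][j] = j
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            cost = 0 if a[i - 1] == b[j - 1] else 1
            d[i][j] = min(d[i - 1][j] + 1, d[i][j - 1] + 1, d[i - 1][j - 1] + cost)
    return d[m][n]


def distance_two_rows(a: str, b: str) -> int:
    """Same recurrence, but only the previous and current rows are kept."""
    if len(b) > len(a):
        a, b = b, a  # symmetric metric, so the shorter string sets row width
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def cross_checked_distance(a: str, b: str) -> Tuple[int, bool]:
    """Return the distance and whether both implementations agree."""
    full = distance_full(a, b)
    return full, full == distance_two_rows(a, b)


for pair in [("kitten", "sitting"), ("Saturday", "Sunday"), ("flaw", "lawn")]:
    print(pair, cross_checked_distance(*pair))
```

The two-row variant cannot recover an edit script, because the rows needed for backtracking are discarded; that is why it serves here only as an independent check on the number.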

Worked examples

kitten → sitting (the textbook case)

distance = 3 · similarity ≈ 57.1%

Align the strings: k i t t e n vs s i t t i n g
Substitute k → s (position 1) ............ cost 1
Keep i, t, t (already matching) .......... cost 0
Substitute e → i (position 5) ............ cost 1
Keep n .................................... cost 0
Insert g at the end ...................... cost 1
Total = 3 edits. max(len) = 7, so similarity = (1 − 3/7) × 100 ≈ 57.1%

Saturday → Sunday

distance = 3 · similarity = 62.5%

Keep S .................................... cost 0
Delete a, delete t (from "Sat…") ......... cost 2
Keep u .................................... cost 0
Substitute r → n .......................... cost 1
Keep d, a, y .............................. cost 0
Total = 3 edits. max(len) = 8, so similarity = (1 − 3/8) × 100 = 62.5%

flaw → lawn (edge case: shifted overlap)

distance = 2 · similarity = 50.0%

The substring "law" is shared by both strings
Delete f from the front of "flaw" ........ cost 1
Keep l, a, w .............................. cost 0
Insert n at the end to reach "lawn" ...... cost 1
Total = 2 edits. max(len) = 4, so similarity = (1 − 2/4) × 100 = 50.0%

Frequently asked questions

Sources & references

The algorithm, recurrence, and worked examples on this page were last cross-checked against these sources on 2026-06-10. The distance is deterministic and verified against an independent second implementation on every calculation.

