Question 1

How accurate is this summarizer?

Accepted Answer

The neural model is DistilBART fine-tuned on the CNN/DailyMail summarization corpus, with a ROUGE-2 score of 21.26 reported on its Hugging Face model card. On long-form news writing and structured prose it produces tight, fluent summaries; on conversational chat logs or technical reference material the quality drops. The TextRank extractive bullets shown alongside give a transparent second view — when both methods surface the same idea, you can be confident it is central to the text.

Question 2

Is anything uploaded? Where does my text go?

Accepted Answer

Your text is sent once to this site's server, which forwards it to the Hugging Face Inference API for scoring, then discards the request. We do not log input text, do not store it, and do not run analytics on its contents. If neural inference is not enabled on the current build, the summarizer falls back to the TextRank pass — that runs entirely server-local with no outbound call at all.

Question 3

Why server-side and not in my browser?

Accepted Answer

A browser-side summarizer would have to download roughly 300 MB of distilled BART weights before the first summary. On a typical Sri Lankan home connection that is a minute-long wait the user did not ask for, and on mobile it can blow through a data plan. Running inference server-side keeps the page lightweight (under 100 KB JavaScript), the first summary fast, and means the tool works on low-end devices too.

Question 4

What is the maximum input length?

Accepted Answer

20,000 characters per summary, which is enough for a long article or a short paper. DistilBART itself has a 1,024-token encoder window — text longer than that is truncated server-side before scoring. For a multi-chapter document, summarize each chapter separately and then summarize the summaries.

Question 5

How does TextRank actually pick the bullets?

Accepted Answer

TextRank builds a graph where every sentence is a node and the edges are similarity scores (Mihalcea & Tarau, 2004). It then runs PageRank with a damping factor of 0.85 until the scores stop changing (tolerance 1e-4 or 30 iterations). The top-scoring sentences are returned in original document order, so the bullets read in the same flow as the source. Stop-words (126 of them) are filtered before similarity is computed.

Question 6

Does it work with Sinhala or Tamil text?

Accepted Answer

Not reliably. DistilBART-CNN was trained on English news and produces noisy output on Sinhala, Tamil, or transliterated text. The TextRank baseline still runs on any script, but the stop-word list and similarity scoring assume English word forms — for purely Sinhala or Tamil text the bullet ranking is roughly random. Sentences that are more than 70% non-Latin script are flagged in the breakdown.

Question 7

What is the difference between abstractive and extractive?

Accepted Answer

Abstractive summarization (the TL;DR paragraph) generates new sentences in the model's own words — fluent, but with a small risk of hallucination. Extractive summarization (the bullets) picks sentences verbatim from the original text — never hallucinates, but reads slightly choppy. This tool runs both so you can trust the bullets as evidence and the TL;DR for readability.

Question 8

Is the summary I see going to change if I run it again?

Accepted Answer

The TextRank pass is fully deterministic — the same text always produces the same bullets and scores. The DistilBART pass runs with do_sample disabled (greedy decoding), so for any given input and length preset the TL;DR is also deterministic. If the Hugging Face Inference API updates the model weights, the abstractive output may shift on the next call; we re-verify against the model card on every release.

Question 9

Can I use this commercially?

Accepted Answer

The tool itself is free, no signup, no ads. The model is open: DistilBART-CNN is Apache 2.0. Hugging Face's Inference API has a permissive free tier; for higher volumes you would deploy your own copy of the model. Running this summarizer in your own product or internal workflow is fine — if you build something more serious, link back so others can find the tool.

Question 10

Can I summarize a PDF or Word document?

Accepted Answer

Not directly — paste the text in. The summarizer reads plain text, not file formats. If your content is in a PDF, convert it first with our PDF to Word tool and copy the text across; for a scanned page or screenshot, run it through the Image to Text (OCR) tool to pull the words out, then paste those here. Keeping the tools separate keeps each one fast and predictable.

Question 11

Why does the first summary take a few seconds?

Accepted Answer

The neural model is served on demand through the Hugging Face Inference API. If it has been idle, the first request triggers a cold start while the model loads onto a GPU — usually 5–15 seconds. Subsequent summaries return in well under a second. The TextRank bullets are computed instantly server-side regardless, so you always see the key points without waiting on the model.

Question 12

When was the model and source list last verified?

Accepted Answer

Model card, API endpoint, and references were last cross-checked on 2026-05-12. Hugging Face model files and the inference contract can change independently — when an upstream patch lands, the next request picks it up automatically.

AI Text Summarizer — Free, No Signup, Sources Cited

How it works

1. DistilBART abstractive (server-side)

2. TextRank extractive (always on)

3. Non-English handling

4. Compression and reading-time stats

Worked examples

Frequently asked questions

Sources & references

Related tools

Paraphrasing Tool

AI Presentation Maker

Sentiment Analyzer

Comments & feedback