Question 1

How does AI answer questions from a passage?

Accepted Answer

The model reads the passage and question together, then predicts the start and end positions of the contiguous word span most likely to answer the question. It does not generate new text — the answer is always a substring of the passage you supplied. This is called extractive question answering, and the underlying model (DistilBERT (cased)) was fine-tuned on SQuAD 1.1, a 100,000-question reading-comprehension corpus released by Stanford in 2016.

Question 2

What is extractive question answering?

Accepted Answer

Extractive QA returns a verbatim span from the source text. Generative QA (think ChatGPT, Claude) writes a new sentence in its own words. Extractive is faithful by construction — it cannot hallucinate facts the passage does not contain — but it cannot synthesise an answer across multiple sentences. The two methods complement each other; this tool is the extractive one.

Question 3

Can I use BERT to answer questions from my own text?

Accepted Answer

Yes — that is exactly what this page does. The model is distilbert/distilbert-base-cased-distilled-squad, the standard DistilBERT-cased-SQuAD checkpoint, hosted on Hugging Face. Inference runs on the server through the Hugging Face Inference API. Your passage is sent once over HTTPS for scoring and not stored. No model weights are downloaded to your browser.

Question 4

Is there a free AI tool that answers questions from documents?

Accepted Answer

This is one. It is free with no signup, no API key, and no usage cap on this build. For PDFs and images, run them through our image-to-text OCR tool first, then paste the extracted text here. Multi-document retrieval (asking one question across many files) is not in scope for v1 — keep one passage per query.

Question 5

How accurate are SQuAD-trained models on real text?

Accepted Answer

The model card reports F1 = 86.9 and exact-match = 79.1 on SQuAD 1.1's development set. In plain English: roughly four out of five answers match the human reference exactly, and the overlap on the remainder is high. SQuAD passages are encyclopedic prose, so accuracy is best on textbook-style writing. Performance drops on lists, tables, contracts with archaic language, and any text in Sinhala or Tamil — the model is English-only.

Question 6

What is the confidence score and what does it mean?

Accepted Answer

The score is the softmax-normalised probability the model assigns to its top answer span. Bands: ≥ 80% high (usually correct on clean SQuAD-style text), 50–80% medium (treat as a candidate; verify against the passage), and below 50% low (the passage probably does not contain the answer). The score correlates with — but is not identical to — real-world factual correctness.

Question 7

Does it handle long documents?

Accepted Answer

Yes, up to 8,000 characters per call. Longer than the model's 512-token attention window? The pipeline applies a sliding window of 384 context tokens with a stride of 128 tokens, scores every chunk, then keeps the best span. The expected chunk count is shown in the result panel. For book-length material, split into sections — accuracy is higher on focused passages.

Question 8

Does it work with Sinhala or Tamil questions?

Accepted Answer

Not reliably. The model was trained on English Wikipedia passages. Asking a Sinhala or Tamil question or supplying non-English context will produce a span the model cannot meaningfully score. A multilingual variant (XLM-RoBERTa fine-tuned on SQuAD2 + TyDi-QA) is on the roadmap; for now use this tool for English text.

Question 9

What if the passage does not contain the answer?

Accepted Answer

The model still returns its best non-CLS span, but the confidence falls into the low band, and the lexical baseline ends up with a small keyword-overlap score too. The page flags both with a clear "likely no answer in passage" warning rather than presenting the span as authoritative. The right move is usually to broaden the passage or rephrase the question into the form a textbook would ask.

Question 10

What is the lexical baseline shown next to the model answer?

Accepted Answer

A deterministic, keyword-overlap cross-check. The baseline scores each context sentence by the fraction of content words from your question it contains, applies a small bonus for sentences matching the question's wh-pattern (numbers for "how long", dates for "when", proper nouns for "who/where"), and returns the highest-scoring sentence. It always runs, even when the neural backend is offline, so the page never shows a dead state.

Question 11

Is my pasted text stored anywhere?

Accepted Answer

No. The text is sent once to our server, forwarded to the Hugging Face Inference API for scoring, and discarded. There are no logs of submitted passages or questions. The site also runs without ads or tracking pixels — see /about for the full data-handling note.

AI Question Answering — Free Extractive QA Over Any Passage

How it works

Worked examples

Frequently asked questions

Sources & references

Related tools

Text Summarizer

AI Keyword Extractor

AI Similarity Checker

Comments & feedback