Question 1

What is zero-shot text classification?

Accepted Answer

Zero-shot classification labels text using categories the model was never explicitly trained on. Instead of fine-tuning on examples of each class, the tool turns every candidate label into a hypothesis sentence (for the label 'sports', 'This text is about sports.') and asks a Natural Language Inference (NLI) model how likely it is that the input text entails that hypothesis. The entailment probability becomes the label's score, so you can invent new categories at runtime without training data.
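The flow described above can be sketched in a few lines of Python. This is a minimal illustration, not the tool's actual code: `rank_labels`, `EntailFn`, and `toy_entail_prob` are hypothetical names, and the toy scorer is a keyword check standing in for a real NLI model.

```python
from typing import Callable, Dict, List

# Hypothetical type: any function mapping (premise, hypothesis) to an
# entailment probability in [0, 1]. In practice an NLI model fills this role.
EntailFn = Callable[[str, str], float]


def rank_labels(text: str, labels: List[str], entail_prob: EntailFn,
                template: str = "This text is about {}.") -> Dict[str, float]:
    """Score each label by how strongly `text` entails its hypothesis."""
    scores = {}
    for label in labels:
        hypothesis = template.format(label)  # e.g. "This text is about billing."
        scores[label] = entail_prob(text, hypothesis)
    # Highest entailment probability first.
    return dict(sorted(scores.items(), key=lambda kv: kv[1], reverse=True))


def toy_entail_prob(premise: str, hypothesis: str) -> float:
    """Stand-in scorer, NOT a real NLI model: checks if the label word occurs in the premise."""
    label = hypothesis.rstrip(".").split()[-1].lower()
    return 0.9 if label in premise.lower() else 0.1


ranked = rank_labels("My billing statement is wrong.", ["sports", "billing"], toy_entail_prob)
print(ranked)
```

Swapping `toy_entail_prob` for a call to a real NLI model is the only change needed to turn the sketch into a working classifier; the label list stays free-form.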

Question 2

Can I really classify text without any training data?

Accepted Answer

Yes — that is the whole point. The underlying NLI model was trained on the MultiNLI corpus, which teaches it 'given premise X, does hypothesis Y follow?' across many topics and writing styles. Once you have that general capability, you can ask about any label by writing a one-sentence hypothesis. Accuracy is lower than a fine-tuned classifier on a specific domain, but it is free, instant, and works on labels you only thought of two minutes ago.

Question 3

Which model does this tool use?

Accepted Answer

valhalla/distilbart-mnli-12-3 — a distilled BART model (147M parameters · 12-layer encoder, 3-layer decoder) fine-tuned on MultiNLI by valhalla. The same NLI-as-zero-shot-classifier method is the default in Hugging Face's transformers library and on its zero-shot-classification task page. The model is invoked server-side through the Hugging Face Inference API; no model weights download into your browser.
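For readers calling the model themselves, a request body might be assembled as below. This is a sketch under assumptions: the `inputs` / `parameters.candidate_labels` / `parameters.multi_label` shape follows the Hugging Face zero-shot-classification task format as commonly documented, `build_payload` is a hypothetical helper, and the endpoint URL and auth header are left out because the page does not state them.

```python
import json
from typing import List


def build_payload(text: str, labels: List[str], multi_label: bool = False) -> str:
    """Serialize a zero-shot classification request body as JSON."""
    body = {
        "inputs": text,
        "parameters": {
            "candidate_labels": labels,
            "multi_label": multi_label,
        },
    }
    return json.dumps(body)


MODEL_ID = "valhalla/distilbart-mnli-12-3"  # model named in the answer above
payload = build_payload("Refund still not received.", ["billing", "shipping"])
print(MODEL_ID, payload)
```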

Question 4

Is there a free alternative to OpenAI text classification?

Accepted Answer

This is one. OpenAI's text-classification endpoints, AWS Comprehend's custom-classifier API, and direct Hugging Face Inference calls all require an API key on your side plus, in the OpenAI and AWS cases, a credit card. This page keeps the API key on our backend and serves the same family of NLI model for free. Trade-offs: an accuracy gap of roughly 2 to 3 percentage points versus the largest BART-MNLI checkpoint, and a hard cap on input length to keep latency predictable.

Question 5

How accurate is BART-MNLI for zero-shot tasks?

Accepted Answer

Used zero-shot with the entailment approach of Yin et al. (2019; see Sources), BART-MNLI-large reaches roughly 78 % accuracy on the AG News 4-class topic dataset and around 90 % on the Yahoo Answers categories benchmark. The distilled 12-3 variant used here trades 2 to 3 percentage points of accuracy for a roughly 3× smaller model and faster inference. For everyday triage (support tickets, news topics, comment moderation), that is competitive with proprietary APIs.

Question 6

Single-label vs multi-label — when do I use which?

Accepted Answer

Single-label assumes the categories are mutually exclusive: exactly one fits ('urgent' OR 'not urgent', a news article's primary topic). The entailment scores are softmaxed across labels, so they sum to 100 %. Multi-label assumes labels are independent: a movie review can be 'positive' AND 'spoiler-heavy', a ticket can be 'urgent' AND 'billing-related'. Each label is scored on its own (entailment versus contradiction for that label alone), so several can be high at once and the scores need not sum to 100 %. Switch to multi-label whenever your labels are not mutually exclusive.
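The difference between the two modes comes down to where the softmax is applied. The sketch below mirrors the approach used by the Hugging Face transformers zero-shot pipeline as I understand it; the function names and the logit values are made up for illustration.

```python
import math
from typing import Dict, List, Tuple


def softmax(values: List[float]) -> List[float]:
    """Numerically stable softmax over a list of logits."""
    m = max(values)
    exps = [math.exp(v - m) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def single_label_scores(entail_logits: Dict[str, float]) -> Dict[str, float]:
    """Softmax the entailment logits ACROSS labels; scores sum to 1."""
    labels = list(entail_logits)
    probs = softmax([entail_logits[label] for label in labels])
    return dict(zip(labels, probs))


def multi_label_scores(logits: Dict[str, Tuple[float, float]]) -> Dict[str, float]:
    """Softmax (contradiction, entailment) WITHIN each label; scores are independent."""
    return {label: softmax([c, e])[1] for label, (c, e) in logits.items()}


# Illustrative logits only, not real model output.
single = single_label_scores({"urgent": 2.0, "billing": 2.0})
multi = multi_label_scores({"urgent": (-1.0, 3.0), "billing": (-2.0, 2.5)})
print(single)
print(multi)
```

In the single-label case two equally plausible labels split the probability mass, while in the multi-label case both can score close to 1.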

Question 7

Does it work with Sinhala or Tamil text?

Accepted Answer

Not reliably. The MultiNLI training data is English-only, so distilbart-mnli produces unreliable scores on Sinhala, Tamil, or other non-English text. Translate first using our AI Translator tool, then run the English output through this classifier. A multilingual XNLI-based model is on the roadmap.

Question 8

Does my text get stored anywhere?

Accepted Answer

No. Your text and labels are POSTed once to our /api/tools/zero-shot route and immediately forwarded to the Hugging Face Inference API, and the response is returned to your browser. Nothing is written to disk or to any database. The route is served by Next.js with `cache: "no-store"`, so responses are not held in an edge cache either.

Question 9

What is the hypothesis template and when should I edit it?

Accepted Answer

The template is the sentence the tool builds for each candidate label. The default — 'This text is about {}.' — works for topic classification. For other tasks, a more specific template raises accuracy: 'This support ticket is {}.' for triage, 'This product review expresses {}.' for sentiment, 'This email is from {}.' for routing. The placeholder {} is mandatory — it marks where each label gets substituted.
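A small helper makes the placeholder rule concrete. This is a hypothetical sketch (`fill_template` is not part of the tool): it rejects templates without `{}` and substitutes the label into the rest.

```python
def fill_template(template: str, label: str) -> str:
    """Substitute `label` into `template`, insisting on the {} placeholder."""
    if "{}" not in template:
        raise ValueError("hypothesis template must contain the {} placeholder")
    # str.replace avoids surprises if the template contains other braces.
    return template.replace("{}", label)


triage = fill_template("This support ticket is {}.", "urgent")
sentiment = fill_template("This product review expresses {}.", "disappointment")
print(triage)
print(sentiment)
```

Without the placeholder every label would yield the same hypothesis, so all labels would receive identical scores; failing early is the safer design.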

Question 10

When were the model and source list last verified?

Accepted Answer

The model card, Hugging Face Inference API endpoint, and source citations were last cross-checked on 2026-05-12. Inference runs against pinned Hugging Face weights; the page is reviewed each time the upstream model repo is updated or a new MNLI-based NLI checkpoint becomes the recommended default.

Zero-Shot Text Classifier — Free, Server-Side, No API Key

How it works

1. Build a hypothesis per label

2. Run an NLI head

3. Aggregate across labels

4. Server-side inference & privacy

Worked examples

Frequently asked questions

Sources & references
