Update README.md
README.md CHANGED
@@ -13,7 +13,8 @@ We introduce the Bloomz-3b-NLI model, fine-tuned on the [Bloomz-3b-dpo-chat](htt
## Zero-shot Classification

The primary appeal of training such models lies in their zero-shot classification performance. This means the model is capable of classifying any text with any label without specific training. What sets the Bloomz-3b-NLI LLMs apart in this realm is their ability to model and extract information from significantly more complex and lengthy text structures compared to models like BERT, RoBERTa, or CamemBERT.

+The zero-shot classification task can be summarized by:

$$P(hypothesis=i\in\mathcal{C}|premise)=\frac{e^{P(premise=entailment\vert hypothesis=i)}}{\sum_{j\in\mathcal{C}}e^{P(premise=entailment\vert hypothesis=j)}}$$

With *i* representing a hypothesis composed of a template (for example, "This text is about {}.") and candidate labels ("cinema", "politics", etc.), the set of hypotheses comprises {"This text is about cinema.", "This text is about politics.", ...}. It is these hypotheses that we will measure against the premise, which is the sentence we aim to classify.
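To make the scoring above concrete, here is a minimal sketch using the Hugging Face `transformers` library. The repository id `cmarkea/bloomz-3b-nli`, the example premise, and the `"entailment"` label name are illustrative assumptions, not taken from the diff; check `model.config.label2id` for the actual class names of the published checkpoint.

```python
import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer

model_id = "cmarkea/bloomz-3b-nli"  # assumed repository id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForSequenceClassification.from_pretrained(model_id)
model.eval()

premise = "The new film was screened at the festival last night."  # sentence to classify
template = "This text is about {}."
candidate_labels = ["cinema", "politics"]

# Build one hypothesis per candidate label: {"This text is about cinema.", ...}
hypotheses = [template.format(label) for label in candidate_labels]

# Encode one (premise, hypothesis) pair per candidate label.
inputs = tokenizer(
    [premise] * len(hypotheses),
    hypotheses,
    return_tensors="pt",
    padding=True,
)

with torch.no_grad():
    logits = model(**inputs).logits  # shape: (len(candidate_labels), num_nli_classes)

# Keep each pair's entailment score, then renormalize over the candidate set C
# with a softmax, as in the formula above.
entailment_id = model.config.label2id["entailment"]  # assumption: class named "entailment"
probs = torch.softmax(logits[:, entailment_id], dim=0)

for label, p in zip(candidate_labels, probs.tolist()):
    print(f"P({template.format(label)!r} | premise) = {p:.3f}")
```

The `zero-shot-classification` pipeline in `transformers` performs essentially this computation when given `candidate_labels` and a `hypothesis_template`, so it can be used in place of the manual loop.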