Known stable releases of the miscii-1020 based models
Sthenno
sthenno
AI & ML interests
To contact me: [email protected]
Recent Activity
reacted
to
MoritzLaurer's
post
with β€οΈ
1 day ago
FACTS is a great paper from @GoogleDeepMind on measuring the factuality of LLM outputs. You can now download their prompt templates from @huggingface to improve LLM-based fact-checking yourself!
π The paper introduces the FACTS Grounding benchmark for evaluating the factuality of LLM outputs.
π€ Fact-checking is automated by an ensemble of LLM judges that verify if a response is fully grounded in a factual reference document.
π§ͺ The authors tested different prompt templates on held-out data to ensure their generalization.
π It's highly educational to read these templates to learn how frontier labs design prompts and understand their limitations.
πΎ You can now download and reuse these prompt templates via the prompt-templates library!
π The library simplifies sharing prompt templates on the HF hub or locally via standardized YAML files. Letβs make LLM work more transparent and reproducible by sharing more templates like this!
Links π
- prompt-templates docs: https://moritzlaurer.github.io/prompt_templates/
- all templates on the HF Hub: https://huggingface.co/datasets/MoritzLaurer/facts-grounding-prompts
- FACTS paper: https://storage.googleapis.com/deepmind-media/FACTS/FACTS_grounding_paper.pdf
liked
a dataset
1 day ago
DAMO-NLP-SG/multimodal_textbook
Organizations
Collections
1
models
26
sthenno/tempesthenno-14b-nuslerp-0111
Updated
β’
2
β’
1
sthenno/tempesthenno-nuslerp-001
Text Generation
β’
Updated
β’
22
β’
1
sthenno/tempesthenno-14b-0111
Updated
β’
4
β’
1
sthenno/miscii-1225-19b-preset
Text Generation
β’
Updated
β’
6
β’
1
sthenno/inferno-math-stage1-exp1230
Text Generation
β’
Updated
β’
4
sthenno/inferno-math-stage1-ckpt1484
Updated
β’
6
sthenno/inferno-math-stage1-ckpt1400
Updated
β’
6
sthenno/inferno-math-stage1-ckpt1300
Updated
β’
6
sthenno/inferno-math-stage1-ckpt1200
Updated
β’
6
sthenno/inferno-math-stage1-ckpt1100
Updated
β’
6
datasets
None public yet