German-RAG-NEMO-12B (Retrieval Augmented Generation)

avemio 's Collections

German-RAG-LLAMA-3.1-8B (Retrieval Augmented Generation)

German-RAG-MISTRAL-7B-v3.0 (Retrieval Augmented Generation)

German-RAG-PHI-4B (Retrieval Augmented Generation)

German-RAG-EMBEDDING-MODELS

German-RAG-WHISPER-MODELS

German-RAG-BENCHMARKS

German-RAG-DATASETS

German-RAG-NEMO-12B (Retrieval Augmented Generation)

updated 22 days ago

Here you can find all the final checkpoints & datasets from training Nemo-12B Model from MistralAI & NVIDIA on the German-RAG Datasets.

Upvote

avemio/German-RAG-NEMO-12B-ORPO-HESSIAN-AI

Question Answering • Updated 22 days ago • 69

Note This model was trained on 20.7 Million Tokens in ORPO (Odd-Ratio-Preference Optimization) on synthetically generated or enhanced Data. Please see the German-RAG-ORPO-Dataset (https://huggingface.co/datasets/avemio/German-RAG-ORPO-ShareGPT-HESSIAN-AI) for reference.
avemio/German-RAG-NEMO-12B-SFT-HESSIAN-AI

Question Answering • Updated 22 days ago • 88 • 2

Note This model was trained on 1,5 Billion Tokens in SFT(Supervised Fine-Tuning) on synthetically generated or enhanced Data. Please see the German-RAG-SFT-Dataset (https://huggingface.co/datasets/avemio/German-RAG-SFT-ShareGPT-HESSIAN-AI) for reference.
avemio/German-RAG-NEMO-12B-CPT-HESSIAN-AI

Question Answering • Updated 22 days ago • 53

Note This model was trained on 507,5 Million Tokens in CPT (Continued Pre-Training) on synthetically generated or enhanced Data. Please see the German-RAG-CPT-Dataset (https://huggingface.co/datasets/avemio/German-RAG-CPT-HESSIAN-AI) for reference.
avemio/German-RAG-ORPO-ShareGPT-HESSIAN-AI

Viewer • Updated 22 days ago • 13.7k • 547 • 2
avemio/German-RAG-SFT-ShareGPT-HESSIAN-AI

Viewer • Updated 22 days ago • 1.01M • 1.4k • 1
avemio/German-RAG-CPT-HESSIAN-AI

Viewer • Updated 22 days ago • 654k • 29
avemio/German-RAG-NEMO-12B-ORPO-HESSIAN-AI-Q8_0-GGUF

Question Answering • Updated 22 days ago • 61
avemio/German-RAG-NEMO-12B-SFT-HESSIAN-AI-Q8_0-GGUF

Question Answering • Updated 22 days ago • 41

Upvote