
German-RAG-NEMO-12B (Retrieval Augmented Generation)
Here you can find all the final checkpoints & datasets from training Nemo-12B Model from MistralAI & NVIDIA on the German-RAG Datasets.
Question Answering • Updated • 69Note This model was trained on 20.7 Million Tokens in ORPO (Odd-Ratio-Preference Optimization) on synthetically generated or enhanced Data. Please see the German-RAG-ORPO-Dataset (https://huggingface.co/datasets/avemio/German-RAG-ORPO-ShareGPT-HESSIAN-AI) for reference.
avemio/German-RAG-NEMO-12B-SFT-HESSIAN-AI
Question Answering • Updated • 88 • 2Note This model was trained on 1,5 Billion Tokens in SFT(Supervised Fine-Tuning) on synthetically generated or enhanced Data. Please see the German-RAG-SFT-Dataset (https://huggingface.co/datasets/avemio/German-RAG-SFT-ShareGPT-HESSIAN-AI) for reference.
avemio/German-RAG-NEMO-12B-CPT-HESSIAN-AI
Question Answering • Updated • 53Note This model was trained on 507,5 Million Tokens in CPT (Continued Pre-Training) on synthetically generated or enhanced Data. Please see the German-RAG-CPT-Dataset (https://huggingface.co/datasets/avemio/German-RAG-CPT-HESSIAN-AI) for reference.
avemio/German-RAG-ORPO-ShareGPT-HESSIAN-AI
Viewer • Updated • 13.7k • 547 • 2avemio/German-RAG-SFT-ShareGPT-HESSIAN-AI
Viewer • Updated • 1.01M • 1.4k • 1avemio/German-RAG-CPT-HESSIAN-AI
Viewer • Updated • 654k • 29avemio/German-RAG-NEMO-12B-ORPO-HESSIAN-AI-Q8_0-GGUF
Question Answering • Updated • 61avemio/German-RAG-NEMO-12B-SFT-HESSIAN-AI-Q8_0-GGUF
Question Answering • Updated • 41