Building a Custom Arabic Semantic Search Model with Arabic Matryoshka Embeddings for RAG Using Sentence Transformers Sep 25, 2024 • 4
Arabic ModernBERT Collection This collection highlights efforts to enhance Arabic NLP tasks using the latest ModernBERT models. NAMAA-Space/AraModernBert-Topic-Classifier Text Classification • Updated 1 day ago • 11 • 4
Huggingface FineWeb2 Arabic Dataset Portions Collection of a comprehensive dataset of Arabic text sourced from the FineWeb2 project, representing diverse content across Arabic MSA and Dialect. HuggingFaceFW/fineweb-2 Viewer • Updated 4 days ago • 12.5B • 91.1k • 389 Omartificial-Intelligence-Space/FineWeb2-MSA Viewer • Updated 29 days ago • 907M • 3.13k • 1 Omartificial-Intelligence-Space/FineWeb2-Egyptian-Arabic Viewer • Updated Dec 12, 2024 • 23.9M • 119 • 1 Omartificial-Intelligence-Space/FineWeb2-Moroccan-Arabic Viewer • Updated Dec 12, 2024 • 69.6M • 106 • 1
Omartificial-Intelligence-Space/FineWeb2-Egyptian-Arabic Viewer • Updated Dec 12, 2024 • 23.9M • 119 • 1
Omartificial-Intelligence-Space/FineWeb2-Moroccan-Arabic Viewer • Updated Dec 12, 2024 • 69.6M • 106 • 1
Omartificial-Intelligence-Space/GATE-AraBert-v1 Sentence Similarity • Updated about 8 hours ago • 2.65k • 11
Omartificial-Intelligence-Space/Marbert-all-nli-triplet-Matryoshka Sentence Similarity • Updated 2 days ago • 335 • 1
Omartificial-Intelligence-Space/Arabic-Triplet-Matryoshka-V2 Sentence Similarity • Updated 2 days ago • 855 • 10
Omartificial-Intelligence-Space/Arabic-mpnet-base-all-nli-triplet Sentence Similarity • Updated 2 days ago • 1.09k • 10
Omartificial-Intelligence-Space/Arabic-MiniLM-L12-v2-all-nli-triplet Sentence Similarity • Updated 2 days ago • 373 • 4
Omartificial-Intelligence-Space/Arabic-labse-Matryoshka Sentence Similarity • Updated 2 days ago • 357 • 2
Omartificial-Intelligence-Space/Arabic-all-nli-triplet-Matryoshka Sentence Similarity • Updated 2 days ago • 343 • 1
Omartificial-Intelligence-Space/Arabert-all-nli-triplet-Matryoshka Sentence Similarity • Updated 2 days ago • 1.46k • 10
Omartificial-Intelligence-Space/E5-all-nli-triplet-Matryoshka Sentence Similarity • Updated 15 days ago • 8 • 1
Omartificial-Intelligence-Space/FineWeb2-Najdi-Arabic Viewer • Updated Dec 12, 2024 • 48.4M • 168 • 1
Omartificial-Intelligence-Space/FineWeb2-North-Levantine-Arabic Viewer • Updated Dec 12, 2024 • 223k • 141 • 1
Omartificial-Intelligence-Space/FineWeb2-Moroccan-Arabic Viewer • Updated Dec 12, 2024 • 69.6M • 106 • 1
Omartificial-Intelligence-Space/FineWeb2-Egyptian-Arabic Viewer • Updated Dec 12, 2024 • 23.9M • 119 • 1
Omartificial-Intelligence-Space/ILMAAM-Arabic-Culturally-Aligned-MMLU Viewer • Updated Dec 11, 2024 • 12.5k • 46
Omartificial-Intelligence-Space/Arabic_Reasoning_Dataset Viewer • Updated Dec 1, 2024 • 9.21k • 45 • 2
Omartificial-Intelligence-Space/Arabic-finanical-rag-embedding-dataset Viewer • Updated Oct 9, 2024 • 7k • 127 • 6