Blog, Articles, and discussions

Visual Document Retrieval Goes Multilingual

By January 10, 2025 guest • 36

Community Articles

view all

🐺🐦‍⬛ LLM Comparison/Test: Phi-4, Qwen2 VL 72B Instruct, Aya Expanse 32B in my updated MMLU-Pro CS benchmark

•

2 days ago

• 2

Python Is All You Need? Introducing Dria-Agent-α

•

2 days ago

• 6

Search the Web with AI

•

2 days ago

• 2

TerjamaBench: A Cultural Benchmark for English-Darija Machine Translation

•

3 days ago

• 17

Beyond Image Preferences - Rich Human Feedback for Text-to-Image Generation

•

3 days ago

• 13

🅰️ℹ️ 1️⃣0️⃣1️⃣ What is HtmlRAG, Multimodal RAG and Agentic RAG?

•

3 days ago

• 2

AI-Powered Content Creation for Release Notes Using KaibanJS

•

5 days ago

Synthetic Data Generation with FastData and Hugging Face

•

5 days ago

• 12

Crowd-sourced Open Preference Dataset for Text-to-Image Generation

•

5 days ago

• 17

Accelerating Language Model Inference with Mixture of Attentions

•

5 days ago

• 24

🌁#82: AI and ML in Real Life

•

6 days ago

• 15

Announcing NVIDIA Cosmos World Foundation Models

•

6 days ago

• 21

How to Automate Reddit Comment Generation with AI Agents in KaibanJS

•

6 days ago

• 4

Fine-tune SmolLM's on custom synthetic data

•

7 days ago

• 15

Building Effective Agents with Anthropic’s Best Practices and smolagents ❤️

•

8 days ago

• 4

The Reformer - Pushing the limits of language modeling

By July 3, 2020 • 1

How to generate text: using different decoding methods for language generation with Transformers

By March 1, 2020 • 138

How to train a new language model from scratch using Transformers and Tokenizers

By February 14, 2020 • 25

Community Articles

view all

Mastering Tensor Dimensions in Transformers

•

about 8 hours ago

• 3

A Multi-Agent Ecosystem for Autonomous AI

•

about 14 hours ago

Exploring Hard Negative Mining with NV-Retriever in Korean Financial Text

•

about 18 hours ago

• 7

🦸🏻#7: From Agentic AI to Physical AI

•

1 day ago

• 3

N-Queens Problem Based Monte Carlo Algorithm

•

1 day ago

• 7

🐺🐦‍⬛ LLM Comparison/Test: Phi-4, Qwen2 VL 72B Instruct, Aya Expanse 32B in my updated MMLU-Pro CS benchmark

•

2 days ago

• 2

Python Is All You Need? Introducing Dria-Agent-α

•

2 days ago

• 6

Search the Web with AI

•

2 days ago

• 2

TerjamaBench: A Cultural Benchmark for English-Darija Machine Translation

•

3 days ago

• 17

Beyond Image Preferences - Rich Human Feedback for Text-to-Image Generation

•

3 days ago

• 13

🅰️ℹ️ 1️⃣0️⃣1️⃣ What is HtmlRAG, Multimodal RAG and Agentic RAG?

•

3 days ago

• 2

AI-Powered Content Creation for Release Notes Using KaibanJS

•

5 days ago

Synthetic Data Generation with FastData and Hugging Face

•

5 days ago

• 12

Crowd-sourced Open Preference Dataset for Text-to-Image Generation

•

5 days ago

• 17

Accelerating Language Model Inference with Mixture of Attentions

•

5 days ago

• 24

🌁#82: AI and ML in Real Life

•

6 days ago

• 15

Announcing NVIDIA Cosmos World Foundation Models

•

6 days ago

• 21

How to Automate Reddit Comment Generation with AI Agents in KaibanJS

•

6 days ago

• 4

Fine-tune SmolLM's on custom synthetic data

•

7 days ago

• 15

Building Effective Agents with Anthropic’s Best Practices and smolagents ❤️

•

8 days ago

• 4

Blog, Articles, and discussions

Visual Document Retrieval Goes Multilingual

Mastering Tensor Dimensions in Transformers

A Multi-Agent Ecosystem for Autonomous AI

Exploring Hard Negative Mining with NV-Retriever in Korean Financial Text

🦸🏻#7: From Agentic AI to Physical AI

**N-Queens Problem Based Monte Carlo Algorithm**

🐺🐦‍⬛ LLM Comparison/Test: Phi-4, Qwen2 VL 72B Instruct, Aya Expanse 32B in my updated MMLU-Pro CS benchmark

Python Is All You Need? Introducing Dria-Agent-α

Search the Web with AI

TerjamaBench: A Cultural Benchmark for English-Darija Machine Translation

Beyond Image Preferences - Rich Human Feedback for Text-to-Image Generation

🅰️ℹ️ 1️⃣0️⃣1️⃣ **What is HtmlRAG, Multimodal RAG and Agentic RAG?**

AI-Powered Content Creation for Release Notes Using KaibanJS

Synthetic Data Generation with FastData and Hugging Face

Crowd-sourced Open Preference Dataset for Text-to-Image Generation

Accelerating Language Model Inference with Mixture of Attentions

🌁#82: AI and ML in Real Life

Announcing NVIDIA Cosmos World Foundation Models

How to Automate Reddit Comment Generation with AI Agents in KaibanJS

**Fine-tune SmolLM's on custom synthetic data**

Building Effective Agents with Anthropic’s Best Practices and smolagents ❤️

The Reformer - Pushing the limits of language modeling

How to generate text: using different decoding methods for language generation with Transformers

How to train a new language model from scratch using Transformers and Tokenizers

Mastering Tensor Dimensions in Transformers

A Multi-Agent Ecosystem for Autonomous AI

Exploring Hard Negative Mining with NV-Retriever in Korean Financial Text

🦸🏻#7: From Agentic AI to Physical AI

**N-Queens Problem Based Monte Carlo Algorithm**

🐺🐦‍⬛ LLM Comparison/Test: Phi-4, Qwen2 VL 72B Instruct, Aya Expanse 32B in my updated MMLU-Pro CS benchmark

Python Is All You Need? Introducing Dria-Agent-α

Search the Web with AI

TerjamaBench: A Cultural Benchmark for English-Darija Machine Translation

Beyond Image Preferences - Rich Human Feedback for Text-to-Image Generation

🅰️ℹ️ 1️⃣0️⃣1️⃣ **What is HtmlRAG, Multimodal RAG and Agentic RAG?**

AI-Powered Content Creation for Release Notes Using KaibanJS

Synthetic Data Generation with FastData and Hugging Face

Crowd-sourced Open Preference Dataset for Text-to-Image Generation

Accelerating Language Model Inference with Mixture of Attentions

🌁#82: AI and ML in Real Life

Announcing NVIDIA Cosmos World Foundation Models

How to Automate Reddit Comment Generation with AI Agents in KaibanJS

**Fine-tune SmolLM's on custom synthetic data**

Building Effective Agents with Anthropic’s Best Practices and smolagents ❤️

N-Queens Problem Based Monte Carlo Algorithm

🅰️ℹ️ 1️⃣0️⃣1️⃣ What is HtmlRAG, Multimodal RAG and Agentic RAG?

Fine-tune SmolLM's on custom synthetic data

N-Queens Problem Based Monte Carlo Algorithm

🅰️ℹ️ 1️⃣0️⃣1️⃣ What is HtmlRAG, Multimodal RAG and Agentic RAG?

Fine-tune SmolLM's on custom synthetic data