Shivanand M N

shivanandmn

AI & ML interests

None yet

Recent Activity

liked a Space 19 days ago

alielfilali01/LLM-Training-Cost-Calculator

liked a model 27 days ago

sometimesanotion/Qwen2.5-14B-Vimarckoso-v3

liked a dataset 27 days ago

mlabonne/orpo-dpo-mix-40k

Organizations

None yet

shivanandmn's activity

upvoted a paper 10 months ago

Condition-Aware Neural Network for Controlled Image Generation

Paper • 2404.01143 • Published Apr 1, 2024 • 12

upvoted a collection 10 months ago

Preference Datasets for KTO

This collection contains a list of curated preference datasets for KTO fine-tuning for intent alignment of LLMs through signals. • 5 items • Updated Dec 11, 2024 • 15

upvoted a paper 10 months ago

RAFT: Adapting Language Model to Domain Specific RAG

Paper • 2403.10131 • Published Mar 15, 2024 • 69

upvoted a paper 11 months ago

Speculative Streaming: Fast LLM Inference without Auxiliary Models

Paper • 2402.11131 • Published Feb 16, 2024 • 43

upvoted a paper 12 months ago

SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity

Paper • 2401.17072 • Published Jan 30, 2024 • 25

upvoted 3 papers about 1 year ago

RAG vs Fine-tuning: Pipelines, Tradeoffs, and a Case Study on Agriculture

Paper • 2401.08406 • Published Jan 16, 2024 • 37

Mixtral of Experts

Paper • 2401.04088 • Published Jan 8, 2024 • 158

PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPU

Paper • 2312.12456 • Published Dec 16, 2023 • 41

upvoted 3 collections about 1 year ago

Papers We've Read

Papers discussed in the H4 journal club • 3 items • Updated Apr 12, 2024 • 9

Awesome feedback datasets

A curated list of datasets with human or AI feedback. Useful for training reward models or applying techniques like DPO. • 19 items • Updated Apr 12, 2024 • 67

Awesome SFT datasets

A curated list of interesting datasets to fine-tune language models with. • 43 items • Updated Apr 12, 2024 • 128

upvoted a paper about 1 year ago

TinyStories: How Small Can Language Models Be and Still Speak Coherent English?

Paper • 2305.07759 • Published May 12, 2023 • 33

upvoted 2 papers over 1 year ago

Textbooks Are All You Need

Paper • 2306.11644 • Published Jun 20, 2023 • 143

AudioPaLM: A Large Language Model That Can Speak and Listen

Paper • 2306.12925 • Published Jun 22, 2023 • 53