ICML2023

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

alexzhou907 authored a paper 1 day ago

Inductive Moment Matching

alexzhou907 authored a paper 1 day ago

Ideas in Inference-time Scaling can Benefit Generative Pre-training Algorithms

hysts updated a Space 2 months ago

ICML2023/ICML2023_papers

View all activity

ICML2023's activity

AtAndDev

posted an update about 18 hours ago

Post

617

Gemma 3 seems to be really good at human preference. Just waiting for ppl to see it.

alexzhou907

authored 2 papers 1 day ago

Inductive Moment Matching

Paper • 2503.07565 • Published 3 days ago • 6

Ideas in Inference-time Scaling can Benefit Generative Pre-training Algorithms

Paper • 2503.07154 • Published 3 days ago • 2

AtAndDev

posted an update 26 days ago

Post

2420

@nroggendorff is that you sama?

2 replies

·

ameerazam08

posted an update about 1 month ago

Post

2346

Diffusion-Eraser
ameerazam08/Diffusion-Eraser

AtAndDev

posted an update about 1 month ago

Post

1892

everywhere i go i see his face

AtAndDev

posted an update about 2 months ago

Post

533

Deepseek gang on fire fr fr

AtAndDev

posted an update about 2 months ago

Post

1614

R1 is out! And with a lot of other R1 releated models...

hysts

updated a Space 2 months ago

ICML2023 Papers

vwxyzjn

authored 5 papers 2 months ago

The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR Summarization

Paper • 2403.17031 • Published Mar 24, 2024 • 6

A2C is a special case of PPO

Paper • 2205.09123 • Published May 18, 2022

Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models

Paper • 2410.18252 • Published Oct 23, 2024 • 7

TÜLU 3: Pushing Frontiers in Open Language Model Post-Training

Paper • 2411.15124 • Published Nov 22, 2024 • 59

2 OLMo 2 Furious

Paper • 2501.00656 • Published Dec 31, 2024 • 17

mbrack

authored a paper 3 months ago

LLMs Lost in Translation: M-ALERT uncovers Cross-Linguistic Safety Gaps

Paper • 2412.15035 • Published Dec 19, 2024 • 4

akhaliq

posted an update 3 months ago

Post

13176

Google drops Gemini 2.0 Flash Thinking

a new experimental model that unlocks stronger reasoning capabilities and shows its thoughts. The model plans (with thoughts visible), can solve complex problems with Flash speeds, and more

now available in anychat, try it out: akhaliq/anychat

3 replies

·

AtAndDev

posted an update 3 months ago

Post

463

@s3nh Hey man check your discord! Got some news.

4 replies

·

Kameshr

authored a paper 3 months ago

Think Beyond Size: Adaptive Prompting for More Effective Reasoning

Paper • 2410.08130 • Published Oct 10, 2024 • 2

akhaliq

posted an update 4 months ago

Post

13687

QwQ-32B-Preview is now available in anychat

A reasoning model that is competitive with OpenAI o1-mini and o1-preview

try it out: akhaliq/anychat

1 reply

·

akhaliq

posted an update 4 months ago

Post

4227

New model drop in anychat

allenai/Llama-3.1-Tulu-3-8B is now available

try it here: akhaliq/anychat