12 434 45

Vlad Bogolin

vladbogo

https://vladbogo.com

AI & ML interests

LLMs, Computer Vision

Recent Activity

updated a collection about 10 hours ago

AI Paper of the Day

upvoted a paper about 10 hours ago

The FACTS Grounding Leaderboard: Benchmarking LLMs' Ability to Ground Responses to Long-Form Input

updated a collection 1 day ago

AI Paper of the Day

View all activity

Articles

Organizations

vladbogo's activity

upvoted a paper about 10 hours ago

The FACTS Grounding Leaderboard: Benchmarking LLMs' Ability to Ground Responses to Long-Form Input

Paper • 2501.03200 • Published 6 days ago • 1

upvoted a paper 1 day ago

InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection

Paper • 2501.04575 • Published 5 days ago • 21

upvoted a paper 2 days ago

Agent Laboratory: Using LLM Agents as Research Assistants

Paper • 2501.04227 • Published 5 days ago • 68

upvoted a paper 4 days ago

Cosmos World Foundation Model Platform for Physical AI

Paper • 2501.03575 • Published 6 days ago • 55

upvoted 2 papers 5 days ago

STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution

Paper • 2501.02976 • Published 7 days ago • 46

VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Paper • 2501.01957 • Published 9 days ago • 35

upvoted a paper 8 days ago

Edicho: Consistent Image Editing in the Wild

Paper • 2412.21079 • Published 14 days ago • 21

upvoted 3 papers 9 days ago

MLLM-as-a-Judge for Image Safety without Human Labeling

Paper • 2501.00192 • Published 13 days ago • 23

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Paper • 2501.01257 • Published 11 days ago • 45

Training Software Engineering Agents and Verifiers with SWE-Gym

Paper • 2412.21139 • Published 13 days ago • 20

upvoted a paper 11 days ago

TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization

Paper • 2412.21037 • Published 14 days ago • 23

upvoted 2 papers 13 days ago

Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot Generalization

Paper • 2412.18525 • Published 20 days ago • 66

Orient Anything: Learning Robust Object Orientation Estimation from Rendering 3D Models

Paper • 2412.18605 • Published 19 days ago • 20

upvoted 2 papers 14 days ago

DepthLab: From Partial to Complete

Paper • 2412.18153 • Published 20 days ago • 34

Video-Panda: Parameter-efficient Alignment for Encoder-free Video-Language Models

Paper • 2412.18609 • Published 19 days ago • 15

upvoted a paper 15 days ago

Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search

Paper • 2412.18319 • Published 20 days ago • 35

upvoted 3 papers 18 days ago

Re-assessing ImageNet: How aligned is its single-label assumption with its multi-label nature?

Paper • 2412.18409 • Published 20 days ago • 1

LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks

Paper • 2412.15204 • Published 24 days ago • 33

TRecViT: A Recurrent Video Transformer

Paper • 2412.14294 • Published 25 days ago • 12

upvoted a paper 21 days ago

Alignment faking in large language models

Paper • 2412.14093 • Published 25 days ago • 7

Vlad Bogolin

AI & ML interests

Recent Activity

Articles

Many-shot jailbreaking

Gecko: Versatile Text Embeddings Distilled from Large Language Models

VideoMamba: State Space Model for Efficient Video Understanding

Genie: Generative Interactive Environments

Rephrasing the Web A Recipe for Compute and Data-Efficient Language Modeling

Reformatted Alignment

Organizations

vladbogo's activity