Umitcan Sahin's picture

Umitcan Sahin

ucsahin

·

AI & ML interests

Visual Language Models, Large Language Models, Vision Transformers

Recent Activity

liked a model 7 days ago

Lightricks/LTX-2.3

liked a dataset 15 days ago

nohurry/Opus-4.6-Reasoning-3000x-filtered

liked a dataset 15 days ago

peteromallet/dataclaw-peteromallet

View all activity

Organizations

None yet

upvoted 3 collections about 1 year ago

DeepSeek R1 (All Versions)

DeepSeek-R1-0528 is here! The most powerful reasoning open LLM, available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 37 items • Updated 1 day ago • 265

DeepSeek-V3

4 items • Updated Nov 27, 2025 • 283

DeepSeek-VL2

5 items • Updated Nov 27, 2025 • 80

upvoted 5 collections over 1 year ago

DataGemma Release

A series of pioneering open models that help ground LLMs in real-world data through Data Commons. • 2 items • Updated about 20 hours ago • 87

Turkish Instruction Datasets

Collection of instruction datasets for Turkish. • 49 items • Updated Feb 7 • 19

SigLIP

Contrastive (sigmoid) image-text models from https://arxiv.org/abs/2303.15343 • 10 items • Updated about 20 hours ago • 63

Nov 15 Releases 🍂

15 items • Updated Nov 15, 2024 • 6

Turkish Vision-Language Datasets

Collection of Turkish vision-language datasets. • 29 items • Updated 11 days ago • 11

upvoted 5 papers over 1 year ago

LLaVA-OneVision: Easy Visual Task Transfer

Paper • 2408.03326 • Published Aug 6, 2024 • 61

MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models

Paper • 2408.02718 • Published Aug 5, 2024 • 62

VITA: Towards Open-Source Interactive Omni Multimodal LLM

Paper • 2408.05211 • Published Aug 9, 2024 • 50

SAM 2: Segment Anything in Images and Videos

Paper • 2408.00714 • Published Aug 1, 2024 • 120

Gemma 2: Improving Open Language Models at a Practical Size

Paper • 2408.00118 • Published Jul 31, 2024 • 78

upvoted a collection over 1 year ago

Vision Language Leaderboards

This collection has all the vision language leaderboards. • 7 items • Updated Aug 24, 2024 • 22

upvoted 2 articles over 1 year ago

Article

Google releases Gemma 2 2B, ShieldGemma and Gemma Scope

+2

Jul 31, 2024

•

60

Article

The Rise of Agentic Data Generation

Jul 15, 2024

•

89

upvoted 2 papers over 1 year ago

EVLM: An Efficient Vision-Language Model for Visual Understanding

Paper • 2407.14177 • Published Jul 19, 2024 • 45

Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model

Paper • 2407.07053 • Published Jul 9, 2024 • 47

upvoted a collection over 1 year ago

🪐 SmolLM

A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated May 5, 2025 • 245

upvoted an article over 1 year ago

Article

TGI Multi-LoRA: Deploy Once, Serve 30 Models

+1

Jul 18, 2024

•

62