Dilxat Muhtar

PumpkinCat

https://pumpkin-co.github.io//

pUmpKin-Co

AI & ML interests

Computer Vision;Large Language Model; Efficient LM

Recent Activity

upvoted a paper about 13 hours ago

Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem

liked a model 10 days ago

pengxiang/DreamPara

updated a model 10 days ago

PumpkinCat/SmolLM3-PartialSFT-PartialDPO

View all activity

Organizations

upvoted a paper about 13 hours ago

Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem

Paper • 2512.24873 • Published 1 day ago • 32

liked a model 10 days ago

pengxiang/DreamPara

8B • Updated 20 days ago • 34 • 1

updated a model 10 days ago

PumpkinCat/SmolLM3-PartialSFT-PartialDPO

Text Generation • 150k • Updated 10 days ago • 20

published a model 11 days ago

PumpkinCat/SmolLM3-PartialSFT-PartialDPO

Text Generation • 150k • Updated 10 days ago • 20

updated a model 11 days ago

PumpkinCat/SmolLM3-SFT-repoc-DPO

Text Generation • 150k • Updated 11 days ago • 12

published a model 11 days ago

PumpkinCat/SmolLM3-SFT-repoc-DPO

Text Generation • 150k • Updated 11 days ago • 12

updated a model 15 days ago

PumpkinCat/SmolLM3-FULL-After-FULL

Text Generation • 150k • Updated 15 days ago • 15

published a model 15 days ago

PumpkinCat/SmolLM3-FULL-After-FULL

Text Generation • 150k • Updated 15 days ago • 15

liked a Space about 2 months ago

The Smol Training Playbook

📚

2.76k

The secrets to building world-class LLMs

upvoted a paper 2 months ago

Part I: Tricks or Traps? A Deep Dive into RL for LLM Reasoning

Paper • 2508.08221 • Published Aug 11, 2025 • 50

published a dataset 2 months ago

PumpkinCat/ParallelThinkingDLM

Viewer • Updated Oct 26, 2025 • 225k • 17

updated a dataset 2 months ago

PumpkinCat/ParallelThinkingDLM

Viewer • Updated Oct 26, 2025 • 225k • 17

upvoted a paper 3 months ago

Attention Illuminates LLM Reasoning: The Preplan-and-Anchor Rhythm Enables Fine-Grained Policy Optimization

Paper • 2510.13554 • Published Oct 15, 2025 • 57

upvoted a paper 4 months ago

Diffusion Language Models Know the Answer Before Decoding

Paper • 2508.19982 • Published Aug 27, 2025 • 25

authored a paper 4 months ago

Diffusion Language Models Know the Answer Before Decoding

Paper • 2508.19982 • Published Aug 27, 2025 • 25

updated a model 8 months ago

PumpkinCat/Qwen2VL-7B-RS

Image-to-Text • 8B • Updated May 12, 2025 • 10

liked a dataset 8 months ago

PumpkinCat/LHRS_Data

Viewer • Updated Oct 26, 2024 • 4.6k • 299 • 4

liked a model 9 months ago

LHRS/LHRS-Bot-Nova

Updated Dec 23, 2024 • 2

authored 2 papers 10 months ago

MTL-LoRA: Low-Rank Adaptation for Multi-Task Learning

Paper • 2410.09437 • Published Oct 12, 2024

Quality-Driven Curation of Remote Sensing Vision-Language Data via Learned Scoring Models

Paper • 2503.00743 • Published Mar 2, 2025 • 1

Dilxat Muhtar

AI & ML interests

Recent Activity

Organizations

PumpkinCat's activity

The Smol Training Playbook