andreydelpozo's Collections
explorations
updated
teknium/OpenHermes-2.5-Mistral-7B
Text Generation
•
7B
•
Updated
•
192k
•
881
Text-to-Image
•
Updated
•
96.1k
•
2.12k
Text Generation
•
9B
•
Updated
•
102k
•
1.23k
dphn/dolphin-2.2.1-mistral-7b
Text Generation
•
7B
•
Updated
•
459
•
198
dphn/dolphin-2.5-mixtral-8x7b
Text Generation
•
47B
•
Updated
•
1.66k
•
1.24k
dphn/dolphin-2.6-mistral-7b-dpo-laser
Text Generation
•
7B
•
Updated
•
79
•
120
ise-uiuc/Magicoder-Evol-Instruct-110K
Viewer
•
Updated
•
111k
•
2.99k
•
170
Generative Inbetweening: Adapting Image-to-Video Models for Keyframe Interpolation
Paper
•
2408.15239
•
Published
•
30
WebShaper: Agentically Data Synthesizing via Information-Seeking Formalization
Paper
•
2507.15061
•
Published
•
60
Ovi: Twin Backbone Cross-Modal Fusion for Audio-Video Generation
Paper
•
2510.01284
•
Published
•
34
OBS-Diff: Accurate Pruning For Diffusion Models in One-Shot
Paper
•
2510.06751
•
Published
•
21
Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning
Paper
•
2509.24372
•
Published
•
9
Paper
•
2508.10104
•
Published
•
291
MATRIX: Mask Track Alignment for Interaction-aware Video Generation
Paper
•
2510.07310
•
Published
•
35
Real-Time Object Detection Meets DINOv3
Paper
•
2509.20787
•
Published
•
10
A Survey of Reinforcement Learning for Large Reasoning Models
Paper
•
2509.08827
•
Published
•
190
A Survey of Context Engineering for Large Language Models
Paper
•
2507.13334
•
Published
•
259
Scaling RL to Long Videos
Paper
•
2507.07966
•
Published
•
159
T-LoRA: Single Image Diffusion Model Customization Without Overfitting
Paper
•
2507.05964
•
Published
•
119
SingLoRA: Low Rank Adaptation Using a Single Matrix
Paper
•
2507.05566
•
Published
•
113
Easy Dataset: A Unified and Extensible Framework for Synthesizing LLM Fine-Tuning Data from Unstructured Documents
Paper
•
2507.04009
•
Published
•
51
Radial Attention: O(n log n) Sparse Attention with Energy Decay for Long Video Generation
Paper
•
2506.19852
•
Published
•
42
KV Cache Steering for Inducing Reasoning in Small Language Models
Paper
•
2507.08799
•
Published
•
40
PartCrafter: Structured 3D Mesh Generation via Compositional Latent Diffusion Transformers
Paper
•
2506.05573
•
Published
•
82
Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models
Paper
•
2506.05176
•
Published
•
77
ComfyUI-R1: Exploring Reasoning Models for Workflow Generation
Paper
•
2506.09790
•
Published
•
53
SpatialLM: Training Large Language Models for Structured Indoor Modeling
Paper
•
2506.07491
•
Published
•
50
Paper
•
2505.09388
•
Published
•
320
BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset
Paper
•
2505.09568
•
Published
•
98
Flow-GRPO: Training Flow Matching Models via Online RL
Paper
•
2505.05470
•
Published
•
86
Distilling LLM Agent into Small Models with Retrieval and Code Tools
Paper
•
2505.17612
•
Published
•
81
ZeroSearch: Incentivize the Search Capability of LLMs without Searching
Paper
•
2505.04588
•
Published
•
65
dx8152/Qwen-Edit-2509-Multiple-angles
Image-to-Image
•
Updated
•
66.5k
•
857