explorations - a andreydelpozo Collection

andreydelpozo 's Collections

explorations

updated Nov 7, 2025

random things

teknium/OpenHermes-2.5-Mistral-7B

Text Generation • 7B • Updated Feb 19, 2024 • 192k • 881
ByteDance/SDXL-Lightning

Text-to-Image • Updated Apr 3, 2024 • 96.1k • • 2.12k
google/gemma-7b-it

Text Generation • 9B • Updated Aug 14, 2024 • 102k • 1.23k
dphn/dolphin-2.2.1-mistral-7b

Text Generation • 7B • Updated May 20, 2024 • 459 • 198
dphn/dolphin-2.5-mixtral-8x7b

Text Generation • 47B • Updated May 21, 2024 • 1.66k • 1.24k
dphn/dolphin-2.6-mistral-7b-dpo-laser

Text Generation • 7B • Updated Mar 4, 2024 • 79 • 120
ise-uiuc/Magicoder-Evol-Instruct-110K

Viewer • Updated Dec 28, 2023 • 111k • 2.99k • 170
Generative Inbetweening: Adapting Image-to-Video Models for Keyframe Interpolation

Paper • 2408.15239 • Published Aug 27, 2024 • 30
WebShaper: Agentically Data Synthesizing via Information-Seeking Formalization

Paper • 2507.15061 • Published Jul 20, 2025 • 60
Ovi: Twin Backbone Cross-Modal Fusion for Audio-Video Generation

Paper • 2510.01284 • Published Sep 30, 2025 • 34
OBS-Diff: Accurate Pruning For Diffusion Models in One-Shot

Paper • 2510.06751 • Published Oct 8, 2025 • 21
Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning

Paper • 2509.24372 • Published Sep 29, 2025 • 9
DINOv3

Paper • 2508.10104 • Published Aug 13, 2025 • 291
MATRIX: Mask Track Alignment for Interaction-aware Video Generation

Paper • 2510.07310 • Published Oct 8, 2025 • 35
Real-Time Object Detection Meets DINOv3

Paper • 2509.20787 • Published Sep 25, 2025 • 10
A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10, 2025 • 190
A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published Jul 17, 2025 • 259
Scaling RL to Long Videos

Paper • 2507.07966 • Published Jul 10, 2025 • 159
T-LoRA: Single Image Diffusion Model Customization Without Overfitting

Paper • 2507.05964 • Published Jul 8, 2025 • 119
SingLoRA: Low Rank Adaptation Using a Single Matrix

Paper • 2507.05566 • Published Jul 8, 2025 • 113
Easy Dataset: A Unified and Extensible Framework for Synthesizing LLM Fine-Tuning Data from Unstructured Documents

Paper • 2507.04009 • Published Jul 5, 2025 • 51
Radial Attention: O(nlog n) Sparse Attention with Energy Decay for Long Video Generation

Paper • 2506.19852 • Published Jun 24, 2025 • 42
KV Cache Steering for Inducing Reasoning in Small Language Models

Paper • 2507.08799 • Published Jul 11, 2025 • 40
PartCrafter: Structured 3D Mesh Generation via Compositional Latent Diffusion Transformers

Paper • 2506.05573 • Published Jun 5, 2025 • 82
Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models

Paper • 2506.05176 • Published Jun 5, 2025 • 77
ComfyUI-R1: Exploring Reasoning Models for Workflow Generation

Paper • 2506.09790 • Published Jun 11, 2025 • 53
SpatialLM: Training Large Language Models for Structured Indoor Modeling

Paper • 2506.07491 • Published Jun 9, 2025 • 50
Qwen3 Technical Report

Paper • 2505.09388 • Published May 14, 2025 • 320
BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

Paper • 2505.09568 • Published May 14, 2025 • 98
Flow-GRPO: Training Flow Matching Models via Online RL

Paper • 2505.05470 • Published May 8, 2025 • 86
Distilling LLM Agent into Small Models with Retrieval and Code Tools

Paper • 2505.17612 • Published May 23, 2025 • 81
ZeroSearch: Incentivize the Search Capability of LLMs without Searching

Paper • 2505.04588 • Published May 7, 2025 • 65
dx8152/Qwen-Edit-2509-Multiple-angles

Image-to-Image • Updated Nov 28, 2025 • 66.5k • • 857