Marius Dinca

Puddings22

AI & ML interests

None yet

Recent Activity

commented on a paper 1 day ago

Fast Byte Latent Transformer

upvoted a paper 1 day ago

Fast Byte Latent Transformer

upvoted a paper 28 days ago

Seedance 2.0: Advancing Video Generation for World Complexity

Organizations

commented on a paper 1 day ago

Fast Byte Latent Transformer

Paper • 2605.08044 • Published 6 days ago • 9

upvoted a paper 1 day ago

Fast Byte Latent Transformer

Paper • 2605.08044 • Published 6 days ago • 9

upvoted a paper 28 days ago

Seedance 2.0: Advancing Video Generation for World Complexity

Paper • 2604.14148 • Published 29 days ago • 159

upvoted 3 papers 29 days ago

Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Paper • 2604.12374 • Published about 1 month ago • 36

Large Language Models Align with the Human Brain during Creative Thinking

Paper • 2604.03480 • Published Apr 3 • 6

Beyond the Assistant Turn: User Turn Generation as a Probe of Interaction Awareness in Language Models

Paper • 2604.02315 • Published Apr 3 • 5

updated a collection about 1 month ago

interesting

Collection

3 items • Updated Apr 6

upvoted a collection about 1 month ago

Bonsai

Collection

1-bit Bonsai models • 7 items • Updated 26 days ago • 193

upvoted a paper about 2 months ago

MSA: Memory Sparse Attention for Efficient End-to-End Memory Model Scaling to 100M Tokens

Paper • 2603.23516 • Published Mar 6 • 49

upvoted a paper 2 months ago

Training Language Models via Neural Cellular Automata

Paper • 2603.10055 • Published Mar 9 • 8

commented on a paper 2 months ago

SimpleGPT: Improving GPT via A Simple Normalization Strategy

Paper • 2602.01212 • Published Feb 1 • 3

upvoted 5 papers 3 months ago

Empty Shelves or Lost Keys? Recall Is the Bottleneck for Parametric Factuality

Paper • 2602.14080 • Published Feb 15 • 21

On the Mechanism and Dynamics of Modular Addition: Fourier Features, Lottery Ticket, and Grokking

Paper • 2602.16849 • Published Feb 18 • 7

commented on a paper 3 months ago

DICE: Diffusion Large Language Models Excel at Generating CUDA Kernels

Paper • 2602.11715 • Published Feb 12 • 7

upvoted a paper 3 months ago

DICE: Diffusion Large Language Models Excel at Generating CUDA Kernels

Paper • 2602.11715 • Published Feb 12 • 7

commented on a paper 3 months ago

Pretraining A Large Language Model using Distributed GPUs: A Memory-Efficient Decentralized Paradigm

Paper • 2602.11543 • Published Feb 12 • 6

upvoted a paper 3 months ago

Pretraining A Large Language Model using Distributed GPUs: A Memory-Efficient Decentralized Paradigm

Paper • 2602.11543 • Published Feb 12 • 6
