Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
Michael Goin's picture
104 21 21

Michael Goin

mgoin
geetu040's profile picture pierrci's profile picture jaypyon's profile picture
·
  • mgoin_
  • mgoin

AI & ML interests

LLM inference optimization, compression, quantization, pruning, distillation

Recent Activity

updated a model 8 days ago
google/gemma-4-E4B-it-qat-mobile-ct
updated a model 8 days ago
google/gemma-4-E2B-it-qat-mobile-ct
published a model 10 days ago
google/gemma-4-E4B-it-qat-mobile-ct
View all activity

Organizations

Neural Magic's profile picture garage-bAInd's profile picture Blog-explorers's profile picture ZeroGPU Explorers's profile picture NM Testing's profile picture Red Hat AI's profile picture Inference Optimization's profile picture gg-hf-qat's profile picture

authored a paper almost 2 years ago

Accurate Compression of Text-to-Image Diffusion Models via Vector Quantization

Paper • 2409.00492 • Published Aug 31, 2024 • 11
authored a paper about 2 years ago

Enabling High-Sparsity Foundational Llama Models with Efficient Pretraining and Deployment

Paper • 2405.03594 • Published May 6, 2024 • 7
authored a paper over 2 years ago

Sparse Finetuning for Inference Acceleration of Large Language Models

Paper • 2310.06927 • Published Oct 10, 2023 • 15
authored a paper almost 3 years ago

The Optimal BERT Surgeon: Scalable and Accurate Second-Order Pruning for Large Language Models

Paper • 2203.07259 • Published Mar 14, 2022 • 4
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs