Granite 4.0 3B Vision: Compact Multimodal Intelligence for Enterprise Documents
•
30
LLMs for language and code + Time series and geospatial foundation models
ChartNet: A Million-Scale, High-Quality Multimodal Dataset for Robust Chart Understanding
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion
Extract and convert document content from images
Convert document images to HTML with Docling
Granite 4.0 1B Speech recognition and translation demo
RAG example using Granite [vision, embedding, instruct]
Extract and convert document content from images
Convert document images to HTML with Docling
Granite 4.0 1B Speech recognition and translation demo
RAG example using Granite [vision, embedding, instruct]