AI & ML interests
Enterprise AI and ML, Foundation Models, Responsible AI
Papers
VAREX: A Benchmark for Multi-Modal Structured Extraction from Documents
Look Where It Matters: High-Resolution Crops Retrieval for Efficient VLMs
- ibm-research/ttm-research-r2
  Time Series Forecasting • 855k • Updated • 4.81k • 5
- ibm-research/ttm-r3
  Time Series Forecasting • 1.41M • Updated • 28.9k • 1
- ibm-research/flowstate
  Time Series Forecasting • 9.07M • Updated • 663k • 7
- ibm-research/patchtst-fm-r1
  Time Series Forecasting • 0.3B • Updated • 25.4k • 7
- AssetOpsBench: Benchmarking AI Agents for Task Automation in Industrial Asset Operations and Maintenance
  Paper • 2506.03828 • Published • 18
- FailureSensorIQ: A Multi-Choice QA Dataset for Understanding Sensor Relationships and Failure Modes
  Paper • 2506.03278 • Published • 6
- ibm-research/AssetOpsBench
  Viewer • Updated • 467 • 662 • 15
- AssetOpsBench
  📉 3 • Evaluating Autonomous AI Agents for Industry 4.0 Tasks
- AssetOpsBench
  🚀 18 • Generate and benchmark machine learning models with ease
- CUGA Agent
  🤖 98 • Configurable Generalist Agent, leader in AppWorld Benchmark
- ITBench-Lite-Space
  🚀 6 • Develop and run interactive code notebooks with JupyterLab
- VAKRA Leaderboard
  🏆 10 • Benchmark AI agents on multi‑hop, multi‑source enterprise tasks
- ibm-research/granite-3.2-2b-instruct-GGUF
  Text Generation • 3B • Updated • 580 • 11
- ibm-research/granite-3.2-8b-instruct-GGUF
  Text Generation • 8B • Updated • 618 • 8
- ibm-research/granite-vision-3.2-2b-GGUF
  3B • Updated • 624 • 11
- ibm-research/granite-guardian-3.2-3b-a800m-GGUF
  Text Generation • 3B • Updated • 145 • 2