IBM

company

Verified

https://www.ibm.com/

AI & ML interests

Enterprise AI and ML, Foundation Models, Responsible AI

Recent Activity

LeoYML submitted a paper about 1 month ago

From Static Templates to Dynamic Runtime Graphs: A Survey of Workflow Optimization for LLM Agents

Inbars authored a paper about 1 month ago

VAREX: A Benchmark for Multi-Modal Structured Extraction from Documents

Avihu submitted a paper about 2 months ago

NLE: Non-autoregressive LLM-based ASR by Transcript Editing

View all activity

Papers

From Static Templates to Dynamic Runtime Graphs: A Survey of Workflow Optimization for LLM Agents

NLE: Non-autoregressive LLM-based ASR by Transcript Editing

View all Papers

ibm 's Spaces 7

BenchBench Leaderboad

Compare benchmarks for language models

Unitxt

Risk Atlas Nexus

Evaluate AI risks with common risk taxonomies

JuStRank

Display ranked LLM judges based on performance metrics

README

Biomed-multi-alignment unified demo with PPI and TDI examples

Demo for MAMMAL approch on multiple domains

Llm Rank Themselves

Rank and compare language models using benchmarks