BigScience Workshop

non-profit

https://bigscience.huggingface.co

bigscience-workshop

AI & ML interests

A one-year long research workshop on large language models: the Summer of Language Models 21 🌸

Recent Activity

pbloem submitted a paper about 1 month ago

Predicting integers from continuous parameters

odegiber authored a paper about 2 months ago

Scaling Low-Resource MT via Synthetic Data Generation with LLMs

odegiber authored a paper about 2 months ago

Open Machine Translation for Esperanto

View all activity

submitted a paper to Daily Papers about 4 hours ago

SCOPE: Self-Play via Co-Evolving Policies for Open-Ended Tasks

Paper • 2605.31433 • Published 3 days ago • 9

shubhamagarwal92

authored 11 papers about 1 month ago

BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks

Paper • 2412.04626 • Published Dec 5, 2024 • 14

LitLLM: A Toolkit for Scientific Literature Review

Paper • 2402.01788 • Published Mar 21, 2025

History for Visual Dialog: Do we really need it?

Paper • 2005.07493 • Published May 8, 2020

Chitrarth: Bridging Vision and Language for a Billion People

Paper • 2502.15392 • Published Feb 21, 2025

LitLLMs, LLMs for Literature Review: Are we there yet?

Paper • 2412.15249 • Published Dec 15, 2024 • 2

IndicVisionBench: Benchmarking Cultural and Multilingual Understanding in VLMs

Paper • 2511.04727 • Published Nov 6, 2025

VoiceAgentBench: Are Voice Assistants ready for agentic tasks?

Paper • 2510.07978 • Published Oct 9, 2025

Seeing Straight: Document Orientation Detection for Efficient OCR

Paper • 2511.04161 • Published Nov 6, 2025

Designing Production-Scale OCR for India: Multilingual and Domain-Specific Systems

Paper • 2602.16430 • Published Feb 18

Chitranuvad: Adapting Multi-Lingual LLMs for Multimodal Translation

Paper • 2502.20420 • Published Feb 27, 2025

MUTANT: A Recipe for Multilingual Tokenizer Design

Paper • 2511.03237 • Published Mar 22

submitted a paper to Daily Papers 2 months ago

Composer 2 Technical Report

Paper • 2603.24477 • Published Mar 25 • 18

RTT1

authored a paper 3 months ago

EvoClaw: Evaluating AI Agents on Continuous Software Evolution

Paper • 2603.13428 • Published Mar 13 • 21

authored 3 papers 3 months ago

Agentic Uncertainty Reveals Agentic Overconfidence

Paper • 2602.06948 • Published Feb 6

Complex Query Answering with Neural Link Predictors

Paper • 2011.03459 • Published Nov 6, 2020

Rethinking the Harmonic Loss via Non-Euclidean Distance Layers

Paper • 2603.10225 • Published Mar 10

in bigscience/bloom 3 months ago

[SPAM] Deleted

#289 opened 3 months ago by

authored a paper 3 months ago

LLM2Vec-Gen: Generative Embeddings from Large Language Models

Paper • 2603.10913 • Published Mar 11 • 44

stas

posted an update 3 months ago

Post

244

Good news! Ulysses Sequence Parallelism from the Snowflake AI Research and the Deepspeed teams has been integrated into
HuggingFace Trainer, Accelerate and TRL

For extensive details please see this writeup:
https://huggingface.co/blog/ulysses-sp

Thanks a lot to Kashif Rasul for helping make it happen. Also the others in the HF team who helped with integration.