2 43 38

huy

bui

AI & ML interests

None yet

Recent Activity

liked a model 3 days ago

unsloth/GLM-4.1V-9B-Thinking-GGUF

liked a model 3 days ago

zai-org/AutoGLM-Phone-9B

liked a model about 1 month ago

black-forest-labs/FLUX.2-klein-4B

View all activity

Organizations

upvoted 3 papers 2 months ago

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published Feb 23 • 523

OmniGAIA: Towards Native Omni-Modal AI Agents

Paper • 2602.22897 • Published Feb 26 • 53

Generated Reality: Human-centric World Simulation using Interactive Video Generation with Hand and Camera Control

Paper • 2602.18422 • Published Feb 20 • 30

upvoted an article 4 months ago

Article

CryptGPT: Privacy-Preserving Language Models Using Vigenere Cipher (Part 1)

Jun 16, 2024

•

upvoted a paper 5 months ago

ShowTable: Unlocking Creative Table Visualization with Collaborative Reflection and Refinement

Paper • 2512.13303 • Published Dec 15, 2025 • 17

upvoted an article 5 months ago

Article

MiniGuard-v0.1: Prem's Guardrail Model Redefining the Pareto Frontier

Dec 12, 2025

•

upvoted a collection 5 months ago

Qwen3-VL

Collection

37 items • Updated Dec 31, 2025 • 716

upvoted a paper 5 months ago

NVIDIA Nemotron Parse 1.1

Paper • 2511.20478 • Published Nov 25, 2025 • 23

upvoted 3 papers 6 months ago

Adaptive Multi-Agent Response Refinement in Conversational Systems

Paper • 2511.08319 • Published Nov 11, 2025 • 42

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

Paper • 2511.06221 • Published Nov 9, 2025 • 134

TiDAR: Think in Diffusion, Talk in Autoregression

Paper • 2511.08923 • Published Nov 12, 2025 • 128

upvoted 2 papers 7 months ago

SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse-Linear Attention

Paper • 2509.24006 • Published Sep 28, 2025 • 119

StableToken: A Noise-Robust Semantic Speech Tokenizer for Resilient SpeechLLMs

Paper • 2509.22220 • Published Sep 26, 2025 • 66

upvoted an article 8 months ago

Article

Code a simple RAG from scratch

Oct 29, 2024

•

331

upvoted a paper 8 months ago

Baseer: A Vision-Language Model for Arabic Document-to-Markdown OCR

Paper • 2509.18174 • Published Sep 17, 2025 • 134

upvoted a paper 9 months ago

Diffusion Language Models Know the Answer Before Decoding

Paper • 2508.19982 • Published Aug 27, 2025 • 27

upvoted an article 9 months ago

Article

Introducing smolagents: simple agents that write actions in code.

Dec 31, 2024

•

1.19k

upvoted 2 papers 9 months ago

OmniTry: Virtual Try-On Anything without Masks

Paper • 2508.13632 • Published Aug 19, 2025 • 15

Seeing, Listening, Remembering, and Reasoning: A Multimodal Agent with Long-Term Memory

Paper • 2508.09736 • Published Aug 13, 2025 • 58

upvoted a paper 11 months ago

Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning

Paper • 2506.07044 • Published Jun 8, 2025 • 114

huy

AI & ML interests

Recent Activity

Organizations

bui's activity

CryptGPT: Privacy-Preserving Language Models Using Vigenere Cipher (Part 1)

MiniGuard-v0.1: Prem's Guardrail Model Redefining the Pareto Frontier

Code a simple RAG from scratch

Introducing smolagents: simple agents that write actions in code.