Austin Liu

Austin362667

AI & ML interests

None yet

Recent Activity

upvoted a paper 9 days ago

Embarrassingly Simple Self-Distillation Improves Code Generation

updated a model 27 days ago

Austin362667/Qwen3-1.7B-MLX-bf16-python-18k-alpaca

updated a model 27 days ago

Austin362667/Qwen3-0.6B-MLX-bf16-python-18k-alpaca

Organizations

None yet

upvoted a paper 9 days ago

Embarrassingly Simple Self-Distillation Improves Code Generation

Paper • 2604.01193 • Published 10 days ago • 34

updated 3 models 27 days ago

published a model 27 days ago

Austin362667/Qwen3-0.6B-MLX-bf16-python-5k-alpaca-resampled-Qwen-4B

Text Generation • 0.6B • Updated 27 days ago • 656

updated a dataset 27 days ago

Austin362667/python_code_instructions_5k_alpaca_qwen3_4B_resampled

Viewer • Updated 27 days ago • 5k • 13

published a dataset 27 days ago

Austin362667/python_code_instructions_5k_alpaca_qwen3_4B_resampled

Viewer • Updated 27 days ago • 5k • 13

published 2 models 28 days ago

Austin362667/Qwen3-1.7B-MLX-bf16-python-18k-alpaca

Text Generation • 2B • Updated 27 days ago • 843

Austin362667/Qwen3-0.6B-MLX-bf16-python-18k-alpaca

Text Generation • 0.6B • Updated 27 days ago • 652

updated a dataset 28 days ago

Austin362667/python_code_instructions_5_alpaca_qwen3_4B_resampled

Viewer • Updated 28 days ago • 5.01k • 16

published a dataset 30 days ago

Austin362667/python_code_instructions_5_alpaca_qwen3_4B_resampled

Viewer • Updated 28 days ago • 5.01k • 16

upvoted 2 articles about 1 month ago

Article

Assisted Generation: a new direction toward low-latency text generation

May 11, 2023

Article

A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes

Aug 17, 2022 • 129

upvoted a collection about 1 month ago

SiliconMind-V1

Collection

4 items • Updated Feb 11 • 2

upvoted 2 articles about 2 months ago

Article

OpenEvolve: An Open Source Implementation of Google DeepMind's AlphaEvolve

May 20, 2025

Article

KV Cache from scratch in nanoVLM

Jun 4, 2025 • 115

upvoted an article 2 months ago

Article

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

Jan 27

upvoted an article 4 months ago

Article

Continuous batching from first principles

Nov 25, 2025 • 356

upvoted 2 articles 6 months ago

Article

Key Insights into the Law of Vision Representations in MLLMs

Sep 2, 2024

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

Jan 30, 2025 • 292
