6 3 6

Kian Kyars

kyars

https://sites.ualberta.ca/~kkyars/

AI & ML interests

None yet

Recent Activity

updated a Space 1 day ago

kyars/sandbox-448fd80a

published a model 2 days ago

kyars/CogDrift-R1-14B

updated a Space 2 days ago

kyars/sandbox-e6e41285

View all activity

Organizations

updated a Space 1 day ago

ml-intern sandbox

🌍

published a model 2 days ago

kyars/CogDrift-R1-14B

Updated 2 days ago

updated a Space 2 days ago

ml-intern sandbox

🌍

New activity in ScaleAI/SWE-Atlas-QnA 28 days ago

Kudos

❤️ 1

#3 opened 28 days ago by

kyars

updated a Space about 2 months ago

Compute Market Environment Server

📊

Step through a resource‑allocation simulation via web UI

published a Space about 2 months ago

Compute Market Environment Server

📊

Step through a resource‑allocation simulation via web UI

updated a Space about 2 months ago

Compute Market Environment Server

📊

Control a simulated resource allocation environment via web UI

published a Space about 2 months ago

Compute Market Environment Server

📊

Control a simulated resource allocation environment via web UI

updated a dataset 3 months ago

kyars/gpqa-results

Viewer • Updated Jan 21 • 5.94k • 80

published a dataset 3 months ago

kyars/gpqa-results

Viewer • Updated Jan 21 • 5.94k • 80

commented on KV Caching Explained: Optimizing Transformer Inference Efficiency 4 months ago

Yes, it's done for each transformer block in an LM because each transformer block has different attention heads. If you do it for only one transformer block across all blocks, then you don't get the same representation.

commented on KV Caching Explained: Optimizing Transformer Inference Efficiency 4 months ago

I think I got lost around the standard inference versus Kv caching section because I couldn't understand the matmuls happening based on each flashing repetition of those yellow blocks. But perhaps I just need to go through the blog post once again to try to better understand it.