AI & ML interests
Post training & evals
Organizations
lvogel123/cybench-results
• Updated • 52 • 26
lvogel123/gpqa-diamond-all
• Updated • 10 • 27
lvogel123/gpqa-diamond-glm-4.6-2
• Updated • 199 • 37
lvogel123/jailbreak-gpt-oss-120b-high-2
• Updated • 1.45k • 5
lvogel123/arc-agi-1-kimi-k2
• Updated • 402 • 6
lvogel123/gpqa-diamond-gpt-oss-120b-high
• Updated • 200 • 41
lvogel123/factscore-glm-4.6
• Updated • 152 • 5
lvogel123/jailbreak-llamma-3.3-nemotron
• Updated • 540 • 4
lvogel123/factscore-grok-4
• Updated • 152 • 2
lvogel123/factscore-kimi-k2
• Updated • 152 • 4
lvogel123/factscore-gpt-5-high
• Updated • 152 • 4
lvogel123/factscore-llama-3.3-nemotron-super-49b-v1.5
• Updated • 152 • 4
lvogel123/factscore-deepseek-v3.2-exp
• Updated • 152 • 7
lvogel123/factscore-qwen3-235b-a22b-thinking-2507
• Updated • 152 • 5
lvogel123/gpqa-diamond-kimi-k2
• Updated • 200 • 36
lvogel123/jailbreak-kimi-k2
• Updated • 1.45k • 7
lvogel123/factscore-gemini-2.5-pro
• Updated • 152 • 4
lvogel123/gpqa-diamond-deepseek-v3.2-exp-high
• Updated • 200 • 37
lvogel123/factscore-gpt-oss-120b-high
• Updated • 152 • 4
lvogel123/jailbreak-deepseek-v3.2-exp
• Updated • 1.45k • 5
lvogel123/factscore-claude-4.5-sonnet
• Updated • 152 • 8
lvogel123/factscore-llama-4-maverick
• Updated • 152 • 3
lvogel123/cybench-gpt-5-high
• Updated • 11 • 4