ReflexiCoder: Teaching Large Language Models to Self-Reflect on Generated Code and Self-Correct It via Reinforcement Learning Paper • 2603.05863 • Published Mar 6 • 6
ReflexiCoder: Teaching Large Language Models to Self-Reflect on Generated Code and Self-Correct It via Reinforcement Learning Paper • 2603.05863 • Published Mar 6 • 6
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-FP8 Text Generation • 32B • Updated Mar 15 • 540k • • 348