
Akash kathole

akashkathole

AI & ML interests

None yet

Recent Activity

posted an update 27 days ago
🚀 Just shipped reconcile_gst2b_env at OpenEnv Hackathon 2026 (Meta x Scaler India). An RL environment for the monthly GST tax reconciliation that 14M Indian businesses do by hand.
Trained Qwen3-4B SFT + GRPO with custom Tier 2c length-shaping reward modification. Headline: n=5 mean composite reward 0.305, +69% over prompted baseline.
5 documented failure modes including a novel research finding: the SAME composite reward design that defends against 6 red-team attacks ALSO makes a 3-step shortcut score higher than 50 steps of honest training. Empirically proven on-site (step-350 mean > step-375 mean).
Live demo + repo + writeup linked below.
🔗 https://huggingface.co/spaces/akashkathole/reconcile_gst2b_env
🎥 youtube.com/watch?v=K-sZ8c1TMjw
📝 BLOG.md in the Space
updated a Space 28 days ago
akashkathole/reconcile_gst2b_env
published a Space about 1 month ago
akashkathole/reconcile_gst2b_env

Organizations

Blog-explorers

akashkathole's models (1)

akashkathole/lora_model

Updated Jun 21, 2024