Improve model card: Add pipeline tag, library, paper & code links, introduction, and installation

by nielsr HF Staff - opened Oct 4

base: refs/heads/main ← from: refs/pr/1

+89 -3

nielsr

Oct 4

This PR significantly improves the model card for the ExGRPO-Llama3.1-8B-Zero model by:

- Adding `pipeline_tag: text-generation` so the model is categorized correctly and is discoverable on the Hugging Face Hub.
- Including `library_name: transformers` to enable the automated "How to use" widget. The config.json file supports this choice: it lists the LlamaForCausalLM architecture and a transformers_version field.
- Providing a descriptive introduction based on the paper abstract and the GitHub repository's README.md.
- Linking directly to the associated paper: ExGRPO: Learning to Reason from Experience.
- Adding a link to the official GitHub repository (https://github.com/ElliottYan/LUFFY/tree/main/ExGRPO) for the project code.
- Incorporating the visual overview image, key highlights, and installation instructions from the GitHub README.md to help users get started.
- Including the "Released Models" table and citation information from the GitHub README.md.

These enhancements make the model's documentation more comprehensive, improve its discoverability, and make adoption by the community easier.
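For readers who want to see how the two metadata fields relate to config.json, here is a minimal Python sketch. It is an illustration under stated assumptions, not the method used in this PR or by the Hub: the helper names `infer_card_metadata` and `render_front_matter` are hypothetical, the "ends with ForCausalLM" heuristic is an assumption, and the example config is a made-up excerpt with a placeholder version string.

```python
import json


def infer_card_metadata(config: dict) -> dict:
    """Derive model card metadata from a config.json-style dict.

    Illustrative heuristic only (an assumption, not the Hub's own logic).
    """
    metadata = {}
    # A `transformers_version` key suggests the checkpoint was saved with transformers.
    if "transformers_version" in config:
        metadata["library_name"] = "transformers"
    # Causal LM architectures (e.g. LlamaForCausalLM) correspond to text generation.
    architectures = config.get("architectures", [])
    if any(name.endswith("ForCausalLM") for name in architectures):
        metadata["pipeline_tag"] = "text-generation"
    return metadata


def render_front_matter(metadata: dict) -> str:
    """Render flat string metadata as a YAML front matter block for README.md."""
    lines = ["---"]
    lines += [f"{key}: {value}" for key, value in sorted(metadata.items())]
    lines.append("---")
    return "\n".join(lines) + "\n"


# Hypothetical excerpt of a config.json; the version string is a placeholder.
example_config = json.loads(
    '{"architectures": ["LlamaForCausalLM"], "transformers_version": "0.0.0"}'
)
front_matter = render_front_matter(infer_card_metadata(example_config))
print(front_matter)
```

The printed block is what would sit at the very top of the model card's README.md, above the introduction and installation sections.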

Improve model card: Add pipeline tag, library, paper & code links, introduction, and installation (c3cd8dad)

rzzhan changed pull request status to merged Oct 24
