Space for RePro: Training Language Models to Faithfully Recycle the Web for Pretraining
Models (5)
cx-cmu/AutoGEO_mini_Qwen1.7B_Ecommerce • Text Generation • 2B • 19
cx-cmu/AutoGEO_mini_Qwen1.7B_GEOBench • Text Generation • 2B • 14
cx-cmu/AutoGEO_mini_Qwen1.7B_ResearchyGEO • Text Generation • 2B • 31
cx-cmu/repro-rephraser-1B • 1B • 13
cx-cmu/repro-rephraser-4B • Text Generation • 196k • 7 • 2
Datasets (8)
cx-cmu/Researchy-GEO • 47k • 44
cx-cmu/GEO-Bench • 37.4k • 59
cx-cmu/E-commerce • 7.97k • 67
cx-cmu/ClueWeb-Reco • 87.2M • 86 • 1
cx-cmu/repro-organic-data-72B • 58.3M • 504
cx-cmu/repro-rl-data • 41k • 20
cx-cmu/repro-rephrased-data-72B • 39M • 648
cx-cmu/CLUE-LLM • 1.21k • 13