Dr. Joao Paulo Schwarz Schuler PRO
schuler
AI & ML interests
artificial intelligence
Recent Activity
reacted to ajibawa-2023's post 5 days ago
PHP-Code-Large
Dataset: https://huggingface.co/datasets/ajibawa-2023/PHP-Code-Large
PHP-Code-Large is a large-scale corpus of PHP source code comprising more than 12 million lines of PHP code. The dataset is designed to support research in large language model (LLM) pretraining, code intelligence, software engineering automation, and static program analysis for the PHP ecosystem.
By providing a high-volume, language-specific corpus, PHP-Code-Large enables systematic experimentation in PHP-focused model training, domain adaptation, and downstream code understanding tasks.
PHP-Code-Large addresses the need for a dedicated PHP-only dataset at substantial scale, enabling focused research across backend systems, CMS platforms, APIs, and full-stack PHP environments.
posted an update 6 days ago
⚡ Speaking is faster than typing. BPSA is a powerful open-source agentic coding tool, a smolagents fork with an interactive REPL CLI, and it now has voice support!
♿ Accessibility: It's a huge benefit for people with mobility issues, dyslexia, or other conditions that make typing difficult.
Hands-free operation: You can dictate while doing other things, like looking at documents or moving around.
https://github.com/joaopauloschuler/beyond-python-smolagents/
Organizations
None yet