Benjamim Alves Nepomuceno Neto
- Running · Featured · 2.03k
Wan2.1
Wan: Open and Advanced Large-Scale Video Generative Models
- Runtime error · MCP · Featured · 1.6k
Wan2.1 Fast
Generate a video from an image with a prompt
- Runtime error · Featured · 72
NAG Wan2-1-fast
Demo of Normalized Attention Guidance for 4 steps Wan2.1
- Paused · MCP · Featured · 321
Self Forcing Wan 2.1
Real-time video generation
- Running · 43
Mediapipe Face Mesh 3d
Create a 3D glTF face mesh from an image with MediaPipe
- Running · 7
Mediapipe Head Pose Estimation
2 head pose estimation with mediapipe and trained-model
- Running · 10
Mediapipe 68 Points Facial Mask
Create facial masks from 68-point landmarks
- Running on Zero · Featured · 1.1k
InfiniteYou-FLUX
Flexible Photo Recrafting While Preserving Your Identity
- Running on Zero · Featured · 219
MatAnyone
Gradio demo for MatAnyone 1 & 2
- Running on Zero · Featured · 590
Video Background Removal
Remove/Change background of video.
- Running on Zero · Featured · 111
SAM3 Video Segmentation
Track and label objects in videos using text prompts or clicks
- Running on Zero · 16
VideoMaMa
Remove video backgrounds and generate matte videos
- Build error · 117
Dpt Depth Estimation + 3D Voxels
Create 3D models from images using depth estimation
- Running on Zero · 3.24k
Hunyuan3D-2.0
Text-to-3D and Image-to-3D Generation
- Running on Zero · Featured · 4.78k
TRELLIS
Scalable and Versatile 3D Generation from images
- Running on Zero · Featured · 219
Video Depth Anything
Generate depth video from input video
- Running · Featured · 178
Manimator
Transform research papers and mathematical concepts into stu
- Paused · Featured · 179
Gaze Demo
Gaze detection using Moondream
- Running · 11
Metropolitan Museum
The Metropolitan Museum of Art Collection
- Sleeping · Featured · 119
CountGD_Multi-Modal_Open-World_Counting
Count objects in images using text, visual examples, or both
- Running on Zero · Featured · 570
Midi Music Generator
Generate AI-composed MIDI music
- Paused · Featured · 202
YuE
Generate music from lyrics and genre tags
- Paused · 51
Open SUNO
Your Lyrics into Complete Songs with Vocals in Multilingual
- Running on Zero · Featured · 679
Di♪♪Rhythm
Blazingly Fast and Embarrassingly Simple Song Generation
- Running on Zero · Featured · 259
SD3 Long Captioner
Generate detailed captions for any image
- Runtime error · Featured · 111
ChartGemma
Generate insights from charts using text prompts
- Running on Zero · 90
AuraFlow-v0.3 with Captioner
Generate images from prompts or images
- openai/clip-vit-large-patch14
Zero-Shot Image Classification • 0.4B • Updated • 9.08M • 1.97k
- Runtime error · Featured · 462
Omni-Zero
Restylize & repose person ID
- Running on Zero · 1.2k
PhotoMaker V2
Generate personalized portrait images from your photos
- Runtime error · Featured · 641
FLUX.1 [Inpainting]
- Running on L40S · Featured · 1.63k
Expression Editor
Quickly edit the expression of a face
- Running on Zero · Featured · 934
MMAudio - generating synchronized audio from video/text
Generate synchronized audio for videos from text prompts
- Running on Zero · 326
TangoFlux
Text to Audio (Sound SFX) Generator
- Running on Zero · 463
Stable Audio Open Zero
Generate custom audio tracks from text prompts
- Paused · Featured · 202
YuE
Generate music from lyrics and genre tags
- Running on Zero · 2.61k
Voice Clone
Generate speech in a cloned voice
- Paused · 849
Ilaria RVC
Convert and separate audio using models and TTS
- Running on Zero · Featured · 922
Screenshot to HTML
Convert a webpage screenshot into editable HTML code
- Running on Zero · MCP · Featured · 33
NeuTTS-Nano Multilingual Collection
Generate speech with voice cloning, now in four languages!
- Running · 378
PDF Chatbot
Ask questions about PDFs using a chatbot
- Runtime error · Featured · 367
Video Transcription Smart Summary
Generate summaries from YouTube videos or uploaded videos
- Running · 137
Quantized Retrieval
Efficient quantized retrieval over Wikipedia
- Running · Featured · 1.31k
FineWeb: decanting the web for the finest text data at scale
Generate a curated web-text dataset for LLM training
- Sleeping · 40
Anime Image Classification
Analyze anime images for various attributes
- Running on Zero · Featured · 170
PaintsUndo
Generate key frames and videos from a single image
- Running on Zero · 160
Kolors IP-Adapter
Create images from text and reference photos
- Running on Zero · Featured · 2.07k
PuLID-FLUX
Generate custom images from text and a reference photo
- Runtime error · Featured · 93
Panoptic Segment Anything
- Runtime error · Featured · 396
Grounded Segment Anything
- Running on Zero · 200
Inspyrenet Remove Background
Remove backgrounds from images, get transparent PNGs
- Runtime error · Featured · 515
Florence2 + SAM2
Segment and caption objects in images and videos
- Runtime error · Featured · 113
BigVGAN
Generate high-quality audio from your input file with BigVGAN
- Running · 24
Audio Emotion Recognition
Detect emotions from audio recordings
- Running · Featured · 61
SoundwaveDemo
Process audio and generate text output based on instructions
- Running · Featured · 71
DiffVox
Enhance vocals with professional effects using sliders
- Running on Zero · Featured · 978
Tile Upscaler
Enhance and upscale images with HDR and tile control
- Runtime error · Featured · 192
SeemoRe
Enhance image details with super-resolution
- Running on Zero · 1.67k
Flux.1-dev Upscaler
Upscale images with AI-powered high-resolution enhancement
- MIT/ast-finetuned-audioset-10-10-0.4593
Audio Classification • 86.6M • Updated • 526k • 345
- Running on Zero · 313
Llasa 3b Tts
Zero Shot voice cloning with llasa 3b (Unofficial Demo)
- Paused · Featured · 202
YuE
Generate music from lyrics and genre tags
- Running on Zero · Featured · 412
Zonos
Generate speech audio from text with voice and emotion tweaks