SAM Audio
Segment any sound with text, visual, or time prompts
About
Meta Segment Anything Model Audio (SAM Audio) is an AI research model from Meta that separates a target sound from any audio or audio-visual source using simple text, visual, or time-span prompts. It is a state-of-the-art, unified multimodal model that isolates general sounds (e.g., traffic, barking), music (instruments, vocals), and speech (speaker isolation, voice separation) from complex mixtures. The model is generative: it is powered by a flow-matching Diffusion Transformer and operates in a DAC-VAE latent space. Meta has also released a first-of-its-kind open-source evaluation dataset for prompted audio separation. The technology has real-world applications, particularly for people with disabilities and in hearing technology, as highlighted by 2gether-International and Starkey. SAM Audio is part of the broader Segment Anything Model family, which includes SAM 3 for image/video object segmentation and SAM 3D for 3D reconstruction.
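To give a rough feel for what "flow-matching" generation means, here is a minimal sketch of the general sampling idea: start from noise and integrate a velocity field over time until a latent is reached. This is not SAM Audio's implementation. The function names (`toy_velocity`, `euler_sample`), the three-value "latent", and the closed-form velocity are all illustrative assumptions; in a real system the velocity would come from a trained Diffusion Transformer conditioned on the prompt, and the result would be decoded by the audio VAE.

```python
import random


def toy_velocity(x, t, target):
    """Velocity of a straight-line path toward `target`.

    Hypothetical stand-in for a learned, prompt-conditioned network.
    """
    return [(target_i - x_i) / (1.0 - t) for x_i, target_i in zip(x, target)]


def euler_sample(target, steps=10, seed=0):
    """Integrate dx/dt = v(x, t) from t = 0 (noise) to t = 1 with Euler steps."""
    rng = random.Random(seed)
    x = [rng.gauss(0.0, 1.0) for _ in target]  # Gaussian noise at t = 0
    dt = 1.0 / steps
    for k in range(steps):
        t = k * dt  # t stays below 1, so the division above is safe
        v = toy_velocity(x, t, target)
        x = [x_i + dt * v_i for x_i, v_i in zip(x, v)]
    return x


# Assumed toy "latent" to recover; real latents are far higher-dimensional.
latent = euler_sample([0.5, -1.0, 2.0])
print(latent)
```

With this toy velocity field the Euler steps end on the target latent (up to floating-point error), which shows only the mechanics of the sampler, not the quality of any real model.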
Color Palette
Background White: #FFFFFF
Text Black/Dark Gray: #000000
Meta Blue (Links/Branding): #0078FF
Separator Gray: #CCCCCC
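For anyone who wants to reuse these colors, the palette can be captured as a small lookup table. The sketch below is one way to do that; the key names (`background`, `text`, `link`, `separator`) and the helper `hex_to_rgb` are illustrative assumptions, and only the hex values come from the listing.

```python
# Hex values are taken from the palette listed here; key names are assumed.
PALETTE = {
    "background": "#FFFFFF",  # Background White
    "text": "#000000",        # Text Black/Dark Gray
    "link": "#0078FF",        # Meta Blue (Links/Branding)
    "separator": "#CCCCCC",   # Separator Gray
}


def hex_to_rgb(hex_color):
    """Convert a '#RRGGBB' string to an (R, G, B) tuple of ints in 0..255."""
    digits = hex_color.lstrip("#")
    return tuple(int(digits[i:i + 2], 16) for i in (0, 2, 4))


for name, value in PALETTE.items():
    print(name, value, hex_to_rgb(value))
```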
Typography
Sans-serif: Headings, Body Text
Similar Products
Clear for Slack
Clear messages get answered quicker
Griply 2026
Achieve your goals with a goal-oriented task manager
vibecoder.date
Find who you vibe with, git commit to love
HappyMail
We made email simple again
Blober.io
The easiest way to transfer files between cloud providers.
Supaguard
Scan, Detect & Protect Your Supabase Data
Timelines Time Tracking 4
Track your time to achieve your New Year resolutions.
SoftReveal — Reveal less. Engage more.
Hide Content, Reveal on Click
CalPal
The notebook calculator that thinks for you (now with AI).
Reword
Rewrite messages without leaving your workflow
Radial
Your shortcuts, one gesture away
MoovAI
Launch viral AI ads & pro social content in minutes