The post Meta Introduces SAM Audio for Advanced Sound Isolation Using Multimodal Prompts appeared on BitcoinEthereumNews.com. Tony Kim Dec 16, 2025 16:47 MetaThe post Meta Introduces SAM Audio for Advanced Sound Isolation Using Multimodal Prompts appeared on BitcoinEthereumNews.com. Tony Kim Dec 16, 2025 16:47 Meta

Meta Introduces SAM Audio for Advanced Sound Isolation Using Multimodal Prompts



Tony Kim
Dec 16, 2025 16:47

Meta’s SAM Audio leverages multimodal prompts for audio separation, offering intuitive sound isolation capabilities. The model introduces state-of-the-art features for various audio processing tasks.

Meta AI has unveiled SAM Audio, a groundbreaking model designed to transform audio processing by enabling the isolation of sounds from complex audio mixtures using intuitive, multimodal prompts. This innovative model allows users to employ text, visual cues, or time segment marking to separate audio components, according to Meta AI.

Revolutionizing Audio Processing

Building on previous advancements, SAM Audio employs the Perception Encoder Audiovisual (PE-AV), a technical engine enhancing its performance in various audio separation tasks. This model mirrors the functionality of the Segment Anything Model (SAM), which revolutionized object segmentation in images and videos. SAM Audio aims to make audio separation more accessible and practical by adopting a user-friendly approach that aligns with natural human interaction with sound.

Technical Innovations

The core of SAM Audio is its ability to perform across multiple modalities, such as text, visual, and temporal cues, providing users with precise control over audio separation. This is achieved through three primary methods:

  • Text Prompting: Allows users to type specific sounds, like “dog barking,” to isolate them.
  • Visual Prompting: Enables clicking on objects or speakers in videos to isolate their audio.
  • Span Prompting: An innovative approach allowing users to mark time segments for target audio isolation.

The model’s architecture leverages a flow-matching diffusion transformer, encoding audio mixtures and prompts into a shared representation to generate target and residual audio tracks. This is supported by a robust data engine that synthesizes large-scale, high-quality separation data, enhancing the model’s applicability in real-world scenarios.

PE-AV: The Engine Behind SAM Audio

PE-AV, built on Meta’s open-source Perception Encoder model, extends advanced computer vision capabilities to audio. It aligns video features with audio, allowing accurate separation of visually grounded sources and inferring off-screen events. This temporal alignment supports high-precision multimodal audio separation, crucial for flexible and perceptually accurate outcomes.

Benchmarking and Evaluation

Meta has introduced SAM Audio Judge and SAM Audio-Bench to evaluate and benchmark audio separation models. SAM Audio Judge offers a reference-free, objective metric for assessing audio segmentation quality, while SAM Audio-Bench provides a comprehensive benchmark covering speech, music, and general sound effects using multimodal prompts.

These innovations position SAM Audio as a leading model in audio separation technology, achieving state-of-the-art results across various tasks and outperforming previous models in efficiency and quality. While challenges remain, such as the separation of similar audio events, the model’s capabilities in handling mixed-modality prompts demonstrate significant advancements in the field.

Looking Ahead

Meta envisions SAM Audio as a tool for empowering creators, researchers, and developers to explore new forms of expression and application development. The collaboration with partners like Starkey and 2gether-International highlights the model’s potential in advancing accessibility. SAM Audio marks a step towards more inclusive and creative AI, paving the way for future innovations in audio-aware technologies.

Image source: Shutterstock

Source: https://blockchain.news/news/meta-introduces-sam-audio-for-advanced-sound-isolation

Market Opportunity
LiveArt Logo
LiveArt Price(ART)
$0.0005203
$0.0005203$0.0005203
+0.03%
USD
LiveArt (ART) Live Price Chart
Disclaimer: The articles reposted on this site are sourced from public platforms and are provided for informational purposes only. They do not necessarily reflect the views of MEXC. All rights remain with the original authors. If you believe any content infringes on third-party rights, please contact [email protected] for removal. MEXC makes no guarantees regarding the accuracy, completeness, or timeliness of the content and is not responsible for any actions taken based on the information provided. The content does not constitute financial, legal, or other professional advice, nor should it be considered a recommendation or endorsement by MEXC.

You May Also Like

Is Doge Losing Steam As Traders Choose Pepeto For The Best Crypto Investment?

Is Doge Losing Steam As Traders Choose Pepeto For The Best Crypto Investment?

The post Is Doge Losing Steam As Traders Choose Pepeto For The Best Crypto Investment? appeared on BitcoinEthereumNews.com. Crypto News 17 September 2025 | 17:39 Is dogecoin really fading? As traders hunt the best crypto to buy now and weigh 2025 picks, Dogecoin (DOGE) still owns the meme coin spotlight, yet upside looks capped, today’s Dogecoin price prediction says as much. Attention is shifting to projects that blend culture with real on-chain tools. Buyers searching “best crypto to buy now” want shipped products, audits, and transparent tokenomics. That frames the true matchup: dogecoin vs. Pepeto. Enter Pepeto (PEPETO), an Ethereum-based memecoin with working rails: PepetoSwap, a zero-fee DEX, plus Pepeto Bridge for smooth cross-chain moves. By fusing story with tools people can use now, and speaking directly to crypto presale 2025 demand, Pepeto puts utility, clarity, and distribution in front. In a market where legacy meme coin leaders risk drifting on sentiment, Pepeto’s execution gives it a real seat in the “best crypto to buy now” debate. First, a quick look at why dogecoin may be losing altitude. Dogecoin Price Prediction: Is Doge Really Fading? Remember when dogecoin made crypto feel simple? In 2013, DOGE turned a meme into money and a loose forum into a movement. A decade on, the nonstop momentum has cooled; the backdrop is different, and the market is far more selective. With DOGE circling ~$0.268, the tape reads bearish-to-neutral for the next few weeks: hold the $0.26 shelf on daily closes and expect choppy range-trading toward $0.29–$0.30 where rallies keep stalling; lose $0.26 decisively and momentum often bleeds into $0.245 with risk of a deeper probe toward $0.22–$0.21; reclaim $0.30 on a clean daily close and the downside bias is likely neutralized, opening room for a squeeze into the low-$0.30s. Source: CoinMarketcap / TradingView Beyond the dogecoin price prediction, DOGE still centers on payments and lacks native smart contracts; ZK-proof verification is proposed,…
Share
BitcoinEthereumNews2025/09/18 00:14
ServicePower Closes Transformative Year with AI-Driven Growth and Market Expansion

ServicePower Closes Transformative Year with AI-Driven Growth and Market Expansion

Double-digit growth, 50% team expansion, and accelerated innovation define 2025 momentum MCLEAN, Va., Dec. 18, 2025 /PRNewswire/ — ServicePower, a leading provider
Share
AI Journal2025/12/18 23:32
Franklin Templeton CEO Dismisses 50bps Rate Cut Ahead FOMC

Franklin Templeton CEO Dismisses 50bps Rate Cut Ahead FOMC

The post Franklin Templeton CEO Dismisses 50bps Rate Cut Ahead FOMC appeared on BitcoinEthereumNews.com. Franklin Templeton CEO Jenny Johnson has weighed in on whether the Federal Reserve should make a 25 basis points (bps) Fed rate cut or 50 bps cut. This comes ahead of the Fed decision today at today’s FOMC meeting, with the market pricing in a 25 bps cut. Bitcoin and the broader crypto market are currently trading flat ahead of the rate cut decision. Franklin Templeton CEO Weighs In On Potential FOMC Decision In a CNBC interview, Jenny Johnson said that she expects the Fed to make a 25 bps cut today instead of a 50 bps cut. She acknowledged the jobs data, which suggested that the labor market is weakening. However, she noted that this data is backward-looking, indicating that it doesn’t show the current state of the economy. She alluded to the wage growth, which she remarked is an indication of a robust labor market. She added that retail sales are up and that consumers are still spending, despite inflation being sticky at 3%, which makes a case for why the FOMC should opt against a 50-basis-point Fed rate cut. In line with this, the Franklin Templeton CEO said that she would go with a 25 bps rate cut if she were Jerome Powell. She remarked that the Fed still has the October and December FOMC meetings to make further cuts if the incoming data warrants it. Johnson also asserted that the data show a robust economy. However, she noted that there can’t be an argument for no Fed rate cut since Powell already signaled at Jackson Hole that they were likely to lower interest rates at this meeting due to concerns over a weakening labor market. Notably, her comment comes as experts argue for both sides on why the Fed should make a 25 bps cut or…
Share
BitcoinEthereumNews2025/09/18 00:36