aiOla Launches Whisper-NER Model to Protect Audio Transcription Privacy in Real-Time AI NEWS

Home
AInews
aiOla Launches Whisper-NER Model to Protect Audio Transcription Privacy in Real-Time

aiOla Launches Whisper-NER Model to Protect Audio Transcription Privacy in Real-Time

2024-11-21

AI startup aiOla has introduced a new model called Whisper-NER, designed to address privacy concerns that businesses might face when using artificial intelligence for audio transcription. This model is built upon OpenAI's open-source Whisper model and incorporates both automatic speech recognition (ASR) and named entity recognition (NER). During the transcription process, Whisper-NER automatically identifies and obscures sensitive information such as names, phone numbers, and addresses, ensuring privacy and compliance with data protection regulations while handling speech content.

This new model is now released as fully open-source and is available on Hugging Face and GitHub for enterprises, organizations, and individuals to use, modify, and deploy. Users can try a demo of the model on Hugging Face, experiencing the ability to record audio snippets and automatically mask designated terms in the final text transcription. Tests have demonstrated that the model effectively masks specific terms, including proper nouns and jargon.

Gill Hetz, aiOla's Vice President of Research, stated that the development of this open-source tool aims to advance privacy protection in the AI field. By reducing the need for additional software steps, Whisper-NER assists users in masking sensitive data without increasing complexity. Compared to traditional multi-stage systems, this model eliminates the risk of data exposure during intermediate processing stages, thereby reducing the likelihood of data breaches.

Whisper-NER's source code is released under the MIT License, allowing for free adoption and modification for both community and commercial purposes. The model can be accessed through GitHub and Hugging Face, with its advanced features widely available. Additionally, a demo version is provided, enabling users to explore its functionalities and adaptability.

In terms of training methodology, Whisper-NER is trained using synthetic speech and text-based NER datasets, enabling it to perform transcription and entity recognition tasks simultaneously, thereby enhancing accuracy. The model is designed for zero-shot learning, meaning it can identify and mask entity types that were not explicitly included during training.

For application scenarios where masking is not required, Whisper-NER can be configured to merely tag sensitive entities, offering organizations customizable options according to their needs. Hetz noted that highly regulated industries such as healthcare and legal sectors would benefit the most from this privacy-focused approach, although companies handling less sensitive data can also leverage this technology.

Visual Electric

Visual Electric - AI image generator for collaborative design projects

Marvel

Marvel - Interactive prototyping tool for seamless team collaboration

Coolors

Coolors - Generate custom color palettes

Khroma

Khroma - AI tool for generating personalized color palettes

Kiro AI

Kiro AI - AI IDE transforming prompts into actionable specs

Watermark Remover

Watermark Remover - AI tool for automatic watermark removal

Geo Finder AI

Geo Finder AI - AI tool for identifying locations in media

RECENT AI TOOLS

Dia Browser

Visual Electric

Marvel

Coolors

Khroma

RECENT AI NEWS

AWS Launches Vector Capabilities on Amazon S3

Google Launches Opal, a No-Code Tool for Building AI Mini-Apps

Qwen Launches Qwen3-Coder: Large Agent-Based Coding Model with Open Tools

New ChatGPT Agent Enables Booking, Browsing, and Form Filling—But Trust It Carefully

Trump Reveals Consideration of Splitting NVIDIA During AI Plan Speech

Cognition's AI Developer 'Devin' Eyes $10 Billion Valuation

Leena AI Introduces Voice-Functional AI 'Colleague' to Enhance Workplace Collaboration

Elon Musk Announces AI-Powered Reboot of Vine

RECENT AI TOOLS