AI-Powered Audio Transcriptions

Update: This feature is out of beta and in general availability.

At Dyte, we are committed to revolutionizing real-time communication, and we're thrilled to announce an exciting addition to our platform: AI-powered audio transcriptions. This feature is designed to enhance your communication experience by effortlessly converting spoken words into written text.

We're introducing an efficient and accurate way to transcribe your conversations in meetings. We provide transcriptions in 2 forms:

Live transcriptions: The transcripts can be consumed on the client side using the Dyte SDK that's suitable for your platform. These transcripts are generated on the server in real-time.
Post-meeting webhooks: The meeting transcript can be consumed via a webhook after the meeting ends.

This release marks the step in a journey toward providing you with cutting-edge AI capabilities. We’re also in the process of developing features to support other AI features like meeting agenda generation and meeting summarization.

Usage

Dyte's transcription APIs likely offer a programmatic way for developers to integrate transcription capabilities into their applications or services. To know more about the transcription APIs in detail, check out this guide.

As always, making sure that our features are developer-friendly is our top priority. Thus, we provide a very simple-to-use API in our client SDKs for you to be able to consume real-time audio transcriptions. Here’s an example of how to use it in our web core SDK.

meeting.ai.on('transcript', (transcriptionData) => {
    console.log(transcriptionData);
});

The transcriptionData object consists of the following information:

An ID to uniquely identify the transcript
The name of the speaker
The ID of the speaker
The transcribed speech
A timestamp of when the speaker had spoken

The transcriptionData object can be represented with the help of the following interface.

export interface Transcript {
  id: string;
  name: string;
  peerId: string;
  transcript: string;
  date: Date;
}

The meeting.ai object emits transcripts only when it’s enabled in the preset of the participant who is speaking. To learn more about how to enable this feature for a participant, check out the transcription guide.

Key Features

Dyte's transcriptions offer a robust suite of features, as mentioned below.

Real-Time Accuracy: Our AI engine provides instant and accurate audio transcriptions, ensuring you stay in sync with the conversation.
Speaker Identification: Easily identify speakers with our speaker attribution feature, making it clearer who said what.
Searchable Transcripts: Search through the transcript to quickly locate specific points in the conversation, streamlining post-meeting analysis.

Pricing

Our AI-driven audio transcription service is priced at $0.015 per minute, offering precise and rapid transcription for any volume of content. This straightforward rate ensures transparent billing for all your transcription needs.

AI-Powered Audio Transcriptions

Usage

Key Features

Pricing

AI Generated Background Images for Video Calls

The Future of AI in Dyte

Supercharge Your Meetings with Dyte Meeting AI

Start building the future on live video

Start building the
future on live video