Learn more about our ✨ new ✨ native MCP
Directus Logo
  • Use Cases and Features
    • Headless CMS
      Manage and deliver content with ease
    • Backend-as-a-Service
      Build and ship applications faster
    • Headless Commerce
      A single source of truth for products
    • 100+ More Use Cases
      Build anything (or everything)
    • Instant APIs
      Connect a database, get REST + GraphQL APIs
    • Granular Policy-Based Auth
      Provide secure, autonomous data access
    • Visual Automation Builder
      Automate content and data workflows with ease
    • 50+ More Features
      Get everything you need out-of-the-box
    Project Showcase
    Built With Directus

    Built With Directus

    See what everyone's been building with Directus

  • Learn More
    • Blog
      Read our latest articles and guides
    • Case Studies
      Case studies and success stories
    • Community
      Join our new community forum.
    • Agency Directory
      Browse our list of agency partners
    • About Us
      Learn more about Directus and the team
    • Wall of Love
      See what others are saying about us
    • Contact
      Have a general inquiry or question for us?
    • Support
      Reach out to Directus support
    Watch Directus TV
    Directus TV
    Video

    Directus TV

    Go down the rabbit hole with hours of original video content from our team.

  • Developers
  • Enterprise
  • Pricing
Book a DemoGet StartedLog In
GitHub logo33,618
Directus Marketplace
Marketplace
  1. Extensions
  2. Integrations
  3. Templates
Back to Extensions
operation

AI Transcription Operation

Use Deepgram's Speech Recognition API to generate transcripts from audio files.

AI Transcription Operation

Generate transcripts from audio files within Directus Flows with this custom operation, powered by Deepgram.

The AI Transcription operation, showing a Deepgram API Key field and a File URL field.

This operation contains the following configuration options:

Required

  • Deepgram API Key
  • File URL - public URL of the file to transcribe

Optional

  • Callback URL - process the transcript asynchronously by providing a callback URL. The Deepgram API will make a POST request to this url once transcription is completed.
  • Diarization - include speaker changes in the transcript.
  • Keywords - Uncommon proper nouns or other words to transcribe that are not a part of the model’s vocabulary. These follow this format keyword:intensifer.

If a Callback URL is NOT provided, the operation returns a JSON object containing the transcript, a breakdown of each word and timestamp, a paragraph-formatted transcript, and a breakdown of each paragraph and timestamp.

The output showing a JSON object containing a transcript and words.

You can use the operation to then save the result to a file description or item, or run further automation on the output.


If a Callback URL is provided, the operation does not wait for the transcript to finish generation. It immediately returns a JSON object containing the request_id provided by the Deepgram API and a message confirming successful submission.

The output showing a JSON object containing a confirmation message and request_id.

The request_id should be stored and used to identify the incoming transcript from Deepgram.

Once the transcript has been generated, the Deepgram API will make a POST request to the Callback URL you have provided. The request_id will be included in the metadata of that request.

Make sure that your Callback URL is configured to accept and handle the response from Deepgram's API. This could be another Directus Flow with an incoming Webhook Trigger or some other system.

Output

If no Callback URL is provided, the operation outputs a JSON object with the following structure:

{
    "confidence": 0.9995117,
    "transcript": "Hi. This is a long string with the full transcript. Complete with punctuation and capitalization."
    "words": [
        {
            "confidence": 0.9995117,
            "word": "word",
            "punctuated_word": "Hi.",
            "start": 0.08,
            "end": 0.3999998,
        }
    ],
    "paragraphs": {
        "transcript": "Hi.\n\nThis is a long string with the full transcript.\n\nComplete with punctuation and capitalization.",
        "paragraphs": [
            {
                "sentences": [
                    {
                        "start": 0.08,
                        "end": 0.3999998,
                        "text": "Hi."
                    }
                ],
                "num_words": 14,
                "start": 0.08,
                "end": 0.7075
            }
        ]
    }
}

If a Callback URL is provided, the operation outputs a JSON object with the following structure:

{
  "request_id": "42fc4c2b-09b3-4f5f-af04-3d1c9e9dc185",
  "message": "Transcription request submitted for processing."
}

Flow Setup

Automatically Transcribe New Files

Create a Flow with an Event Hook action trigger and a scope of files.upload. Use the AI Transciption operation, setting the File URL to https://your-directus-project-url/assets/{{ $trigger.key }}, being sure to provide your specific Directus Project URL.

This will work if your file is public, but if it isn't, you can append ?access_token=token to the File URL, replacing the value with a valid user token that has access to the file.

This operation will trigger on every new file upload, regardless of location or filetype. You may wish to add a conditional step between the trigger and transcription operation. The following condition rule will check that the file is audio:

{
    "$trigger": {
        "payload": {
            "type": {
                "_contains": "audio"
            }
        }
    }
}

Transcribe On-Demand

Create a Flow with a Manual trigger and select a collection with a file field, which should contain an audio file. Set the Location to Item Page Only. To receive the file ID create a Read Data operation for the same collection, setting IDs to {{ $trigger.body.keys[0] }}.

Use the AI Transciption operation, setting the File URL to https://your-directus-project-url/assets/{{ $last.file_field_name }}, being sure to provide your specific Directus Project URL and file field name.

@directus-labs/ai-transcription-operation

npm i @directus-labs/ai-transcription-operation
This extension is sandboxed and can be installed through the Directus Marketplace. Learn more.
Mike Elsmore
Mike Elsmore
Verified
1.1.0
Works with v10.10.0+
Updated 11 months ago
MIT License
9.2 KiB (4 files)
npm
  • Directus LogoDirectus Logo

    A composable backend to build your Headless CMS, BaaS, and more. 

  • Solutions
    • Headless CMS
    • Backend-as-a-Service
    • Product Information
    • 100+ Things to Build
  • Resources
    • Documentation
    • Guides
    • Community
    • Release Notes
  • Support
    • Issue Tracker
    • Feature Requests
    • Community Chat
    • Cloud Dashboard
  • Organization
    • About
    • Careers
    • Brand Assets
    • Contact
©2025 Monospace Inc
  • Cloud Policies
  • License
  • Terms
  • Privacy