Workflow

Create An Agentic Chat For Your Audio Files

Turn your audio recordings into an AI-powered chat. Transcribe MP3 files from Google Drive with speaker diarization, then ask questions and get answers from your audio content.

Needle Team

Last updated

February 4, 2026

Connectors used

assemblyai
google_drive

Tags

Audio TranscriptionAgentic ChatAssemblyAIGoogle DriveAI ChatSpeaker DiarizationKnowledge BaseRAG

Key Takeaways

  • Chat with any audio - Transcribe recordings and ask questions through AI chat
  • Speaker diarization - AI identifies who said what in multi-speaker recordings
  • Batch processing - Upload a folder of audio files and process them all at once
  • No coding required - Visual workflow builder with drag-and-drop setup
  • Multiple formats - Supports MP3, WAV, M4A, FLAC, and OGG

What This Workflow Does

This Needle workflow turns audio files stored in Google Drive into a searchable, chat-enabled knowledge base. It transcribes recordings with speaker labels, formats the output as markdown, and adds everything to a Needle collection for AI-powered Q&A.

Use cases:

  • Meeting archives - find decisions and action items from past meetings
  • Podcast research - ask questions across multiple podcast episodes
  • Interview analysis - query customer interviews for insights
  • Lecture notes - turn recorded lectures into an AI study resource
  • Sales calls - extract objections, questions, and opportunities

How It Works

StepWhat Happens
1. Connect Google Drive folderThe workflow lists all audio files in a specified Google Drive folder
2. Download and queueEach audio file is downloaded and its URL is prepared for transcription
3. Transcribe with AssemblyAIAudio is converted to text with speaker diarization, punctuation, and timestamps
4. Format as markdownTranscripts are converted to clean markdown with speaker labels, duration, word count, and confidence score
5. Add to Needle collectionFormatted transcripts are added to your Needle collection, enabling semantic search and AI chat

Supported Audio Formats

FormatExtension
MP3.mp3
WAV.wav
M4A.m4a
FLAC.flac
OGG.ogg

AssemblyAI Transcription Features

FeatureWhat It Does
Speech-to-TextConverts audio to text
Speaker DiarizationLabels who said what
PunctuationAdds periods and commas
TimestampsMarks time positions in the audio

Requirements

ToolCostPurpose
Needle AccountFreeWorkflow + RAG
Google DriveFreeAudio storage
AssemblyAIPay-per-minuteTranscription

Setup Instructions

  1. Add the workflow template to Needle
  2. Click the "List Files" node and connect your Google Drive account
  3. Paste your Google Drive folder URL in the instructions
  4. Sign up at AssemblyAI and get your API key
  5. Click the "Transcribe" node and connect with your AssemblyAI API key
  6. Click the "Add Files to Collection" node and choose your target Needle collection (or create a new one)

Customization

What You Can ChangeHow
Audio source folderUpdate the Google Drive folder URL in the "List Files" node
Target collectionSelect a different Needle collection in the "Add Files to Collection" node
Audio formats to processAdjust the file filter in the workflow to include or exclude specific formats
Transcript formattingModify the markdown formatting step to change the output structure

Tips for Better Results

TipWhy
Use clear recordingsBetter audio quality produces more accurate transcriptions
Name files descriptivelyDescriptive filenames make it easier to identify transcripts later
Batch by projectCreate separate collections for different topics or projects
Review first transcriptCheck accuracy before processing large batches

FAQ

Q: Does it work with multiple languages? A: AssemblyAI supports 100+ languages. The workflow uses automatic language detection.

Q: Can I transcribe video files? A: Yes, AssemblyAI extracts audio from video files. Supported formats include MP4, MOV, and AVI.

Q: What if a transcription fails? A: The workflow has error handling. Check the AssemblyAI dashboard for details on any failures.

Q: How many speakers can it identify? A: Speaker diarization works best with 2-10 speakers. More speakers may reduce accuracy.

Want to showcase your own workflows?

Become a Needle workflow partner and turn your expertise into recurring revenue.

Try Needle today

Streamline AI productivity at your company today

Join thousands of people who have transformed their workflows.

Agentic workflowsAutomations, meet AI agents
AI SearchAll your data, searchable
Chat widgetsDrop-in widget for your website
Developer APIMake your app talk to Needle
    Needle LogoNeedle
    Like many websites, we use cookies to enhance your experience, analyze site traffic and deliver personalized content while you are here. By clicking "Accept", you are giving us your consent to use cookies in this way. Read our more on our cookie policy .