Easily build production-ready

Viral Clip Finder

in days not months

Create powerful media processing pipelines with just one API call. Our open-source platform makes it simple to build and scale your workflows.

What is Mediatoad?

Mediatoad is a open-source, well-structured, enterprise-grade media processing platform that emphasizes developer experience, scalability, and maintainability while providing powerful media processing capabilities through a unified API.

npm install @mediatoad/sdk

*av-processing

Leverage the power of FFmpeg for robust audio and video processing tasks, orchestrated reliably by MediaToad.
1const mt = new MediaToad();
2
3const job = await mt.createJob({
4 assets: [{ name: "input_vid", url: "s3://bucket/video.mp4" }],
5 tasks: [
6 { operation: "encode", asset: "input_vid", preset: "mp4:720p", outputAsset: "vid_720p" },
7 { operation: "encode", asset: "input_vid", preset: "mp3:192k", outputAsset: "audio_192k" },
8 { operation: "thumbnails", asset: "input_vid", timestamps: [10, 30, 60] },
9 { operation: "clip", asset: "input_vid", start: 5, end: 20, outputAsset: "short_clip" }
10 ],
11 storage: { bucket: "output-bucket", base: "processed/" },
12 notify: { url: "https://your-webhook.com/notify" }
13});
Comprehensive FFmpeg Support
Handle encoding, transcoding, thumbnails, clipping, merging, and scene detection.
Couple with other AI modules
Integrate with specialized AI modules to enhance your media processing pipeline.
/transcriptionunified API for transcripts
/segmentsfind specific parts of audio and video for processing
Scalable Workers
Run FFmpeg workers on any infrastructure, from local machines to cloud clusters.
Simplified API
Define complex FFmpeg operations through simple JSON configurations.

~doc-processing

Process large documents like PDFs reliably. Extract text, generate summaries, or create audiobooks.
1// Conceptual: Process a PDF to extract text and generate audio
2const mt = new MediaToad();
3
4const job = await mt.createJob({
5 assets: [{ name: "report", url: "s3://docs/report.pdf" }],
6 tasks: [
7 // Extract text (assuming a future 'extract-text' operation)
8 { operation: "extract-text", asset: "report", outputAsset: "report_text" },
9 // Generate speech from the extracted text
10 { operation: "tts", asset: "report_text", provider: "elevenlabs", outputAsset: "report_audio" }
11 ],
12 storage: { bucket: "processed-docs" }
13});
14
15console.log("Document Processing Job ID:", job.jobId);
Durable PDF Processing
Handle large or complex PDFs without timeouts using durable workflows.
Text Extraction & Analysis
Extract text content and integrate with LLMs for summarization or analysis.
Audiobook Generation
Combine text extraction with Text-to-Speech (TTS) tasks to create audio versions.
Scalable Architecture
Process numerous documents in parallel across distributed workers.

/workflows

Build complex, multi-step media pipelines that run reliably, powered by Temporal's durable execution.
1// Define a multi-step workflow for finding viral clips
2const mt = new MediaToad();
3
4const job = await mt.createJob({
5 assets: [{ name: "youtube_vid", url: "youtube.com/watch?v=..." }],
6 tasks: [
7 { operation: "download", asset: "youtube_vid", outputAsset: "local_vid" },
8 { operation: "encode", asset: "local_vid", preset: "mp3:128k", outputAsset: "local_audio" },
9 { operation: "transcription", asset: "local_audio", provider: "deepgram", outputAsset: "transcript" },
10 // Use LLM to analyze transcript and find engaging segments
11 { operation: "segmentation", asset: "transcript", clipLength: 45, model: "gemini-1.5-flash", outputAsset: "viral_segments" },
12 // Create clips based on the identified segments
13 { operation: "clip", asset: "local_vid", segmentsAsset: "viral_segments", batchClip: true }
14 ],
15 storage: { bucket: "output-clips", base: "viral/" },
16 notify: { url: "https://your-webhook.com/notify" }
17});
18
19console.log("Viral Clips Job ID:", job.jobId);
Fault Tolerant
Workflows automatically resume from failures, with configurable retries.
Stateful & Observable
Track progress, inspect state, and query workflow status at any time.
Parallel & Sequential Execution
Define task dependencies and let MediaToad optimize for parallel execution.
Run Anywhere
Deploy workers to your preferred environment - cloud, on-prem, or local.

^programmatic-video

[Coming Soon] Generate dynamic videos from templates and JSON data, similar to tools like Synthesia.
1// Conceptual: Generate video from JSON data (Coming Soon)
2const mt = new MediaToad();
3
4const job = await mt.createJob({
5 tasks: [
6 {
7 operation: "generate-video", // Future operation type
8 templateId: "marketing-template-v1",
9 data: {
10 title: "New Product Launch!",
11 logoUrl: "s3://bucket/logo.png",
12 scenes: [
13 { type: "intro", duration: 5 },
14 { type: "feature", text: "Feature 1...", image: "s3://assets/feat1.jpg" },
15 { type: "outro", text: "Visit mediatoad.com" }
16 ]
17 },
18 outputAsset: "generated_promo.mp4"
19 }
20 ],
21 storage: { bucket: "generated-videos" }
22});
23
24console.log("Video Generation Job ID:", job.jobId);
Template-Based Generation
Create videos using predefined templates and dynamic data inputs.
API Driven
Programmatically generate personalized videos at scale via API.
Asset Integration
Combine generated elements with existing images, videos, and audio.
Use Cases
Ideal for personalized marketing, automated onboarding videos, dynamic presentations.

UI Components for Advanced Media Workflows

Supercharge your AI applications with ready-to-use UI components

Media Editor SDK

Drop-in components for audio/video editing with AI-assisted transformations

  • AI-powered effects and filters
  • Non-destructive editing pipeline
  • Frame-accurate trimming and cutting
@mediatoad/media-editor

Text-Based Media Editor

Edit media by simply describing changes in natural language

  • Transcript editor
  • Regenerate video, images and audio
  • Content-aware editing suggestions
@mediatoad/text-editor
npm install @mediatoad/ui-components← Get started in minutes

Powerful Media Processing Features

MediaToad provides a comprehensive suite of tools for handling all your media processing needs.

Workflow Orchestration

Powered by Temporal for fault-tolerant job execution with automatic retries

  • Complex multi-step operations
  • State tracking and resumption
  • Scalable processing architecture

Media Processing Engine

Built on FFmpeg with an intuitive API layer

  • Video/audio encoding and transformations
  • Scene detection and segmentation
  • Thumbnail generation and metadata extraction

Cloud-Agnostic Storage

Works with AWS S3, Azure Blob Storage, MinIO, and more

  • No vendor lock-in
  • Optimized for large-file handling
  • Seamless media uploads/downloads

AI & Automation

Integrated AI-driven video analysis and processing

  • Speech-to-text transcription
  • Object/person detection
  • Content moderation

Job Tracking & Notifications

Real-time monitoring and integration capabilities

  • Webhook notifications
  • Server-Sent Events (SSE)
  • Automatic retries and failure handling

Developer-Friendly API

Simple JSON-based job definitions

  • Intuitive job configuration
  • Comprehensive documentation
  • Flexible integration options
Global Infrastructure

Deploy and scale your media processing workflows across the globe with ease

Roadmap & Changelog

Upcoming Features

Text-to-Speech (TTS)

in-progress

AI-driven voice synthesis from transcripts

Expected: Q2 2025

Programmatic Video Generation

planned

Automated creation of media content from templates

Expected: Q2 2025

Advanced Face/Object Detection

planned

Enhanced video analytics powered by AI

Expected: Q2 2025

Real-time Processing

planned

Support for live streaming and real-time analysis

Expected: Q2 2025

Recent Updates

v1.2.0

Mar 15, 2024
  • Added support for WebM format
  • Improved transcoding speed by 40%
  • New REST API endpoints

v1.1.0

Mar 1, 2024
  • Introduced batch processing
  • Fixed memory leaks in long operations
  • Updated documentation

How MediaToad Works

A simple, powerful workflow for all your media processing needs.

1

Submit via API

Define your media operations with a simple JSON request

2

Temporal Orchestration

Workflows are managed with retries, state tracking, and scalable processing

3

Media Processing

FFmpeg and AI services handle encoding, transcription, and analysis

4

Storage & Delivery

Processed files are stored in your specified cloud backend

5

Notifications

Receive webhook/SSE notifications when jobs complete

6

Analytics & Insights

Track performance metrics and monitor workflow efficiency

Who Benefits from MediaToad?

MediaToad serves a wide range of users with diverse media processing needs.

For LLM Providers

Streamline your media processing pipeline for production

LLM providers can leverage MediaToad to efficiently process and analyze large volumes of media content, enabling better training data preparation and content moderation.

  • Scale media processing for training data
  • Automate content moderation workflows
  • Integrate with existing AI systems

For Developers

Enhance developer experience with simplified media handling

Developers can focus on building great applications without worrying about the complexities of media processing, storage, and workflow management.

  • Intuitive API for complex media operations
  • Reliable, fault-tolerant processing
  • Flexible integration with existing systems

For Content Platforms

Scale your media infrastructure reliably

Content platforms can handle growing media libraries with a scalable, reliable solution that adapts to changing requirements and traffic patterns.

  • Process user-generated content at scale
  • Automate thumbnail and preview generation
  • Implement content analysis and moderation

For Enterprises

Avoid vendor lock-in with a flexible solution

Enterprises can maintain control over their media processing infrastructure while avoiding the limitations and costs of proprietary solutions.

  • Cloud-agnostic architecture
  • Customizable to specific business needs
  • Integrate with existing enterprise systems

Ready to Transform Your Media Processing?

Get started with MediaToad today and experience the power of open-source media processing at scale.