Easily build production-ready
Viral Clip Finder
in days not months
Create powerful media processing pipelines with just one API call. Our open-source platform makes it simple to build and scale your workflows.
What is MediaToad?
MediaToad is an open-source, enterprise-grade media processing platform that emphasizes developer experience, scalability, and maintainability, and exposes its media processing capabilities through a unified API.
npm install @mediatoad/sdk
av-processing
const mt = new MediaToad();

const job = await mt.createJob({
  assets: [{ name: "input_vid", url: "s3://bucket/video.mp4" }],
  tasks: [
    { operation: "encode", asset: "input_vid", preset: "mp4:720p", outputAsset: "vid_720p" },
    { operation: "encode", asset: "input_vid", preset: "mp3:192k", outputAsset: "audio_192k" },
    { operation: "thumbnails", asset: "input_vid", timestamps: [10, 30, 60] },
    { operation: "clip", asset: "input_vid", start: 5, end: 20, outputAsset: "short_clip" }
  ],
  storage: { bucket: "output-bucket", base: "processed/" },
  notify: { url: "https://your-webhook.com/notify" }
});
doc-processing
// Conceptual: Process a PDF to extract text and generate audio
const mt = new MediaToad();

const job = await mt.createJob({
  assets: [{ name: "report", url: "s3://docs/report.pdf" }],
  tasks: [
    // Extract text (assuming a future 'extract-text' operation)
    { operation: "extract-text", asset: "report", outputAsset: "report_text" },
    // Generate speech from the extracted text
    { operation: "tts", asset: "report_text", provider: "elevenlabs", outputAsset: "report_audio" }
  ],
  storage: { bucket: "processed-docs" }
});

console.log("Document Processing Job ID:", job.jobId);
workflows
// Define a multi-step workflow for finding viral clips
const mt = new MediaToad();

const job = await mt.createJob({
  assets: [{ name: "youtube_vid", url: "youtube.com/watch?v=..." }],
  tasks: [
    { operation: "download", asset: "youtube_vid", outputAsset: "local_vid" },
    { operation: "encode", asset: "local_vid", preset: "mp3:128k", outputAsset: "local_audio" },
    { operation: "transcription", asset: "local_audio", provider: "deepgram", outputAsset: "transcript" },
    // Use LLM to analyze transcript and find engaging segments
    { operation: "segmentation", asset: "transcript", clipLength: 45, model: "gemini-1.5-flash", outputAsset: "viral_segments" },
    // Create clips based on the identified segments
    { operation: "clip", asset: "local_vid", segmentsAsset: "viral_segments", batchClip: true }
  ],
  storage: { bucket: "output-clips", base: "viral/" },
  notify: { url: "https://your-webhook.com/notify" }
});

console.log("Viral Clips Job ID:", job.jobId);
programmatic-video
// Conceptual: Generate video from JSON data (Coming Soon)
const mt = new MediaToad();

const job = await mt.createJob({
  tasks: [
    {
      operation: "generate-video", // Future operation type
      templateId: "marketing-template-v1",
      data: {
        title: "New Product Launch!",
        logoUrl: "s3://bucket/logo.png",
        scenes: [
          { type: "intro", duration: 5 },
          { type: "feature", text: "Feature 1...", image: "s3://assets/feat1.jpg" },
          { type: "outro", text: "Visit mediatoad.com" }
        ]
      },
      outputAsset: "generated_promo.mp4"
    }
  ],
  storage: { bucket: "generated-videos" }
});

console.log("Video Generation Job ID:", job.jobId);
UI Components for Advanced Media Workflows
Supercharge your AI applications with ready-to-use UI components
Media Editor SDK
Drop-in components for audio/video editing with AI-assisted transformations
- AI-powered effects and filters
- Non-destructive editing pipeline
- Frame-accurate trimming and cutting
@mediatoad/media-editor
Text-Based Media Editor
Edit media by simply describing changes in natural language
- Transcript editor
- Regenerate video, images, and audio
- Content-aware editing suggestions
@mediatoad/text-editor
npm install @mediatoad/ui-components
Get started in minutes
Powerful Media Processing Features
MediaToad provides a comprehensive suite of tools for handling all your media processing needs.
Workflow Orchestration
Powered by Temporal for fault-tolerant job execution with automatic retries
- Complex multi-step operations
- State tracking and resumption
- Scalable processing architecture
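To make "automatic retries" concrete, here is a minimal sketch of retrying a failing step with exponential backoff. It is a generic illustration and not Temporal's or MediaToad's actual API: `RetryOptions`, `backoffDelays`, and `withRetries` are hypothetical names chosen for this example.

```typescript
// Illustrative only: a generic retry helper, not Temporal's or MediaToad's API.
interface RetryOptions {
  maxAttempts: number;    // total attempts, including the first one
  initialDelayMs: number; // wait before the first retry
  backoffFactor: number;  // multiplier applied to the delay after each failure
}

// Delay before each retry: initialDelayMs * backoffFactor^retryIndex.
function backoffDelays(opts: RetryOptions): number[] {
  const delays: number[] = [];
  for (let i = 0; i < opts.maxAttempts - 1; i++) {
    delays.push(opts.initialDelayMs * Math.pow(opts.backoffFactor, i));
  }
  return delays;
}

// Runs `step` until it succeeds or the attempts are used up,
// then rethrows the last error.
async function withRetries<T>(
  step: () => Promise<T>,
  opts: RetryOptions
): Promise<T> {
  const delays = backoffDelays(opts);
  let lastError: unknown;
  for (let attempt = 0; attempt < opts.maxAttempts; attempt++) {
    try {
      return await step();
    } catch (err) {
      lastError = err;
      if (attempt < delays.length) {
        const waitMs = delays[attempt];
        await new Promise<void>((resolve) => setTimeout(resolve, waitMs));
      }
    }
  }
  throw lastError;
}
```

In a durable workflow engine the retry state is persisted by the orchestrator instead of living in process memory as it does here, which is what makes resumption after a crash possible.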
Media Processing Engine
Built on FFmpeg with an intuitive API layer
- Video/audio encoding and transformations
- Scene detection and segmentation
- Thumbnail generation and metadata extraction
Cloud-Agnostic Storage
Works with AWS S3, Azure Blob Storage, MinIO, and more
- No vendor lock-in
- Optimized for large-file handling
- Seamless media uploads/downloads
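The job examples reference inputs with URLs such as `s3://bucket/video.mp4` and write outputs under a `storage.base` prefix. As a rough sketch of how such locations could be handled uniformly across backends, the helpers below split a storage URL into its parts and build an output key. `StorageLocation`, `parseStorageUrl`, and `outputKey` are hypothetical names, and the output naming shown is an assumption, not documented MediaToad behavior.

```typescript
// Illustrative only: hypothetical helpers, not part of the MediaToad SDK.
interface StorageLocation {
  scheme: string; // e.g. "s3"
  bucket: string;
  key: string;    // object path inside the bucket, without a leading slash
}

// Splits "scheme://bucket/key" into its three parts.
function parseStorageUrl(url: string): StorageLocation {
  const match = /^([a-z][a-z0-9+.-]*):\/\/([^/]+)\/(.+)$/i.exec(url);
  if (!match) {
    throw new Error(`Not a storage URL of the form scheme://bucket/key: ${url}`);
  }
  return { scheme: match[1].toLowerCase(), bucket: match[2], key: match[3] };
}

// Joins a base prefix (like the job's storage.base) with a file name,
// inserting a slash only when the prefix needs one.
function outputKey(base: string, fileName: string): string {
  const prefix = base === "" || base.endsWith("/") ? base : base + "/";
  return prefix + fileName;
}
```

For instance, `parseStorageUrl("s3://bucket/video.mp4")` yields the scheme `s3`, the bucket `bucket`, and the key `video.mp4`.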
AI & Automation
Integrated AI-driven video analysis and processing
- Speech-to-text transcription
- Object/person detection
- Content moderation
Job Tracking & Notifications
Real-time monitoring and integration capabilities
- Webhook notifications
- Server-Sent Events (SSE)
- Automatic retries and failure handling
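For readers who have not consumed Server-Sent Events before, the sketch below parses a raw SSE text stream into events, following the standard `event:` / `data:` line format. The parser itself is generic. The event name `job.completed` and the payload fields used in the usage note are assumptions, since the page does not document MediaToad's event schema.

```typescript
// Illustrative only: a minimal parser for the standard SSE wire format.
interface SseEvent {
  event: string; // defaults to "message" when no event: field is sent
  data: string;  // multiple data: lines are joined with "\n"
}

function parseSse(text: string): SseEvent[] {
  const events: SseEvent[] = [];
  let eventName = "message";
  let dataLines: string[] = [];
  for (const line of text.split(/\r\n|\n|\r/)) {
    if (line === "") {
      // A blank line dispatches the buffered event, if it carried any data.
      if (dataLines.length > 0) {
        events.push({ event: eventName, data: dataLines.join("\n") });
      }
      eventName = "message";
      dataLines = [];
    } else if (line.startsWith(":")) {
      continue; // comment line, often used as a keep-alive
    } else {
      const colon = line.indexOf(":");
      const field = colon === -1 ? line : line.slice(0, colon);
      let value = colon === -1 ? "" : line.slice(colon + 1);
      if (value.startsWith(" ")) value = value.slice(1);
      if (field === "event") eventName = value;
      else if (field === "data") dataLines.push(value);
    }
  }
  return events;
}
```

A stream containing `event: job.completed` followed by `data: {"jobId":"abc","status":"completed"}` and a blank line would produce one event whose `data` can be passed to `JSON.parse`. In a browser, the built-in `EventSource` does this parsing for you.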
Developer-Friendly API
Simple JSON-based job definitions
- Intuitive job configuration
- Comprehensive documentation
- Flexible integration options
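Because jobs are plain JSON, they can be type-checked and validated before submission. The sketch below models the shape used in the examples above (`assets`, `tasks`, `asset`, `outputAsset`) and checks that every task reads an asset that already exists. The type names and `findUnresolvedAssets` are hypothetical, the SDK may ship its own types, and the check assumes tasks may only reference job assets or the outputs of earlier tasks. It only inspects the `asset` field, not other references such as `segmentsAsset`.

```typescript
// Illustrative only: hypothetical types inferred from the examples on this page.
interface Asset {
  name: string;
  url: string;
}

interface Task {
  operation: string;
  asset?: string;       // a job asset name or an earlier task's outputAsset
  outputAsset?: string; // name under which this task's result is registered
  [option: string]: unknown; // operation-specific options (preset, timestamps, ...)
}

interface JobDefinition {
  assets?: Asset[];
  tasks: Task[];
}

// Returns one message per task whose `asset` is not defined before it is used.
function findUnresolvedAssets(job: JobDefinition): string[] {
  const known = new Set((job.assets ?? []).map((a) => a.name));
  const problems: string[] = [];
  job.tasks.forEach((task, index) => {
    if (task.asset !== undefined && !known.has(task.asset)) {
      problems.push(
        `task ${index} (${task.operation}) references unknown asset "${task.asset}"`
      );
    }
    if (task.outputAsset !== undefined) known.add(task.outputAsset);
  });
  return problems;
}
```

Running a check like this locally catches a misspelled asset name before the job is ever queued.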
Roadmap & Changelog
Upcoming Features
Text-to-Speech (TTS)
in-progress: AI-driven voice synthesis from transcripts
Programmatic Video Generation
planned: Automated creation of media content from templates
Advanced Face/Object Detection
planned: Enhanced video analytics powered by AI
Real-time Processing
planned: Support for live streaming and real-time analysis
Recent Updates
v1.2.0
- Added support for WebM format
- Improved transcoding speed by 40%
- New REST API endpoints
v1.1.0
- Introduced batch processing
- Fixed memory leaks in long operations
- Updated documentation
How MediaToad Works
A simple, powerful workflow for all your media processing needs.
Submit via API
Define your media operations with a simple JSON request
Temporal Orchestration
Workflows are managed with retries, state tracking, and scalable processing
Media Processing
FFmpeg and AI services handle encoding, transcription, and analysis
Storage & Delivery
Processed files are stored in your specified cloud backend
Notifications
Receive webhook/SSE notifications when jobs complete
Analytics & Insights
Track performance metrics and monitor workflow efficiency
Who Benefits from MediaToad?
MediaToad serves a wide range of users with diverse media processing needs.
For LLM Providers
Streamline your media processing pipeline for production
LLM providers can leverage MediaToad to efficiently process and analyze large volumes of media content, enabling better training data preparation and content moderation.
- Scale media processing for training data
- Automate content moderation workflows
- Integrate with existing AI systems
For Developers
Enhance developer experience with simplified media handling
Developers can focus on building great applications without worrying about the complexities of media processing, storage, and workflow management.
- Intuitive API for complex media operations
- Reliable, fault-tolerant processing
- Flexible integration with existing systems
For Content Platforms
Scale your media infrastructure reliably
Content platforms can handle growing media libraries with a scalable, reliable solution that adapts to changing requirements and traffic patterns.
- Process user-generated content at scale
- Automate thumbnail and preview generation
- Implement content analysis and moderation
For Enterprises
Avoid vendor lock-in with a flexible solution
Enterprises can maintain control over their media processing infrastructure while avoiding the limitations and costs of proprietary solutions.
- Cloud-agnostic architecture
- Customizable to specific business needs
- Integrate with existing enterprise systems
Ready to Transform Your Media Processing?
Get started with MediaToad today and experience the power of open-source media processing at scale.