I will add accurate subtitles, captions and SRT to your video
About this gig
I will add accurate, perfectly-timed subtitles, captions and SRT files to your video so every word lands clearly on screen, even with music, accents or background noise.
Whether you are publishing on YouTube, Instagram, TikTok, LinkedIn, a course platform or your own website, captions are no longer optional. The majority of social video is watched on mute, and accessibility expectations keep rising. I transcribe your audio by ear, clean it up into readable lines, time each cue frame-accurately, and hand you files that just work. No auto-generated mush, no random line breaks mid-sentence, no captions that flash by before anyone can read them.
What you get
- A clean, human-checked transcription of your spoken audio, punctuated and proofread for spelling and grammar
- Frame-accurate timing so each caption appears and disappears in sync with the speech
- An SRT file (and VTT on request) ready to upload to YouTube, Vimeo, Facebook, LinkedIn, or any standard player
- Optional burned-in (hardcoded) captions rendered directly onto the video, exported as an MP4 — ideal for TikTok, Reels and Shorts where uploaded caption files are not supported
- Sensible reading speed and line breaks (typically 1-2 lines, ~42 characters per line) so viewers can actually keep up
- Speaker labels for interviews, podcasts and panels when more than one person talks
- Consistent handling of names, brands, technical terms and acronyms — tell me the tricky ones and I will spell them correctly throughout
- A quick quality pass against your video before delivery, so nothing drifts out of sync at the end
Plans
| Feature | Basic | Standard | Premium |
|---|---|---|---|
| Video length covered | Up to short clip | Up to mid-length | Up to long-form |
| Transcription + proofreading | Yes | Yes | Yes |
| SRT file | Yes | Yes | Yes |
| VTT file | On request | Yes | Yes |
| Speaker labels | — | Yes | Yes |
| Burned-in captions (MP4 export) | — | Optional | Yes |
| Caption style / font / position tuning | — | Basic | Custom to your brand |
| Revisions | 1 | 2 | 3 |
| Delivery speed | Standard | Faster | Priority |
Lengths are indicative — send me your exact runtime and I will confirm the right tier before we start. Longer or multi-video projects are welcome; just message me first.
How it works
- You send the video and details. Share the file (or a private link) plus the spoken language, any names or jargon to get right, and where you plan to publish so I pick the correct format.
- I transcribe by ear. I listen through and write out what is actually said, fixing false starts and filler only as far as you want — verbatim or lightly cleaned, your call.
- I time the cues. Each line is anchored to the audio so captions track the speech, with comfortable in/out points and readable durations.
- I format and proof. Line lengths, breaks, punctuation and casing get a dedicated pass for readability, then a spell-check against your supplied terms.
- I render or export. You receive the SRT (plus VTT and/or a burned-in MP4 depending on your plan).
- You review, I refine. Check it against your video; I apply the revisions included in your plan until the captions read clean and stay in sync.
Why choose this
Auto-captions are a starting point, not a finished product. They mangle names, miss punctuation, ignore where one thought ends and the next begins, and break in awkward places that make viewers work harder than they should. I treat captions as part of the edit, not an afterthought.
Timing is where most subtitle work falls apart, and it is where I spend the most care. A caption that appears half a second late, lingers over the wrong shot, or crams three lines on screen at once quietly hurts retention. I keep cues tight, balanced and easy to read at a glance.
I am also realistic about scope. I will tell you up front what your audio allows. Crisp, clearly-spoken speech transcribes beautifully; heavy crosstalk, very thick background noise or inaudible passages get flagged honestly rather than guessed at and buried. You always know what you are getting.
Who it's for / use cases
- Creators and influencers adding captions to YouTube videos, Reels, Shorts and TikToks to boost watch time and mute-viewing
- Businesses and marketers captioning ads, explainers, testimonials and product demos for social and landing pages
- Course creators and educators making lessons and webinars accessible and skimmable
- Podcasters turning audio-with-video episodes into captioned clips for promotion
- Interviews, panels and documentaries that need clear speaker labels
- Teams meeting accessibility requirements (ADA / WCAG-style) who need compliant caption files
- Anyone localizing reach by giving non-native and hard-of-hearing viewers a reliable on-screen transcript
FAQ
Q: What is the difference between an SRT file and burned-in captions? An SRT is a separate text file you upload alongside your video; the viewer (or platform) can toggle it on or off, and you keep an editable version. Burned-in captions are rendered permanently onto the video pixels, so they always show — best for platforms that do not accept caption files, like Reels, Shorts and TikTok.
Q: Which file formats do you deliver? SRT by default, VTT on request, and a captioned MP4 if you want burned-in captions. Tell me your platform and I will deliver exactly what it expects.
Q: Do you transcribe by hand or use automatic captions? I transcribe and check by ear. I may use software to speed up drafting, but every line is reviewed, corrected and timed by a human before delivery — that is the whole point of paying for captions.
Q: Can you handle accents, technical terms or brand names? Yes. Clear accents are no problem. Send me a short list of names, product names, acronyms and jargon and I will spell them consistently throughout.
Q: This is a captioning service in the same language as the audio, right? Do you translate? Correct — this service is same-language transcription, timing and captioning. It is not a translation/foreign-subtitle service. If you need translated subtitles, message me first so we can scope it properly.
Q: What do you need from me to start? The video file or a private download link, the spoken language, your target platform, whether you want SRT only or burned-in captions, and any names or terms to get right. Verbatim or lightly-cleaned style is your choice.
Q: How accurate will it be? With clear audio, expect a polished, publication-ready result. Where audio is genuinely inaudible or buried under noise, I mark those moments honestly rather than inventing words, and I will tell you in advance if a section is at risk.
Q: What if a caption is mistimed or a word is wrong? That is what the included revisions are for. Point out anything off and I will correct the text or nudge the timing until it reads clean and stays in sync with your video.
Reviews★4.5(6)
- @kailabs★★★★★5
Sent over a long interview and got back perfectly timed subtitles with proper line breaks throughout. Will be back for the rest of the episodes.
- @irisj★★★★★5
Honestly impressed how well it handled my accent, nearly every word was spot on and the SRT loaded instantly on YouTube.
- @noracodes★★★★★4
Accurate captions and the SRT was formatted right, just had two small typos I cleaned up myself. Still really solid work overall.
- @hana99★★★★★5
Captions were dead accurate, even caught a couple of mumbled words I assumed would just get skipped. Burned-in version looked clean against my footage too.
- @mintmind★★★★★5
The subtitles synced perfectly with every line and the SRT dropped straight into my editor without any timing fixes needed. Could not be happier.
- @mayaj★★★★★3
Captions were accurate enough but I had to ask for a fix where two people talked over each other. Got sorted in the end so no real complaints.