What it is
An automated pipeline that takes audio or video input and produces formatted subtitles in 30+ languages using Oracle Cloud Infrastructure (OCI) AI services.
Why it matters
- Eliminates the manual transcription bottleneck.
- Enables rapid localization for global audiences.
- Reduces subtitle production cost compared with manual services.
- Maintains professional quality with accurate timestamps.
How it works
- Audio processing: FFmpeg extracts and preprocesses audio.
- Speech recognition: OCI Speech service transcribes the audio with timestamp alignment.
- Translation: OCI AI Language service translates the transcript with context awareness.
- Subtitle generation: Auto-formats the output to the SRT and VTT standards with proper cue timing.
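The subtitle generation step can be illustrated with a minimal sketch. It assumes the upstream steps yield segments with start and end times in seconds plus text; the Segment class and the function names are hypothetical and are not taken from the project code. The only format facts it relies on are that SRT numbers its cues and uses a comma before the milliseconds, while VTT starts with a WEBVTT header and uses a period.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """One transcribed phrase (hypothetical shape): times in seconds plus text."""
    start: float
    end: float
    text: str


def format_timestamp(seconds: float, separator: str) -> str:
    """Render seconds as HH:MM:SS<sep>mmm. SRT uses ',' and VTT uses '.'."""
    total_ms = round(seconds * 1000)
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, ms = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}{separator}{ms:03d}"


def to_srt(segments: list[Segment]) -> str:
    """Build an SRT document: numbered cues separated by blank lines."""
    cues = []
    for index, seg in enumerate(segments, start=1):
        start = format_timestamp(seg.start, ",")
        end = format_timestamp(seg.end, ",")
        cues.append(f"{index}\n{start} --> {end}\n{seg.text}\n")
    return "\n".join(cues)


def to_vtt(segments: list[Segment]) -> str:
    """Build a VTT document: a WEBVTT header line, then unnumbered cues."""
    cues = ["WEBVTT\n"]
    for seg in segments:
        start = format_timestamp(seg.start, ".")
        end = format_timestamp(seg.end, ".")
        cues.append(f"{start} --> {end}\n{seg.text}\n")
    return "\n".join(cues)
```

Calling to_srt or to_vtt on the same list of segments gives both output formats from one transcript, so only the timestamp separator and the header differ between them.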
Tech
- Language: Python
- Speech: OCI Speech service
- Translation: OCI AI Language
- Processing: FFmpeg
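As a rough sketch of how Python and FFmpeg might fit together for the audio step, the snippet below builds an FFmpeg command that drops the video stream and writes mono 16-bit PCM WAV audio. The 16 kHz sample rate, the WAV target, and the function names are assumptions made for illustration; the project's actual preprocessing settings are not stated here.

```python
import subprocess
from pathlib import Path


def build_extract_command(source: Path, target: Path, sample_rate: int = 16000) -> list[str]:
    """Return an ffmpeg argument list that extracts mono PCM audio.

    The sample rate default is an assumption, not a documented project setting.
    """
    return [
        "ffmpeg",
        "-y",                     # overwrite the target if it already exists
        "-i", str(source),        # input audio or video file
        "-vn",                    # ignore any video stream
        "-ac", "1",               # downmix to a single channel
        "-ar", str(sample_rate),  # resample to the requested rate
        "-c:a", "pcm_s16le",      # 16-bit little-endian PCM
        str(target),
    ]


def extract_audio(source: Path, target: Path) -> None:
    """Run ffmpeg and raise CalledProcessError if it exits with an error."""
    subprocess.run(build_extract_command(source, target), check=True)
```

Keeping the command construction separate from the subprocess call makes the arguments easy to inspect or test without FFmpeg installed.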
My role & links
- Built the end-to-end transcription and translation pipeline.
- Integrated OCI Speech and AI Language services.
- Authored technical documentation.
- Code: GitHub