视频处理在线字幕提取与SRT编辑工具 AI 提示词 (Prompts)

本篇只聚焦一个类别：视频处理。以下 Prompts 用于让 AI 直接产出可运行/可构建/可部署的在线字幕工具项目（包含完整代码、文件结构、运行命令、部署说明，以及测试用例或 QA checklist）。

在线视频/音频字幕提取与 SRT 导出工具

上传视频或音频，自动转写并生成可下载的 SRT/VTT，同时保留时间轴与置信度信息。

英文 Prompt:

You are a senior full-stack engineer. Build a production-ready web app named "Sub Extractor". Goal - Users upload a video (mp4/mov/webm) or audio (mp3/wav/m4a). - The app extracts audio, runs speech-to-text, and exports sub s. Hard requirements - Output: SRT and WebVTT downloads. - Provide: full source code, complete file tree, and step-by-step run commands. - Provide: Dockerfile + docker-compose for local deployment. - Provide: a minimal REST API (upload -> job -> result) with job status polling. - Provide: at least 8 test cases (unit/integration) OR a QA checklist with expected results. Tech constraints - Use Node.js 20 + Type . - Frontend: Next.js (App Router) + Tailwind. - Backend: Next.js route handlers OR separate Express service; choose one and justify. - Use FFmpeg (server-side) to extract audio; do not rely on system-specific paths. - For STT: implement a pluggable adapter interface; include a local "mock" provider and document how to switch to Whisper. UX - Drag-and-drop upload. - Show progress: upload %, processing steps, ETA. - Show a tran preview with timestamps. Deliverables 1) File tree 2) All code files 3) Commands: install, dev, test, build 4) Docker deployment instructions 5) Tests/QA checklist 6) Security notes (file size limits, MIME validation, temp file cleanup)

中文释义:

让 AI 生成一个可部署的字幕提取站点：支持上传媒体、服务端用 FFmpeg 抽取音频、可替换的转写适配器、任务队列与状态轮询，并输出 SRT/VTT；同时给出 Docker 部署与测试/QA 清单，确保真实可用。

在线 SRT 时间轴校对与批量替换编辑器

针对已有字幕文件进行可视化修正：时间轴拖拽微调、批量查找替换、合并/拆分字幕段落。

英文 Prompt:

Build a web- d SRT editor. Features - Import/export: SRT + WebVTT. - Parse, validate, and repair common issues: overlapping cues, invalid timestamps, missing indices. - Timeline editor: shift selected cues by +/- ms, stretch/compress segment, snap to grid. - Batch operations: find/replace, regex replace (safe mode), remove filler words list. - Undo/redo history. Tech stack - React + Type + Vite. - No backend needed for core editing; everything runs in-browser. - Provide an optional Node API to save/load projects. Deliverables - Full file tree + complete code. - Run commands. - A set of 10 example SRT fixtures (as strings in tests) that cover edge cases. - Unit tests for parser, serializer, and repair functions. UI requirements - Two-pane: left cue list, right cue editor. - Keyboard shortcuts: J/K navigation, Ctrl+F search, Ctrl+Z undo.

中文释义:

让 AI 产出一个前端为主的在线字幕编辑器：重点是 SRT/VTT 的解析、修复与批处理能力，配合时间轴微调与撤销重做；并用测试夹具覆盖重叠、断行、时间戳非法等常见坑。

在线字幕翻译与双语对齐工具（SRT -> 双语 SRT）

把原字幕翻译成目标语言，并生成双语字幕（同时间轴，单行/双行可切换）。

英文 Prompt:

Create a full-stack web tool: "Bilingual Sub Aligner". Input - Upload SRT. - Choose source language and target language. Processing - Translate each cue text using an LLM translation adapter (pluggable). - Preserve timestamps exactly. - Support two output modes: 1) Single cue with two lines (original + translated) 2) Two consecutive cues (original then translated) with same time window Quality controls - Glossary terms: user-defined list of forced translations. - Max line length: automatically reflow lines. - Detect and warn if translated text expands too much. Deliverables - Next.js + API routes + a simple job queue in-memory. - Full project code + file tree. - Commands + Docker. - Provide at least 12 QA checklist items (encoding, RTL languages, punctuation, emoji, line breaks, long sentences, SRT BOM, etc.). Do NOT - Do not generate images or suggest image generation prompts. - Keep it strictly sub text processing.

中文释义:

让 AI 输出一个双语字幕对齐站点：输入 SRT，调用可替换的翻译适配器生成译文，严格保留时间轴；同时提供术语表、换行控制、膨胀检测等质量开关，并给出 Docker 与 QA 校验点。

在线字幕一致性检查与术语纠错工具

用于团队字幕规范：统一人名/品牌名译法、标点风格、数字/单位格式，并生成可下载的修订版。

英文 Prompt:

Build a web app that audits sub consistency. Inputs - SRT upload - A "style guide" JSON (user editable): - preferred terms map - punctuation rules - number formatting - profanity filter (optional) Outputs - A report (HTML + JSON) listing issues by cue index/time. - A "fixed" SRT download. Implementation - Type library for parsing + rules engine. - UI to toggle rules and preview diffs per cue. - Add a diff viewer (before/after) with highlight. Deliverables - Monorepo with packages: parser, rules-engine, web. - Tests for at least 15 rule cases. - Explain how to add a new rule plugin.

中文释义:

让 AI 生成一个字幕质量审计工具：核心是“规则引擎 + 报告 + 可回写修订”，适合批量统一术语/标点/数字格式，并以差异高亮方式让编辑快速确认修改。

在线字幕与媒体时长校验工具（异常检测与修复建议）

检查字幕是否超出视频时长、是否存在空洞/重叠、是否出现过长显示时间，并给出一键修复建议。

英文 Prompt:

Create a tool that validates sub s against media duration. Inputs - Upload video file AND SRT Checks - Sub end time must not exceed video duration. - Detect gaps > N seconds (configurable). - Detect overlaps and extremely short cues. - Reading speed estimate (chars/sec) and flag too-fast cues. Fix suggestions - Shift all cues by offset. - Clamp cues to duration. - Auto-merge cues that are too short. Tech - Backend uses FFmpeg/ffprobe to read duration. - Frontend shows charts: timeline heatmap, warnings list. Deliverables - Full code, file tree, run commands. - Docker. - At least 10 tests for the validation logic and parsing. Note - Strictly text/video processing; no image generation features.

中文释义:

让 AI 产出一个“字幕 vs 视频”对账工具：用 ffprobe 获取时长，再做重叠/空洞/阅读速度等校验，提供偏移、截断、合并等修复建议；适合发布前批量做质量门禁。

在线字幕片段剪辑清单生成器（按关键词自动打点）

输入字幕与关键词集合，自动输出“精彩片段清单”（起止时间、命中关键词、上下文），便于后续剪辑或标注。

英文 Prompt:

Build a web tool: "Sub Highlight Finder". Inputs - SRT upload - Keywords list (comma-separated) + optional regex mode Output - A CSV + JSON timeline of highlight segments: - start/end time - matched keyword(s) - cue indices - context text Features - Merge nearby hits into a single segment if within X seconds. - Provide a preview player that seeks to the segment times (use an HTML5 video element). Deliverables - Full code (React + Node). - Example dataset and tests for keyword matching + segment merging. - Provide a CLI as well (node) to process files in batch.

中文释义:

让 AI 生成一个“字幕打点”在线工具：以 SRT 为输入，按关键词/正则自动合并相邻命中，输出 CSV/JSON 的片段清单，并提供可跳转预览；同时附带一个可批处理的 CLI，方便团队流水线使用。

在线字幕格式转换与编码修复工具（SRT/VTT/ASS）

解决跨平台字幕兼容性：转换格式、修复编码与换行、统一时间戳精度，并输出校验报告。

英文 Prompt:

Create a sub format converter web app. Supported - Input: SRT, WebVTT, ASS (subset) - Output: SRT and WebVTT (required), ASS (optional) Repair - Detect encoding issues (UTF-8 BOM, Windows-1252) and normalize to UTF-8. - Normalize line breaks (CRLF/LF). - Normalize timestamp precision. Deliverables - Full code + file tree. - Explain parsing limitations for ASS. - Provide 20 fixture samples and tests. - Provide a "lint" report output with warnings and suggested fixes.

中文释义:

让 AI 产出一个字幕格式转换站点：重点在“编码与格式修复 + 可解释的 lint 报告”，将输入统一到 UTF-8，并把常见兼容性问题（BOM、换行、时间戳精度）显式提示出来。

在线字幕项目打包与交付物生成器（文件树/命令/部署/QA 一键导出）

把字幕项目当作交付件：生成标准化项目结构、README、Docker、CI 配置与 QA 清单，便于团队交接。

英文 Prompt:

Build a web app that turns sub work into a reproducible deliverable package. Input - Upload one or more sub files (SRT/VTT) + optional media data JSON. Output - A downloadable zip containing: - /sub s originals and normalized versions - /reports lint report - /tools a small Node to validate timestamps - README with commands and environment requirements - Dockerfile + docker-compose - QA checklist markdown Requirements - The site should generate the zip on the server. - Must include a deterministic file tree. - Provide tests ensuring zip structure and deterministic ordering. Deliverables - Full code, file tree, commands, Docker. - Security constraints: zip bomb prevention, size limits, temp cleanup.

中文释义:

让 AI 生成一个“字幕交付包生成器”：上传字幕后自动产出规范化文件树、报告、校验脚本、README、Docker 与 QA 清单，并以 zip 形式一键下载；适合把一次字幕处理工作变成可复现、可交接的项目交付物。