音频处理 播客响度标准化与静音剪切工作台 AI 提示词 (Prompts)

这篇文章只围绕一个类别:音频处理。下面每条 Prompt 都要求 AI 直接产出一个可运行/可构建/可部署的在线工具(完整项目代码 + 文件结构 + 运行命令 + 部署说明 + 测试用例/QA checklist)。主题聚焦:播客/访谈音频的 响度标准化(LUFS)静音段检测剪切(静音剪切=删除或标记现有音频中的静音区间,不是生成图片)。

浏览器端:FFmpeg.wasm + WebAudio 的响度标准化与静音剪切工具

适合无需上传到服务器、强调隐私的本地处理场景。

英文 Prompt:

Build a production-ready web app that runs fully in the browser to: - Detect silence segments in an audio file (threshold + min duration). - Normalize loudness to a target LUFS (e.g., -16 LUFS for podcasts) using EBU R128. - Export WAV and MP3. Tech stack: - Vite + React + Type - ffmpeg.wasm (in-browser) - WebAudio API for preview Hard requirements: 1) Output a complete project with file tree and all source code. 2) Provide exact install/run commands. 3) Provide a clear UI spec: upload, waveform/scrub preview, parameters, job progress, download results. 4) Provide an offline-friendly mode (service worker optional) and explain limitations. 5) Provide at least 8 QA checks + 5 automated tests (Vitest) covering: silence detection accuracy, LUFS target, export integrity, edge cases (short clips, stereo/mono, VBR mp3). 6) Include a README with deployment steps (static hosting) and troubleshooting. Do NOT generate images. Do NOT mention Midjourney/Stable Diffusion.

中文释义: 让 AI 直接交付一个“纯前端”的音频处理工具项目,包含完整代码、运行命令、部署与测试,能做静音段检测剪切与 LUFS 标准化并导出。

服务端队列:Next.js + BullMQ + FFmpeg 的批量处理工作台

适合需要同时处理多文件、并发与任务监控的团队场景。

英文 Prompt:

Create an online batch audio processing tool with: - Multi-file upload (drag & drop), per-file status, retry, cancel. - Pipeline: analyze loudness -> normalize to target LUFS -> silence detect -> trim/keep with configurable rules -> export. - Job queue + worker. Tech stack: - Next.js (App Router) + Type - Node.js worker using ffmpeg (native) + fluent-ffmpeg - BullMQ + Redis for queue - SQLite (or Postgres) for job data Deliverables: - Full repo code + file tree. - Docker Compose for app + redis. - CLI s for local dev. - API routes (upload, create job, job status, download). - Security: rate limit, file size limits, safe temp storage cleanup. - Minimum 10-item QA checklist + at least 6 automated tests (Jest) including: queue behavior, parameter validation, download auth, cleanup. No image generation prompts.

中文释义: 让 AI 输出一个带队列/Worker 的在线“音频批处理工作台”,重点是可运行、可部署、可测试,以及任务可视化与安全限制。

Python 后端:FastAPI + Celery 的响度分析与标准化 API + Web UI

适合想用 Python 生态快速扩展分析报告的场景。

英文 Prompt:

Design and implement a web tool that provides: - Upload audio -> compute loudness metrics (integrated LUFS, true peak, loudness range). - Normalize audio to target LUFS with limiter. - Silence detection with a report: segments list (start/end), total silence duration. - Export processed audio and JSON report. Tech stack: - FastAPI + Python 3.12 - Celery + Redis - FFmpeg + ffprobe - Simple frontend (HTMX or React) with job polling Must include: - Full source code and file tree. - Commands to run locally and with Docker. - Data schema for job tracking (SQLite ok). - At least 5 test cases (pytest) + a QA checklist. - Clear deployment guide (Docker) and security notes (upload validation, path traversal prevention, temp cleanup). No image generation content.

中文释义: 让 AI 产出“音频响度分析 + 标准化 + 静音报告”的在线工具,强调 API + UI、可部署与测试。

参数可解释:面向播客的预设与一键处理(-16 LUFS / -19 LUFS)

适合非技术用户,降低参数门槛,提供明确预设。

英文 Prompt:

Build a web app focused on podcast presets: - Presets: Podcast stereo -16 LUFS, Podcast mono -19 LUFS, Audiobook -18 LUFS. - Advanced settings hidden behind a toggle (silence threshold, min silence, padding). - Before/after A/B preview (30s snippet) and a final export. Stack: - Remix or Next.js + Type - Backend FFmpeg processing Requirements: - Provide full code + file tree. - Provide exact run commands. - Include UX copy (Chinese UI ok) explaining what LUFS means and when to pick each preset. - Add at least 8 QA checks and 5 automated tests. - Add a "dry run" mode that outputs only the analysis report without modifying audio. No image generation.

中文释义: 让 AI 交付一个带“播客预设”的在线工具,提供一键 LUFS 标准化 + 静音剪切,并支持干跑输出分析报告。

静音剪切策略:删除、保留、或只做标记(生成时间轴报告)

适合编辑工作流:先做“报告/标记”,再决定是否剪切。

英文 Prompt:

Create an online tool that supports 3 silence handling modes: 1) Remove silence segments. 2) Keep silence but lower volume (ducking). 3) Keep audio unchanged and generate an edit decision list (EDL/JSON) with silence segments. Include: - A timeline UI with segments list and timestamps. - Export JSON + CSV reports. - FFmpeg commands that implement each mode. Deliver: - Full project code, file tree, commands. - Docker deployment. - Minimum 10-item QA checklist + at least 6 automated tests. No image generation instructions.

中文释义: 让 AI 输出一个“静音处理策略可选”的在线工具,既能剪切也能只生成时间轴报告,便于接入剪辑流程。

批量命名与打包:处理结果自动打包 ZIP 并附带报告

适合一次处理多集播客或多段采访录音的交付场景。

英文 Prompt:

Build a batch processing web tool: - Upload multiple files. - Configure naming template: {original}-{lufs}-{mode}.mp3 - After processing, generate a ZIP containing all outputs plus per-file JSON reports. Stack: - Node.js backend + FFmpeg - Frontend with progress per file Must include: - Full code + file tree. - Commands to run. - Storage strategy (local disk temp) with automatic cleanup. - Security checks (file type validation, size limits). - At least 5 automated tests + 10 QA checklist items. No image generation.

中文释义: 让 AI 交付一个面向“批量交付”的在线工具:批量处理、统一命名规则、最终打包 ZIP,并输出报告与测试。

导出格式:WAV/MP3/AAC 互转 + 元数据保留(标题/作者/封面不处理)

适合需要标准化输出格式与元数据的发布流程。

英文 Prompt:

Create a web app that processes audio and exports to WAV/MP3/AAC. - Preserve basic audio data tags where possible ( /artist/album). - Do NOT generate or modify cover images; ignore artwork fields if present. - Provide a clear explanation of what data is preserved per format. Include: - Full code + file tree. - Run/deploy commands. - QA checklist (at least 10 items) and automated tests (at least 5). - Edge cases: VBR MP3, different sample rates, mono/stereo conversions, clipping prevention.

中文释义: 让 AI 输出一个“格式转换 + 响度/静音处理”的在线工具,并明确元数据处理边界(不做封面图相关操作)。

可测试交付:附带 Playwright 端到端用例与可复现测试音频生成脚本

适合团队协作与持续集成,保证工具行为稳定可回归。

英文 Prompt:

Enhance the web tool by adding a robust test suite: - Provide Playwright E2E tests that cover: upload, parameter changes, job creation, progress UI, download. - Provide s to generate deterministic test audio (tones + silence) using FFmpeg so tests are reproducible. Requirements: - Full code additions with file tree updates. - CI workflow (GitHub Actions) running unit + e2e tests. - At least 8 QA checklist items. No image generation prompts.

中文释义: 让 AI 给在线工具补齐自动化测试与可复现测试音频生成脚本,便于 CI 跑回归。

用户评论 (0)

登录后参与讨论

立即登录 注册账号

暂无评论,快来抢沙发吧~

操作成功