TODO List

Core Features

  • Implement multi-threaded transcription for processing multiple audio files simultaneously.
  • Add support for the additional languages that OpenAI's Whisper API can transcribe.
  • Enhance error handling to retry failed API requests.
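
A minimal sketch of how the multi-threading and retry items could fit together, not a committed design. The names `with_retries`, `transcribe_many`, and the `transcribe_file` callable are hypothetical; `transcribe_file` stands in for whatever function ends up calling the Whisper API for one file. The sketch assumes that call is I/O-bound, so threads are sufficient, and that any exception is worth retrying, which a real implementation would likely narrow to network and rate-limit errors.

```python
import time
from concurrent.futures import ThreadPoolExecutor, as_completed


def with_retries(func, attempts=3, base_delay=1.0, sleep=time.sleep):
    """Wrap func so it is retried with exponential backoff on any exception."""
    def wrapper(*args, **kwargs):
        for attempt in range(1, attempts + 1):
            try:
                return func(*args, **kwargs)
            except Exception:
                if attempt == attempts:
                    raise  # out of attempts: surface the last error
                # Wait 1x, 2x, 4x ... base_delay before the next attempt.
                sleep(base_delay * 2 ** (attempt - 1))
    return wrapper


def transcribe_many(paths, transcribe_file, max_workers=4):
    """Run transcribe_file over paths in a thread pool.

    Returns (results, errors), both dicts keyed by path, so one bad
    file does not abort the whole batch.
    """
    results, errors = {}, {}
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = {pool.submit(transcribe_file, p): p for p in paths}
        for future in as_completed(futures):
            path = futures[future]
            try:
                results[path] = future.result()
            except Exception as exc:
                errors[path] = exc
    return results, errors
```

Usage would look like `transcribe_many(paths, with_retries(transcribe_file))`. Collecting errors per file instead of raising keeps a single failed upload from discarding the transcripts that did succeed.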

Performance Optimization

  • Optimize logging by adding rotation or cleanup options for log files.
  • Improve file size checking and format validation for audio files before transcription.
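
One possible shape for both items, using only the standard library. `build_logger`, `validate_audio`, and the logger name `transcriber` are hypothetical. The 25 MB limit and the extension list are assumptions based on OpenAI's published limits for the transcription endpoint and should be checked against the current API documentation before being relied on.

```python
import logging
from logging.handlers import RotatingFileHandler
from pathlib import Path

# Assumed values; verify against the current API documentation.
MAX_BYTES = 25 * 1024 * 1024
ALLOWED_SUFFIXES = {".mp3", ".mp4", ".mpeg", ".mpga", ".m4a", ".wav", ".webm"}


def build_logger(log_path, max_bytes=1_000_000, backup_count=3):
    """Return a logger whose file is rotated once it reaches max_bytes."""
    logger = logging.getLogger("transcriber")  # hypothetical logger name
    logger.setLevel(logging.INFO)
    if not logger.handlers:  # avoid stacking handlers on repeated calls
        handler = RotatingFileHandler(
            log_path, maxBytes=max_bytes, backupCount=backup_count
        )
        handler.setFormatter(
            logging.Formatter("%(asctime)s %(levelname)s %(message)s")
        )
        logger.addHandler(handler)
    return logger


def validate_audio(path, max_bytes=MAX_BYTES):
    """Return a list of problems with path; an empty list means it passed."""
    path = Path(path)
    if not path.is_file():
        return [f"{path}: not a file"]
    problems = []
    if path.suffix.lower() not in ALLOWED_SUFFIXES:
        problems.append(f"{path}: unsupported extension {path.suffix!r}")
    size = path.stat().st_size
    if size == 0:
        problems.append(f"{path}: file is empty")
    elif size > max_bytes:
        problems.append(f"{path}: {size} bytes exceeds limit of {max_bytes}")
    return problems
```

Returning a list of problems rather than raising on the first one lets the CLI report every issue with a file in a single pass, before any API request is made. Note that this checks the extension only, not the actual container format.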

CLI Improvements

  • Add verbose mode to show detailed progress for transcription steps.
  • Implement a progress bar to indicate transcription status in real time.
  • Allow specifying output format (e.g., JSON or plain text) for transcription results.
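
A rough sketch of how these three CLI items might look with `argparse`. The program name `transcribe`, the flag names, and the helper functions are all hypothetical, and the progress bar is a plain-text stand-in for whatever library is eventually chosen.

```python
import argparse
import json


def build_parser():
    """Build the command-line parser (flag names are illustrative)."""
    parser = argparse.ArgumentParser(prog="transcribe")
    parser.add_argument("files", nargs="+", help="audio files to transcribe")
    parser.add_argument(
        "-v", "--verbose", action="store_true",
        help="show detailed progress for each transcription step",
    )
    parser.add_argument(
        "--output-format", choices=["json", "text"], default="text",
        help="format used when printing transcription results",
    )
    return parser


def render_progress(done, total, width=20):
    """Return a one-line text progress bar such as '[##......] 1/4'."""
    filled = width * done // total if total else 0
    return f"[{'#' * filled}{'.' * (width - filled)}] {done}/{total}"


def format_result(path, text, output_format):
    """Format one transcript as JSON or as plain text."""
    if output_format == "json":
        return json.dumps({"file": path, "text": text})
    return text
```

With this layout, `transcribe a.mp3 b.wav --output-format json -v` would parse into `files`, `output_format`, and `verbose` attributes, and the main loop could print `render_progress(i, len(files))` after each file completes.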

Documentation

  • Add examples of supported audio file types in the README.
  • Include benchmarks or performance metrics for processing times with different file sizes.