A Windows desktop application that generates subtitles, translations, and summaries for videos in 8 languages, using APIs and SDKs from Tencent, Alibaba, and Baidu. Use it to produce bilingual transcripts for videos and to summarise the key points of a transcript with LexRank, a PageRank-style graph ranking algorithm.
Download the .exe file and run it on your Windows computer.
Supports video and audio files in common formats:
- Video: .mp4, .mpeg, .mkv, .wmv, .avi, .m4v, .mov, .flv, .rmvb, .3gp, .f4v
- Audio: .mp3, .wav, .aac, .wma, .flac, .m4a
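The format list above can be checked with a simple extension lookup. The following is a minimal sketch, not the application's actual code: the names `videoExts`, `audioExts`, and `mediaKind` are hypothetical, and it assumes the file type is decided by extension alone.

```go
package main

import (
	"fmt"
	"path/filepath"
	"strings"
)

// videoExts and audioExts mirror the supported formats listed above.
// The variable names are hypothetical.
var videoExts = map[string]bool{
	".mp4": true, ".mpeg": true, ".mkv": true, ".wmv": true, ".avi": true,
	".m4v": true, ".mov": true, ".flv": true, ".rmvb": true, ".3gp": true,
	".f4v": true,
}

var audioExts = map[string]bool{
	".mp3": true, ".wav": true, ".aac": true,
	".wma": true, ".flac": true, ".m4a": true,
}

// mediaKind classifies a path as "video", "audio", or "unsupported"
// by its file extension, ignoring case.
func mediaKind(path string) string {
	ext := strings.ToLower(filepath.Ext(path))
	switch {
	case videoExts[ext]:
		return "video"
	case audioExts[ext]:
		return "audio"
	default:
		return "unsupported"
	}
}

func main() {
	for _, p := range []string{"lecture.MP4", "talk.flac", "notes.txt"} {
		fmt.Printf("%s: %s\n", p, mediaKind(p))
	}
}
```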
It can generate three types of output: SRT subtitle files, plain text, and lecture summaries, either together or separately according to the user's needs. It also supports bilingual translation between 10 languages: Chinese, English, Japanese, Korean, French, German, Spanish, Russian, Italian, and Thai.
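For readers unfamiliar with the SRT format, here is a minimal sketch of how a bilingual SRT file could be assembled in Go. It is an illustration only, not the application's own code: the `Cue` type and the `srtTimestamp` and `formatSRT` functions are hypothetical, and the sketch assumes the translation is written on a second line under the original text.

```go
package main

import (
	"fmt"
	"strings"
	"time"
)

// Cue is a hypothetical subtitle entry: a time range, the recognised text,
// and an optional translation shown on a second line.
type Cue struct {
	Start, End  time.Duration
	Text        string
	Translation string
}

// srtTimestamp renders a duration in the SRT form HH:MM:SS,mmm.
func srtTimestamp(d time.Duration) string {
	ms := d.Milliseconds()
	return fmt.Sprintf("%02d:%02d:%02d,%03d",
		ms/3600000, ms/60000%60, ms/1000%60, ms%1000)
}

// formatSRT numbers the cues from 1 and separates them with a blank line.
func formatSRT(cues []Cue) string {
	var b strings.Builder
	for i, c := range cues {
		fmt.Fprintf(&b, "%d\n%s --> %s\n%s\n",
			i+1, srtTimestamp(c.Start), srtTimestamp(c.End), c.Text)
		if c.Translation != "" {
			b.WriteString(c.Translation + "\n")
		}
		b.WriteString("\n")
	}
	return b.String()
}

func main() {
	cues := []Cue{
		{Start: 0, End: 1500 * time.Millisecond, Text: "Hello", Translation: "你好"},
		{Start: 2 * time.Second, End: 4 * time.Second, Text: "Welcome to the lecture"},
	}
	fmt.Print(formatSRT(cues))
}
```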
- Baidu and Tencent translation SDKs
- go tldr for automatic summarisation
- aliyun-cloud-sdk-go, aliyun-oss-go-sdk, and tencentcloud-sdk-go for storing the audio files and generated transcripts
- The Intelligent Speech Interaction service on Aliyun
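To give an idea of what LexRank-style summarisation does, the sketch below scores sentences by how similar they are to the rest of the transcript. It is a simplified illustration written for this README, not the code of go tldr or of this application: the function names are hypothetical, similarity is plain term-frequency cosine (no IDF weighting or threshold), and the damping factor and iteration count are assumed values.

```go
package main

import (
	"fmt"
	"math"
	"strings"
)

// termFreq builds a lowercase bag-of-words vector for one sentence.
func termFreq(sentence string) map[string]float64 {
	tf := make(map[string]float64)
	for _, w := range strings.Fields(strings.ToLower(sentence)) {
		if w = strings.Trim(w, ".,;:!?\"'()"); w != "" {
			tf[w]++
		}
	}
	return tf
}

// cosine returns the cosine similarity of two term-frequency vectors.
func cosine(a, b map[string]float64) float64 {
	var dot, na, nb float64
	for w, x := range a {
		dot += x * b[w]
		na += x * x
	}
	for _, y := range b {
		nb += y * y
	}
	if na == 0 || nb == 0 {
		return 0
	}
	return dot / (math.Sqrt(na) * math.Sqrt(nb))
}

// rankSentences runs a PageRank-style power iteration over the sentence
// similarity graph and returns one score per sentence.
func rankSentences(sentences []string, damping float64, iterations int) []float64 {
	n := len(sentences)
	if n == 0 {
		return nil
	}
	vecs := make([]map[string]float64, n)
	for i, s := range sentences {
		vecs[i] = termFreq(s)
	}
	sim := make([][]float64, n)
	rowSum := make([]float64, n)
	for i := range sim {
		sim[i] = make([]float64, n)
		for j := range sim[i] {
			if i != j {
				sim[i][j] = cosine(vecs[i], vecs[j])
				rowSum[i] += sim[i][j]
			}
		}
	}
	scores := make([]float64, n)
	for i := range scores {
		scores[i] = 1 / float64(n)
	}
	for it := 0; it < iterations; it++ {
		next := make([]float64, n)
		for i := range next {
			next[i] = (1 - damping) / float64(n)
			for j := 0; j < n; j++ {
				if rowSum[j] > 0 { // sentences with no neighbours pass on nothing
					next[i] += damping * sim[j][i] / rowSum[j] * scores[j]
				}
			}
		}
		scores = next
	}
	return scores
}

func main() {
	sentences := []string{
		"Subtitles are generated from the audio track.",
		"The audio track is uploaded and subtitles are translated.",
		"Translated subtitles are saved as a file.",
	}
	for i, s := range rankSentences(sentences, 0.85, 20) {
		fmt.Printf("%.3f  %s\n", s, sentences[i])
	}
}
```

A summary is then formed by keeping the highest-scoring sentences in their original order.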
- First, upload a video or audio file from your computer.
- Then submit it. The software will start generating the transcripts for you.
The application is built with the Go walk library. main.go contains the code and logic of the program's main window, and calls the relevant files in the app package to perform each operation. Interface code, business logic, and data objects are kept separate, much like the MVC pattern, in three layers: a presentation layer, a business logic layer, and a data access layer. The presentation layer interacts with the user and calls functions in the corresponding modules of the app package to carry out business operations; those modules in turn call data layer functions to operate on the data.
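The three-layer split described above can be sketched as follows. This is only an illustration of the layering, not the project's real code: `TranscriptStore`, `memoryStore`, `SubtitleService`, and `handleSubmit` are hypothetical names, the storage is an in-memory stand-in, and the GUI is replaced by a plain function so the example runs without the walk library.

```go
package main

import (
	"errors"
	"fmt"
)

// TranscriptStore is the data access layer (hypothetical interface).
type TranscriptStore interface {
	Save(name, content string) error
	Load(name string) (string, error)
}

// memoryStore is an in-memory stand-in for disk or cloud storage.
type memoryStore struct{ files map[string]string }

func newMemoryStore() *memoryStore {
	return &memoryStore{files: map[string]string{}}
}

func (m *memoryStore) Save(name, content string) error {
	m.files[name] = content
	return nil
}

func (m *memoryStore) Load(name string) (string, error) {
	content, ok := m.files[name]
	if !ok {
		return "", errors.New("not found: " + name)
	}
	return content, nil
}

// SubtitleService is the business logic layer. It knows nothing about
// the user interface and reaches the data only through TranscriptStore.
type SubtitleService struct{ store TranscriptStore }

// Generate stores a placeholder transcript; real speech recognition
// would take the place of the fixed string.
func (s *SubtitleService) Generate(mediaPath string) (string, error) {
	if mediaPath == "" {
		return "", errors.New("no media file selected")
	}
	out := mediaPath + ".srt"
	if err := s.store.Save(out, "1\n00:00:00,000 --> 00:00:01,000\n(placeholder)\n"); err != nil {
		return "", err
	}
	return out, nil
}

// handleSubmit plays the role of a presentation-layer event handler:
// it calls the business layer and turns the result into a status message.
func handleSubmit(svc *SubtitleService, mediaPath string) string {
	out, err := svc.Generate(mediaPath)
	if err != nil {
		return "Error: " + err.Error()
	}
	return "Saved " + out
}

func main() {
	svc := &SubtitleService{store: newMemoryStore()}
	fmt.Println(handleSubmit(svc, "lecture.mp4"))
	fmt.Println(handleSubmit(svc, ""))
}
```

Because the business layer depends only on the `TranscriptStore` interface, the storage backend can be swapped without touching the interface code.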