AI audio and video summarization supports summarizing the subtitle content of audio and video from YouTube, Bilibili, Douyin, Xiaohongshu as well as links to mp3 and mp4 files on the Internet, and generating mind maps. It can also translate the subtitles, download them in SRT or TXT format, and use the video content as context to have conversations with AI large models, enabling users to quickly understand the video content.
Open-source version of the AI Audio and Video Summary from 302.AI. You can directly log in to 302.AI for a zero-code, zero-configuration online experience. Alternatively, customize this project to suit your needs, integrate 302.AI's API KEY, and deploy it yourself.
Based on the uploaded audio and video links or files, generate brief summaries, which include abstracts and mind maps.
Generate detailed summaries, which include outlines and mind maps.
You can conduct question-and-answer sessions with AI to learn more information related to the audio and video.
It can be easily completed by uploading videos.
It supports videos from multiple platforms: YouTube, TikTok, Bilibili, Douyin, MP4, etc.
Subtitle translation supports Chinese, English and Japanese.
Various subtitle formats can be downloaded: VTT, SRT and TXT formats are supported.
It provides a brief summary service to quickly extract the key points of videos.
It provides a detailed summary service to deeply analyze the content of videos.
iInteract with AI, which will intelligently answer questions related to videos.
Switch as you like for more comfortable eye protection.
You can share the summary and share wonderful content with friends.
- Chinese Interface
- English Interface
- Japanese Interface
With AI Audio and Video Summary, anyone can efficiently obtain video information! ππ₯ Let's explore the new world of AI-driven information acquisition together! ππ
- Expand the compatibility with other audio and video formats
- Provide personalized customization options. For example, users can choose the level of detail for the summary (brief summary or detailed analysis), the style of the mind map (logic diagram, fishbone diagram, etc.), and the language style of the translation (formal, colloquial, etc.) according to their own needs, so that the generated results better meet the usage preferences and specific application scenarios of different users
- Next.js 14
- Tailwind CSS
- Shadcn UI
- markmap
- Vercel AI SDK
- Clone the project
git clone https://github.com/302ai/302_video_summary
- Install dependencies
pnpm install
- Configure the 302 API KEY as per .env.example
- Run the project
pnpm dev
- Build and deploy
docker build -t video-summary . && docker run -p 3000:3000 video-summary
302.AI is an enterprise-oriented AI application platform that offers pay-as-you-go services, ready-to-use solutions, and an open-source ecosystem.β¨
- π§ Comprehensive AI capabilities: Incorporates the latest in language, image, audio, and video models from leading AI brands.
- π Advanced application development: We build genuine AI products, not just simple chatbots.
- π° No monthly fees: All features are pay-per-use, fully accessible, ensuring low entry barriers with high potential.
- π Powerful admin dashboard: Designed for teams and SMEs - managed by one, used by many.
- π API access for all AI features: All tools are open-source and customizable (in progress).
- π‘ Powerful development team: Launching 2-3 new applications weekly with daily product updates. Interested developers are welcome to contact us.