Skip to content

codeperfectplus/Speak2Summary

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

57 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Speak2Summary

Speak2Summary is a Flask-based web application that leverages Large Language Models (LLMs) to transcribe and summarize meeting audio files. It provides an intuitive interface for users to upload audio recordings and receive concise summaries, facilitating efficient meeting reviews and note-taking.

Features

  • 🎙️ Audio Transcription: Convert spoken content from meetings into text using advanced speech-to-text capabilities.
  • 🧠 Summarization with LLMs: Generate concise summaries of transcribed text utilizing powerful language models.
  • Mind Map Generation: Create visual representations of meeting summaries to enhance understanding and retention.
  • 🖥️ User-Friendly Interface: Interact with a clean and responsive web UI built with Flask.
  • 🐳 Dockerized Deployment: Easily deploy the application using Docker and Docker Compose for a consistent environment setup.

Prerequisites

Installation and Usage

  1. Clone the Repository

    git clone https://github.com/codeperfectplus/Speak2Summary.git
  2. Navigate to the Project Directory

    cd Speak2Summary
  3. Set Environment Variables

    Export your API keys as environment variables:

    export GROQ_API_KEY=your_groq_api_key
    export OPENAI_API_KEY=your_openai_api_key

Replace your_groq_api_key and your_openai_api_key with your actual API keys.

  1. Build and Run the Docker Container

    Use Docker Compose to build and start the application:

    docker-compose up --build -d
  2. Access the Application

    Open your web browser and navigate to:

    http://localhost:5000
    

Screenshots

Screenshot from 2025-05-29 08-12-24 Screenshot from 2025-05-29 08-12-59 Screenshot from 2025-05-29 08-14-11 Screenshot from 2025-05-29 08-16-20

License

This project is licensed under the MIT License.

Acknowledgments

  • Flask - Web framework used for the application.
  • OpenAI - Provider of the language models used for summarization and transcription.
  • GROQ - API used for audio transcription and Summarization.

Contributing

We welcome contributions to Speak2Summary! If you have suggestions for improvements or new features, please open an issue or submit a pull request.