Welcome to the LLM Toolbox Suite, a powerful and versatile set of tools designed to harness the capabilities of large language models for various productive tasks. This suite is built using Streamlit and integrates APIs from OpenAI, AssemblyAI, and NVIDIA.
LLM Toolbox Suite is designed to provide users with an interactive and user-friendly interface to chat with AI, chat with multiple Documents, chat with WebSearch, effortlessly transcribe, analyze, and chat with multi-speaker audio/video conversations amd even generate Meeting of Minutes with main themes!
The suite includes:
-
ChatBuddy : An interactive chatbot powered by OpenAI's GPT and Streamlit.
-
RAG DocAI Q&A: Harness the power of Retrieval-Augmented Generation to answer questions from your documents with AI precision and efficiency.
-
Chat with Search: Enhance your conversations with integrated search capabilities, providing instant answers and information from the web.
-
AudioVideo Transcriber: Effortlessly transcribe, analyze, and chat with multi-speaker audio/video conversations.
-
YouTube Transcriber: Easily convert YouTube videos into text with detailed transcripts for better comprehension and analysis.
-
MoM Generator: Transform your meeting recordings into detailed, categorized summaries and downloadable transcripts effortlessly.
- Interactive and Responsive Design: Enjoy a user-friendly and visually appealing interface.
- API Integration: Seamlessly integrates with OpenAI, AssemblyAI, and NVIDIA APIs.
- Chat Interface: Engage in conversations with an AI assistant for seamless interaction.
- Retrieval-Augmented Generation (RAG): Leverage advanced AI to provide precise answers to questions from documents using retrieval-augmented generation techniques.
- Integrated Search Capabilities: Enhance conversations with instant access to web-based information and answers.
- Audio/Video Transcription: Effortlessly transcribe and analyze multi-speaker audio and video conversations.
- YouTube Video Transcription: Convert YouTube videos into detailed text transcripts for improved comprehension and analysis.
- Meeting Summary Generation: Automatically generate detailed, categorized summaries and downloadable transcripts from meeting audio recordings.
- PDF Export: Download your conversation as a PDF file.
Ensure you have the following installed:
- Python 3.8 or higher
- Streamlit
- OpenAI API Key
- AssemblyAI API Key
- NVIDIA API Key
-
Clone the repository:
git clone https://github.com/SaurabhBadole/llm-toolbox-suite.git cd llm-toolbox-suite
-
Create and activate a virtual environment:
python -m venv venv source venv/bin/activate # On Windows use `venv\Scripts\activate`
-
Install the required dependencies:
pip install -r requirements.txt
-
Set up your environment variables by creating a
.env
file in the root directory and adding your API keys:ASSEMBLYAI_API_KEY=your_assemblyai_api_key HF_API_TOKEN=your_huggingface_api_token NVIDIA_API_KEY=your_nvidia_api_key OPENAI_API_KEY=your_openai_api_key
Run the main file to start the application:
streamlit run Chatbuddy.py
Here's an overview of the project's structure:
llm-toolbox-suite/
│
├──pages/
| └──1_RAG_DocAI_Q&A.py # application file for having conversation with an AI bot while can also download the conversation as PDF
| └──2_Chat_with_search.py # application file for having conversations with WebSearch
| └──3_AudioVideo_Transcriber.py # application file for transcribing, analyzing, and chatting with multi-speaker audio/video conversations
| └──4_YouTube_Transcriber.py # application file for convert YouTube videos into text with detailed transcripts
| └──5_MoM_Generator.py # application file for Meeting of Minutes generator with Main Themes and time stamps
├── .env # Environment variables
├── Chatbuddy.py # Main application file starting with main page ChatBuddy
├── htmltemplates.py # HTML templates for RAG DocAI Q&A Streamlit Interface
├── requirements.txt # Python dependencies
├── README.md # Project documentation
├── utils.py # Utility functions
├── chat_history/ # Directory for saving chat history for ChatBuddy
├── DocAI_history/ # Directory for saving DocAI RAG conversations
-
pages/RAG_DocAI_Q&A.py:
- This file provides functionality for users to interact with an AI bot, allowing them to ask questions and receive detailed responses. Additionally, it includes an option to download the entire conversation as a PDF for record-keeping or review purposes.
-
pages/Chat_with_search.py:
- This file enables users to engage in conversations with an AI that has integrated web search capabilities. The AI can fetch and provide real-time information from the web, enhancing the quality and relevance of the responses.
-
pages/AudioVideo_Transcriber.py:
- This file is designed to handle audio and video files, transcribing the spoken content into text. It can analyze multi-speaker conversations, providing insights and the ability to chat about the content. It's ideal for reviewing and interacting with recorded meetings or interviews.
-
pages/YouTube_Transcriber.py:
- This file allows users to convert YouTube videos into text. It generates detailed transcripts, making it easier to analyze and extract information from video content. This can be useful for content creators, researchers, and anyone needing text versions of video material.
-
pages/MoM_Generator.py:
- This file is aimed at generating Minutes of Meetings (MoM). It extracts the main themes and provides timestamps, making it a valuable tool for summarizing meetings and ensuring that important points are documented and easily accessible.
-
.env: Stores API keys and other environment variables.
-
Chatbuddy.py: The main Streamlit application file. Contains the core logic for the chat interface and PDF download functionality.
-
htmltemplates.py: Defines the HTML and CSS templates used for styling the chat interface.
-
requirements.txt: Lists all the required Python packages.
-
utils.py: Contains utility functions for time conversion, file reading, and chat session management.
-
chat_history/: Directory for saving chat history files.
-
DocAI_history/: Directory for saving DocAI RAG conversations
- Enter API Keys: Start the application and enter your API keys for tools wherever the api key is asked in the sidebar.
- Chat with multiple tools: Begin chatting with the AI assistant by typing your message and then download conversations in pdf format.
- Download Conversation: After your session, download the chat history in PDF format.
- Download Transcripts: After transcribing the YT videos, you can downlownload it in txt and SRT format.
- Clear Chat: Use the 'Clear Chat' button to start a new session.
For any inquiries, please contact Saurabh Khushal Badole.