AI KNOWLEDGE HUB 2.0
DOI:
https://doi.org/10.61808/jsrt264Keywords:
AI, PyPDF2, TTSAbstract
The AI Knowledge Hub is an interactive application that enables users to access and analyze multimedia content like PDFs, YouTube videos, and images. It uses advanced AI models and natural language processing to extract relevant information from these sources. The system extracts text from PDFs using PyPDF2, retrieves video transcripts via the YouTube Transcript API, and processes images using Optical Character Recognition (OCR). The extracted text is processed using LangChain and stored in FAISS, a vector database optimized for fast similarity searches. The Gemini AI model analyzes user queries, generating context-aware responses. The application supports multilingual translation using Googletrans and a text-to-speech (TTS) feature powered by gTTS. The intuitive Streamlit-based interface allows for quick navigation. The AI Knowledge Hub is designed for real-time processing, making it useful for students, researchers, and professionals. Future enhancements include support for additional file formats, cloud storage integration, advanced video analysis, offline processing capabilities, and compatibility with voice assistants like Alexa and Google Assistant.