Work Description
The Smart Document Search Engine is a web-based application designed to search and retrieve information from multiple PDF documents efficiently. The system supports both Arabic and English languages, enabling users to quickly find relevant content within large document collections.
The platform uses the TF-IDF (Term Frequency–Inverse Document Frequency) algorithm to analyze and rank search results based on their relevance to the user’s query. This ensures that the most meaningful and related results appear first.
Users can search across multiple PDF files simultaneously through a simple and user-friendly web interface built with Flask. The system also includes advanced text processing features such as removing common words and normalizing text to improve search accuracy.
Overall, the project provides a practical solution for quickly locating information inside documents, making it useful for students, researchers, and professionals who frequently work with large PDF files. 🚀