الصورة الشخصية

Mohamed Elmesawy

Machine Learning Eng   مصر

نبذة عني

لم يكتب نبذة شخصيةExperienced Machine/Deep Learning engineer skilled in Computer Vision, NLP, and Reinforcement Learning. Expertise includes Generative Models (LLMs, GANs, Diffusion Models) and Graph Neural Networks. I have Master in Generative AI and I am a certified NVIDIA Ambassador in Generative AI. A Researcher and Lecturer, blending industry insights with academic excellence. Passionate about pushing ML boundaries and fostering the next generation of innovators. Lead AI team to develop cutting-edge Machine Learning solutions in Computer Vision, Natural Language Processing, and Speech Recognition. My work includes creating advanced chatbots using LLMs, implementing document understanding systems, and executing innovative computer vision projects. # Specifically, I have extensive experience with the following technologies : • Python • PyTorch • TensorFlow/Keras • OpenCV • C# • Java • JavaScript • MLOPS # Sample of the projects I worked on so far: • Generative Chat-Bot: Use generative LLM models besides the intent classification. • Fine-tuning LLM: Fine-tune LLM on custom dataset using LoRA technique. • Face Recognition: Detect faces then classify based on custom faces database . • Document Understanding: Extract information from documents like invoices, and contracts. • OCR: OCR documents in multi-lingual. • Image Segmentation: Image segmentation service. • Recommender System: Suggest documents based on document similarity. • Textual RAG: Retrieval Augmented Generation system on customer documents. • Multi-modal RAG: Retrieval Augmented Generation system on different types of media: images, audio, and text. • Speech to Text - STT: Speech to text service using Kaldi and open source models. • Text to Speech - TTS: Text to speech service. • Speaker Diarization: Automatically identifying and segmenting an audio recording into distinct speech segments.

احصائيات
التقييمات
انشاء الحساب منذ أسبوعين
آخر تواجد منذ أسبوع
مركز المساعدة