FIFA World Cup 2022 Data analysis
تفاصيل العمل
This project focuses on developing a comprehensive data pipeline to process and analyze data from the FIFA World Cup 2022. The pipeline extracts data from various sources, transforms it, and loads it into a centralized data warehouse. This enables detailed analysis and valuable insights into the performance of teams and players throughout the tournament. The project uses Python to extract data from multiple sources such as: CSV files, operational database, and data scraped from the web. After extraction, the Python scripts perform transformations to clean and format data (e.g., handling missing values, unifying data types), integrate data from multiple sources if needed (e.g., joining tables from CSV and OLTP database), ensure compatibility with the data warehouse schema. The transformed data is loaded into SQL Server, where it’s structured into fact and dimension tables to support analytical queries
مهارات العمل
بطاقة العمل
طلب عمل مماثل