Khaled Alruwita - Portfolio

Khaled Alruwita

Data Engineer & Analyst

About Me

Data Engineer and Analyst with expertise in designing end-to-end data pipelines, building scalable platforms, and automating ETL workflows. Proven track record of delivering reliable datasets through data warehousing, orchestration, and data quality practices.

Certifications

• CDMP® Associate - DAMA International

• Business Analysis Fundamentals - Tuwaiq Academy

Featured Projects

SITE – Data Engineering Toolkit

Built an internal data engineering toolkit that automates SQL validation and optimization and gives clear visibility into data quality, lineage, and transformation health.

Reduced manual workload by ~80% and gave leadership a way to monitor data reliability and pipeline health.

Key Features:
  • Automated SQL checks and optimization analysis
  • Data lineage and impact visibility
  • Daily pipeline health monitoring
  • Used by engineers, managers, and leadership
Python | SQL | Automation
Used Cars ETL + Price Estimator

ETL pipeline that stages data in PostgreSQL, cleanses it with PySpark, and loads a star-schema mart. Includes a LightGBM price estimator with a Gradio interface and an optional P10–P90 price range.

R² ≈ 0.85 | MAE ≈ 3,140 | RMSE ≈ 5,768
PySpark | ML | PostgreSQL
Traffic Simulation

Built a campus traffic simulation at PSAU to evaluate stop time, waiting time, and throughput across multiple scenarios. Decision-tree analysis supported data-driven campus traffic improvements.

Won 1st place in the Student Project Exhibition.

Simulation | Analytics | Award
Transaction Parsing System

Accepts bank messages through web and iOS Shortcuts inputs, parses them into structured transactions, and stores them, with a page for reviewing stored records.

iOS | Automation

Technical Skills

Tools and technologies I work with

Data Engineering

  • ETL / ELT Pipelines
  • Data Warehousing
  • Dimensional Modeling
  • SQL Optimization
  • Data Lineage

Back-End

  • Python
  • MongoDB
  • Flask
  • Django
  • PostgreSQL
  • Apache Airflow

Cloud Computing

  • AWS (EC2, S3, RDS, Lambda)
  • Alibaba Cloud (ECS)
  • Oracle Cloud Infrastructure

DevOps

  • Docker
  • Nginx
  • Linux
  • Git
  • Cloud Deployment

Contact Me

alruwita.k@gmail.com
+966566200419
Riyadh, Saudi Arabia

Let's Connect