Summary
Overview
Work History
Education
Skills
Certification
Languages
Timeline
Generic

Albin Jose

Philadelphia,PA

Summary

Dynamic Azure-focused Data Engineer with over 3 years of experience designing and optimizing scalable data solutions within cloud environments. Proven expertise in constructing enterprise-grade data pipelines using Azure Data Factory, Databricks (PySpark), and Azure Synapse, with a strong emphasis on Lakehouse and Medallion Architecture. Skilled in integrating on-premises Oracle and CRM data into cloud-based platforms to ensure seamless data flow and accessibility. Collaborative team player adept at driving continuous integration and deployment (CI/CD) practices through Azure DevOps, supporting comprehensive end-to-end analytics initiatives.

Overview

3
3
years of professional experience
1
1
Certification

Work History

Data Engineer

Exelon
01.2025 - Current
  • Led a strategic enterprise data migration from legacy on-premise systems (Oracle, CRM platforms) to Azure Cloud, delivering a seamless transition with zero data loss and minimal business disruption.
  • Designed and implemented highly scalable ETL/ELT pipelines leveraging Azure Data Factory, Azure Databricks, and PySpark to enable efficient processing and transformation of large-scale datasets.
  • Developed robust end-to-end data ingestion frameworks to extract, cleanse, and integrate data into Azure Data Lake Gen2 and Azure SQL Database for analytics and reporting.
  • Worked on Medallion Architecture to enhance data organization, accelerate query performance, and optimize storage in Azure Synapse Analytics.
  • Supported CI/CD pipelines in Azure DevOps to streamline deployments, enforce version control, and improve cross-functional collaboration
  • Monitored and optimized data pipeline performance to ensure scalability, high availability, and fault tolerance throughout migration and ongoing operations.

Data Engineer

Veranova
04.2022 - 09.2024
  • Designed and deployed high-performance ETL pipelines using Azure Data Factory and SQL to transform operational and CRM data into analytical models powering executive dashboards and KPI reporting in Azure.
  • Integrated and standardized data from on-premise Oracle systems, third-party CRMs, and external APIs into a centralized Azure Data Lake, enabling consistent KPI definitions and calculations.
  • Collaborated with business leaders and analysts to define KPIs, establish calculation logic, and ensure alignment between data pipelines and strategic reporting objectives.
  • Performed comprehensive data profiling, validation, and schema standardization to guarantee accuracy, completeness, and consistency of KPI inputs.
  • Implemented strong data governance practices, including metadata management and data lineage tracking, to ensure transparency, auditability, and trust in KPI-driven insights.

Education

MS - Data Science

Eastern University
11.2024

Bachelor’s - Data Analytics

Amity University
10.2020

Skills

  • Languages & Frameworks: Python, SQL, PySpark, Spark, R, Git
  • Cloud & Big Data: Azure Data Factory, Azure Data Lake Gen2, Azure Databricks, Synapse Analytics, Microsoft Fabric
  • Databases: SQL Server, MySQL, Oracle, PostgreSQL
  • Data Analytics & ML: Pandas, NumPy, Scikit-learn, TensorFlow, Keras
  • Visualization & Reporting: Power BI, Tableau, Spotfire
  • Tools & Platforms: MySQL, Excel, Jupyter Notebook, RStudio
  • Core Strengths: Data Pipeline Design, Data Lake Architecture, Data Governance, EDA, Machine Learning, AI Integration, Problem-Solving

Certification

AZ-900: Microsoft Azure Fundamentals Certification

Languages

English
Full Professional
Hindi
Professional Working

Timeline

Data Engineer

Exelon
01.2025 - Current

Data Engineer

Veranova
04.2022 - 09.2024

Bachelor’s - Data Analytics

Amity University

MS - Data Science

Eastern University
Albin Jose