About
Seasoned Senior Data Engineer with over 7 years of experience in architecting robust data platforms, building scalable ETL pipelines, and driving data-centric solutions across banking, healthcare, and technology sectors. Expert in cloud-native data engineering using platforms such as Google Cloud Platform (GCP), Amazon Web Services (AWS), and Azure, with a proven track record of designing real-time streaming systems, automating workflows, and ensuring data integrity and security.
Proficient in a wide array of technologies including Apache Airflow, Spark, Kafka, Hadoop, Docker, and Terraform, with strong command over SQL, Python, Scala, and Java. Adept at managing big data ecosystems, implementing CI/CD pipelines, and integrating machine learning workflows using TensorFlow, Databricks, and MLflow for predictive analytics.
Demonstrated expertise in:
Real-time data streaming and orchestration using Kafka and Airflow
Cloud-based data storage and processing (GCS, BigQuery, AWS Glue, S3, EMR, Snowflake, Databricks)
ETL/ELT development, workflow automation, and infrastructure as code using Terraform and Docker
Data visualization using Power BI, Tableau, and Jupyter Notebooks
Database management with PostgreSQL, Oracle, SQL Server, and MongoDB
Implementing governance, data quality, and security frameworks for regulatory compliance
A strong proponent of Agile methodologies, with leadership experience in delivering complex, cross-functional data projects. Passionate about enabling data democratization, ensuring reliable data pipelines, and delivering actionable business insights that support strategic decision-making.