Yasir Khalid

Assistant Manager (Data Engineering/Analytics)

About Me


I am an Assistant Manager at KPMG UK, where I implement data-driven solutions for Tier-1 banks using Spark, Elasticsearch, microservices and AWS. Over the past 4 years I have specialised in data engineering, analytical workloads, backend development and entity resolution frameworks (Quantexa).


"Yasir's technical and problem-solving prowess combined with his ability to convey ideas and communicate effectively make him a superb Change-professional and will give him an extremely high ceiling for his career progress."

Sam Heard
(Project Manager @ NatWest)

Experience


KPMG UK

Assistant Manager (Data Engineering) - Jul 2022 - Present

Helping Tier-1 banking clients with large-scale data ingestion, entity resolution and risk monitoring solutions using the Quantexa platform.
Coordinated the design and implementation of ETL jobs processing terabytes of structured and semi-structured data using Scala, Apache Spark, Elasticsearch and Apache Airflow.

Quantexa AWS Microservices Apache Spark Docker Kubernetes Elasticsearch TeamCity GitLab Google Cloud (GCP) Linux Release Management Apache Airflow Databricks

Ipsos UK

Data Scientist - Feb 2022 - Jun 2022

Developed ETL jobs in Python involving cloud data lakes (GCS) and a data warehouse (BigQuery), while collaborating with a team of 15+ developers.
Automated reports in Google Data Studio using SQL scripts in Google BigQuery.

Python Google BigQuery ETL Solutions Delivery GCP Data Ingestion Docker

Projects

Sport Scanner

Just like Skyscanner, this service helps users find available sports bookings across London through a unified interface, without having to check different organisations' sites. The service is available at www.sportscanner.co.uk

  • Backend crawlers developed in Python using async I/O and Postgres, deployed as cron jobs
  • CI pipeline set up using GitHub Actions that runs unit tests, builds images and stores them in the GitHub Container Registry
  • Website uptime monitoring integrated via ping requests, with status available at status.sportscanner.co.uk
Python Docker Continuous Integration (CI) GitHub Actions Asynchronous Postgres Streamlit Cron

Recommendations

"He was always the one who identified problems from afar and was always prepared for them. The one thing I can guarantee about Yasir, is that he Delivers, every single time"

Syeda Khadija Gardezi
(Programme Director @ Hult Prize on-campus)

"I have worked with Yasir on data engineering projects and can say he is a great team member to work with. His technical capabilities do really stand out."

Sarg Senthil
(Metaverse Lead @ Kagool)

"..He is always motivated, pays attention to the details, deep dives into problems and provides robust solutions.."

Alexandros Nezeris
(Data Science Manager @ Holland & Barrett)

"He is such a hard working individual who went above and beyond on all tasks that were assigned to him"

Hanaa Lakhani
(Co-Founder @ Roshni Rides)

"I was able to give him complex tasks which he was able to work independently on and present the findings in a clear, concise and timely manner"

Tim Venison
(Head of Product @ ERM)