Amir Saleem Hello! I am

Amir Saleem

I'm a Data Scientist

Transforming raw data into actionable insights with Python, SQL & machine learning.

Islamabad, Pakistan

Resume Experience Contact Me

Skills

Tools and domains grouped by how often I use them—no invented percentages. One timeline below is anchored to real dates from my experience.

Python

8+ years in data-focused roles

From first data-science internship (2018) through present. The bar is a date span, not a skill score.

Engineering

Daily
Python Linux / Bash Git / GitHub
Occasionally
FastAPI / Django Web Scraping Bitbucket / GitLab

Data science (core)

Daily
SQL Problem Solving
Occasionally
R

Data engineering

Daily
PostgreSQL MySQL
Regularly
ETL
Occasionally
Amazon S3 Amazon Athena Amazon Redshift GraphQL

IDE & notebooks

Daily
Jupyter Cursor DBeaver
Occasionally
Colab Workbench Databricks

Data analysis

Daily
Pandas Google Sheets Pivot Tables
Regularly
Excel Data Mining Statistical Data Analysis

Machine learning

Regularly
Scikit-learn PyTorch MLOps Predictive Analytics Vision

Cloud & delivery

Regularly
CI/CD GitHub Actions
Occasionally
AWS Lambda AWS CloudWatch

Experience

Python Data Scientist

Turing
Remote (USA)

Nov 2023 - Mar 2026
(2 years, 5 months)

  • Improved model accuracy by 22% using advanced feature engineering.
  • Enhanced, evaluated, and reviewed large language model (LLM) through prompt engineering techniques, reducing LLM error rate by 35%
  • Designed and maintained high-throughput ETL pipelines and data workflows using Python and SQL, processing ~ 5M+ records daily with improved reliability.
  • Eliminated repetitive data processing tasks using Python scripting, which reduced manual effort by 40%.
  • Partnered with cross-functional teams (engineering, product, and business) to convert data insights into actionable product and strategy improvements.
  • Optimized existing data pipelines for scalability and performance, enabling faster iteration cycles for the data science team.

Data Engineer ( Freelancer )

Medusa
Remote (USA)

Jun 2023 - Feb 2024
(9 months)

  • Built and managed ETL processes on AWS using Redshift, Athena, and MySQL within containerized AWS Lambda functions running on Docker.
  • Integrated with existing CI/CD pipelines via Bitbucket to enable automated, reliable code deployment workflows.
  • Deployed Docker images to AWS infrastructure and monitored system performance and health using AWS CloudWatch dashboards and alerts.
  • Implemented reverse ETL pipelines for Monday.com using GraphQL API integration, enabling seamless bi-directional data synchronization.

Data Scientist

LoveForData
On-site (Karachi - Pakistan)

Jan 2023 - Nov 2023
(11 months)

  • Designed and implemented End-to-End ETL workflows to extract, transform, and load data from diverse sources into a centralized data warehouse.
  • Performed exploratory data analysis (EDA) using Python and SQL, identifying key business patterns and trends that guided strategic decisions.
  • Built and deployed predictive machine learning models using scikit-learn and PyTorch, improved recall from 55% to 73% while maintaining constant precision for a default prediction model

Data Analyst

LoveForData
On-site (Karachi - Pakistan)

Jan 2021 - Dec 2022
( 2 years )

  • Developed Django-based web applications for interactive data visualization and self-service analytics, increasing stakeholder data accessibility by 50%
  • Built and deployed a Python-based ETL pipeline on Azure Functions, enabling serverless, scalable data processing on the cloud.
  • Applied market basket analysis (association rules) to a restaurant client's transaction data, generating actionable insights that improved menu design, marketing strategy, and customer experience.
  • Developed multiple production-grade web scraping solutions in Python to automate data collection from external websites.
  • Shipped a forecasting model, supporting data-driven planning and operational decisions.

Junior Data Analyst

LoveForData
On-site (Karachi - Pakistan)

Apr 2019 - Dec 2020
( 1 year, 9 months )

  • Performed comprehensive data audits to identify and correct data quality issues, reducing data entry errors by 25% across 5 client databases.
  • Built an interactive, real-time dashboard using R-Shiny for data visualization and exploratory analysis, enabling stakeholder self-service reporting.
  • Developed data workflows to extract, transform, and load data from multiple heterogeneous sources into a structured data warehouse.

Data Science Trainee

LoveForData
On-site (Karachi - Pakistan)

Oct 2018 - Mar 2019
( 6 months )

  • Automated data extraction from 20+ external websites daily.
  • Developed an NLP pipeline supporting sentiment analysis, text summarization, and question answering using Python-based NLP libraries.

Data Science Intern

LoveForData
On-site (Karachi - Pakistan)

Aug 2018 - Sep 2018
( 2 months )

  • Collected, cleaned, and structured datasets.

Certifications

Fundamentals of Visualization with Tableau

Fundamentals of Visualization with Tableau

Essential Design Principles for Tableau

Essential Design Principles for Tableau

Visual Analytics with Tableau

Visual Analytics with Tableau

Introduction to Big Data with Spark and Hadoop

Introduction to Big Data with Spark and Hadoop

Python for Data Science, AI & Development

Python for Data Science, AI & Development

Python Project for Data Engineering

Python Project for Data Engineering

Databases and SQL for Data Science with Python

Databases and SQL for Data Science with Python

Hands-on Introduction to Linux Commands and Shell Scripting

Hands-on Introduction to Linux Commands and Shell Scripting

Relational Database Administration (DBA)

Relational Database Administration (DBA)

ETL and Data Pipelines with Shell, Airflow and Kafka

ETL and Data Pipelines with Shell, Airflow and Kafka

Getting Started with Data Warehousing and BI Analytics

Getting Started with Data Warehousing and BI Analytics

Introduction to NoSQL Databases

Introduction to NoSQL Databases

Introduction to Statistics

Introduction to Statistics

How Google does Machine Learning

How Google does Machine Learning

Google Cloud Big Data and Machine Learning Fundamentals

Google Cloud Big Data and Machine Learning Fundamentals

Introduction to Data Engineering

Introduction to Data Engineering

Introduction to Relational Databases (RDBMS)

Introduction to Relational Databases (RDBMS)

Google Data Analytics (Professional Certificate)

Google Data Analytics (Professional Certificate)

Foundations: Data, Data, Everywhere

Foundations: Data, Data, Everywhere

Share Data Through the Art of Visualization

Share Data Through the Art of Visualization

Ask Questions to Make Data-Driven Decisions

Ask Questions to Make Data-Driven Decisions

Prepare Data for Exploration

Prepare Data for Exploration

Data Analysis with R Programming

Data Analysis with R Programming

Google Data Analytics Capstone: Complete a Case Study

Google Data Analytics Capstone: Complete a Case Study

Process Data from Dirty to Clean

Process Data from Dirty to Clean

Python Project: pillow, tesseract, and opencv

Python Project: pillow, tesseract, and opencv

Python 3 Programming (Specialization)

Python 3 Programming (Specialization)

Introduction to Probability and Data with R

Introduction to Probability and Data with R

Using Python to Access Web Data

Using Python to Access Web Data

Using Databases with Python

Using Databases with Python

Capstone: Retrieving, Processing, and Visualizing Data with Python

Capstone: Retrieving, Processing, and Visualizing Data with Python

Python

Python

Data Collection and Processing with Python

Data Collection and Processing with Python

Python Functions, Files, and Dictionaries

Python Functions, Files, and Dictionaries

Python Classes and Inheritance

Python Classes and Inheritance

Python for Everybody (Specialization)

Python for Everybody (Specialization)

Programming for Everybody (Getting Started with Python)

Programming for Everybody (Getting Started with Python)

Python Data Structures

Python Data Structures

Linux for Developers

Linux for Developers

Linux Server Management and Security

Linux Server Management and Security

Analyzing Police Activity with pandas

Analyzing Police Activity with pandas

Introduction to Applied Machine Learning

Introduction to Applied Machine Learning

Machine Learning Algorithms: Supervised Learning Tip to Tail

Machine Learning Algorithms: Supervised Learning Tip to Tail

Data for Machine Learning

Data for Machine Learning

R Programming

R Programming

Exploratory Data Analysis

Exploratory Data Analysis

Getting and Cleaning Data

Getting and Cleaning Data

The Data Scientist's Toolbox

The Data Scientist's Toolbox

Excel Skills for Business: Essentials

Excel Skills for Business: Essentials

Excel Skills for Business: Intermediate I

Excel Skills for Business: Intermediate I

Introduction to Data Science in Python

Introduction to Data Science in Python

Learn to Program: The Fundamentals

Learn to Program: The Fundamentals

Skill assessments

SQL (Basic)

SQL (Basic)

SQL (Intermediate)

SQL (Intermediate)

SQL (Advanced)

SQL (Advanced)

Python (Basic)

Python (Basic)

Software Engineer

Software Engineer

Software Engineer

Problem Solving (Basic)

Projects

Let's Connect

Location

Islamabad, Pakistan