Hello! I am

Amir Saleem

I'm a Data Scientist

Transforming raw data into actionable insights with Python, SQL & machine learning.

Islamabad, Pakistan

Experience

Python Data Scientist

Turing
Remote (USA)

Nov 2023 - Mar 2026
(2 years, 5 months)

  • LLM Development & Data Collaboration: Gathered and analyzed data, formulated complex questions across EDA, statistics, ML, logic, and open-ended domains, and developed code solutions. Collaborated with the LLM to enhance performance by identifying errors and providing feedback, and worked with colleagues to validate results and refine model training.
  • AI Evaluation & SWE-Bench: Evaluated LLM-generated code using rigorous rubrics and engineered "bug-planting" scenarios in production-grade codebases to benchmark debugging capabilities.
  • Data Synthesis & Visualization: Collaborated with a team to generate a dataset of 100,000+ plots, enhancing LLM proficiency in producing accurate data visualizations.
  • Distributed Collaboration: Contributed to high-impact projects within large-scale, cross-functional remote teams.

Data Engineer (Freelancer)

Medusa
Remote (USA)

Jun 2023 - Feb 2024
(9 months)

  • Implemented ETL processes for Redshift, Athena, and MySQL using AWS Lambda functions running in Docker containers.
  • Integrated with existing Bitbucket CI/CD pipelines to streamline code deployment.
  • Executed code pushes to trigger Bitbucket pipelines for automated builds.
  • Managed Docker image deployments to AWS and monitored performance via CloudWatch.
  • Implemented reverse ETL processes for Monday.com utilizing GraphQL with their API.

Data Scientist

LoveForData
On-site (Karachi - Pakistan)

Jan 2023 - Nov 2023
(11 months)

  • Conducted data analysis, data mining, and modeling to extract insights and support decision-making processes.
  • Collaborated with cross-functional teams to define project objectives and deliver actionable recommendations.
  • Designed and deployed data pipelines and workflows to process and analyze datasets.

Data Analyst

LoveForData
On-site (Karachi - Pakistan)

Jan 2021 - Dec 2022
(2 years)

  • Gathered, cleansed, and analyzed data from various sources to identify patterns and trends.
  • Conducted statistical analysis and hypothesis testing to derive meaningful conclusions.
  • Collaborated with team members to develop data-driven strategies and recommendations.
  • Assisted in the development and maintenance of data analytics tools and infrastructure.

Junior Data Analyst

LoveForData
On-site (Karachi - Pakistan)

Apr 2019 - Dec 2020
(1 year, 9 months)

  • Assisted in data collection, cleaning, and validation processes.
  • Conducted data analysis and generated descriptive reports.
  • Supported senior analysts in data visualization and report creation.
  • Collaborated with team members to enhance data quality and accuracy.
  • Assisted in the implementation and maintenance of data management systems.

Data Science Trainee

LoveForData
On-site (Karachi - Pakistan)

Oct 2018 - Mar 2019
(6 months)

  • Participated in hands-on training in data analysis, machine learning, and data visualization.
  • Assisted in data preprocessing and feature engineering tasks.
  • Contributed to exploratory data analysis and insights generation.
  • Learned and applied best practices in data science and analytics.

Data Science Intern

LoveForData
On-site (Karachi - Pakistan)

Aug 2018 - Sep 2018
(2 months)

  • Assisted in data collection, cleaning, and organization.
  • Gained exposure to various data analysis tools and techniques.

Certifications

SQL

HackerRank
Issued: June 2023

Fundamentals of Visualization with Tableau

University of California, Davis
Issued: May 2023

Essential Design Principles for Tableau

University of California, Davis
Issued: May 2023

Visual Analytics with Tableau

University of California, Davis
Issued: May 2023

Python

HackerRank
Issued: Feb 2023

Introduction to Big Data with Spark and Hadoop

IBM Skills Network
Issued: Dec 2022

Python for Data Science, AI & Development

IBM Skills Network
Issued: Nov 2022

Python Project for Data Engineering

IBM Skills Network
Issued: Nov 2022

Databases and SQL for Data Science with Python

IBM Skills Network
Issued: Nov 2022

Hands-on Introduction to Linux Commands and Shell Scripting

IBM Skills Network
Issued: Nov 2022

Relational Database Administration (DBA)

IBM Skills Network
Issued: Nov 2022

ETL and Data Pipelines with Shell, Airflow and Kafka

IBM Skills Network
Issued: Nov 2022

Getting Started with Data Warehousing and BI Analytics

IBM Skills Network
Issued: Nov 2022

Introduction to NoSQL Databases

IBM Skills Network
Issued: Nov 2022

Introduction to Statistics

Stanford University
Issued: Oct 2022

How Google does Machine Learning

Google Cloud
Issued: Oct 2022

Google Cloud Big Data and Machine Learning Fundamentals

Google Cloud
Issued: Oct 2022

Introduction to Data Engineering

IBM Skills Network
Issued: Oct 2022

Introduction to Relational Databases (RDBMS)

IBM Skills Network
Issued: Oct 2022

Google Data Analytics (Professional Certificate)

Google
Issued: Sep 2022

Foundations: Data, Data, Everywhere

Google
Issued: Aug 2022

Share Data Through the Art of Visualization

Google
Issued: Aug 2022

Ask Questions to Make Data-Driven Decisions

Google
Issued: Aug 2022

Prepare Data for Exploration

Google
Issued: Aug 2022

Data Analysis with R Programming

Google
Issued: Aug 2022

Google Data Analytics Capstone: Complete a Case Study

Google
Issued: Aug 2022

Process Data from Dirty to Clean

Google
Issued: Aug 2022

Python Project: pillow, tesseract, and opencv

University of Michigan
Issued: Oct 2021

Python 3 Programming (Specialization)

University of Michigan
Issued: Oct 2021

Introduction to Probability and Data with R

Duke University
Issued: Oct 2021

Using Python to Access Web Data

University of Michigan
Issued: Sep 2021

Using Databases with Python

University of Michigan
Issued: Sep 2021

Capstone: Retrieving, Processing, and Visualizing Data with Python

University of Michigan
Issued: Sep 2021

Python

University of Michigan
Issued: Sep 2021

Data Collection and Processing with Python

University of Michigan
Issued: Sep 2021

Python Functions, Files, and Dictionaries

University of Michigan
Issued: Sep 2021

Python Classes and Inheritance

University of Michigan
Issued: Sep 2021

Python for Everybody (Specialization)

University of Michigan
Issued: Sep 2021

Programming for Everybody (Getting Started with Python)

University of Michigan
Issued: Aug 2021

Python Data Structures

University of Michigan
Issued: Aug 2021

Linux for Developers

The Linux Foundation
Issued: Nov 2020

Linux Server Management and Security

University of Colorado System
Issued: Oct 2020

Analyzing Police Activity with pandas

DataCamp
Issued: Sep 2020

Introduction to Applied Machine Learning

Alberta Machine Intelligence Institute
Issued: Jan 2020

Machine Learning Algorithms: Supervised Learning Tip to Tail

Alberta Machine Intelligence Institute
Issued: Jan 2020

Data for Machine Learning

Alberta Machine Intelligence Institute
Issued: Jan 2020

R Programming

Johns Hopkins University
Issued: Aug 2019

Exploratory Data Analysis

Johns Hopkins University
Issued: Aug 2019

Getting and Cleaning Data

Johns Hopkins University
Issued: Jul 2019

The Data Scientist’s Toolbox

Johns Hopkins University
Issued: Jun 2019

Excel Skills for Business: Essentials

Macquarie University
Issued: Aug 2018

Excel Skills for Business: Intermediate I

Macquarie University
Issued: Aug 2018

Introduction to Data Science in Python

University of Michigan
Issued: Apr 2018

Learn to Program: The Fundamentals

University of Toronto
Issued: Dec 2017


Skills

  • Data Science: Experience in data analysis, data mining, modeling, and deploying data pipelines.
  • Machine Learning: Developing predictive models and implementing machine learning algorithms.
  • LLM Development & Collaboration: Formulating complex questions (EDA, statistics, ML, logic), developing code solutions with LLMs, and collaborating to improve model performance through feedback and validation.
  • AI / LLM Evaluation: Evaluating LLM-generated code with rubrics, designing bug-planting scenarios in production codebases, and benchmarking debugging capabilities (e.g. SWE-Bench).
  • Data Synthesis for AI: Generating and curating large-scale datasets (e.g. 100k+ visualizations) to enhance LLM proficiency in data visualization and analysis.
  • Data Analysis: Gathering, cleaning, and analyzing data to extract insights and support decision-making.
  • Python Programming: Proficient in Python for data analysis, scripting, and automation tasks.
  • R Programming: Comfortable using R for data analysis and ML modeling.
  • SQL: Skilled in querying and managing relational databases.
  • ETL Processes: Designing and implementing Extract, Transform, Load processes for data integration.
  • Linux: Experience working in Linux environments, including scripting and system management.
  • Django: Developing web applications using the Django framework.
  • Bash Scripting: Automating tasks and managing systems using Bash scripts.
  • Data Visualization: Creating visual representations of data to communicate insights effectively.
  • Web Scraping: Extracting data from websites using tools like BeautifulSoup.
  • Cloud Computing: Utilizing cloud services, including Azure Functions, for scalable computing solutions.
  • Version Control: Managing code versions and collaboration using Git.