Hello! I am

Amir Saleem

I'm a Data Scientist

Transforming raw data into actionable insights with Python, SQL & machine learning.

Islamabad, Pakistan

View Portfolio Experience Contact Me

Experience

Python Data Scientist

Turing
Remote (USA)

Nov 2023 - Mar 2026
(2 years, 5 months)

  • Conducted comprehensive data gathering and analysis, leveraging EDA, statistics, and machine learning techniques to derive actionable insights.
  • Formulated complex queries to drive deeper understanding across various domains, enhancing data-driven decision-making.
  • Developed and implemented robust code solutions, streamlining processes and improving efficiency.
  • Collaborated with LLM to identify performance issues, providing constructive feedback to enhance model accuracy.
  • Partnered with cross-functional teams to validate results and optimize model training, ensuring high-quality outcomes.
  • Evaluated LLM-generated code against stringent rubrics, contributing to code quality and reliability in production environments.
  • Engineered "bug-planting" scenarios, enhancing testing protocols and fostering a culture of continuous improvement.

Data Engineer ( Freelancer )

Medusa
Remote (USA)

Jun 2023 - Feb 2024
(9 months)

  • Developed and optimized ETL processes leveraging Redshift, Athena, and MySQL to enhance data accessibility and analysis.
  • Integrated CI/CD pipelines using Bitbucket to streamline deployment workflows, improving efficiency and reducing errors.
  • Automated code deployment by executing pushes that trigger Bitbucket pipelines, ensuring timely updates and consistency.
  • Oversaw Docker image deployments to AWS, utilizing CloudWatch for performance monitoring and resource optimization.
  • Executed reverse ETL processes to facilitate data extraction and transformation using GraphQL APIs, enhancing data usability for analytics.
  • Collaborated with cross-functional teams to translate complex data sets into actionable insights, driving data-informed decision-making.

Data Scientist

LoveForData
On-site (Karachi - Pakistan)

Jan 2023 - Nov 2023
(11 months)

  • ETL Development: Designed and optimized ETL processes to efficiently extract, transform, and load data from diverse sources into a robust data warehouse, enhancing data accessibility.
  • Advanced Data Analysis: Conducted in-depth data analysis using Python and SQL, uncovering actionable insights and trends that informed strategic decision-making.
  • Predictive Modeling: Engineered and deployed predictive models, leveraging machine learning techniques to forecast outcomes and drive business growth.
  • Data Visualization: Created compelling visualizations to communicate findings effectively to stakeholders, facilitating data-driven discussions.
  • Collaboration: Partnered with cross-functional teams to align data initiatives with business objectives, ensuring a comprehensive approach to data management.

Data Analyst

LoveForData
On-site (Karachi - Pakistan)

Jan 2021 - Dec 2022
( 2 years )

  • Django Development: Developed and maintained robust Django applications for enhanced data visualization and analysis, improving user engagement and decision-making.
  • Predictive Modeling: Designed and implemented predictive models that accurately forecasted future trends, leading to data-driven strategic decisions.
  • Advanced Data Analysis: Conducted in-depth data analysis using Python and SQL, uncovering critical patterns and trends that informed business strategies.
  • Market Basket Analysis: Utilized market basket analysis techniques to derive actionable insights, optimizing restaurant menu offerings and marketing strategies to enhance customer satisfaction and retention.
  • Data-Driven Solutions: Collaborated with cross-functional teams to deliver data-driven solutions that supported operational efficiency and business growth.

Junior Data Analyst

LoveForData
On-site (Karachi - Pakistan)

Apr 2019 - Dec 2020
( 1 year, 9 months )

  • Data Quality Improvement: Conducted comprehensive data audits to pinpoint and rectify inaccuracies, enhancing clients' understanding of their data integrity challenges.
  • Data Visualization: Created an interactive dashboard utilizing R-Shiny, enabling stakeholders to visualize and analyze complex datasets effectively.
  • ETL Development: Engineered robust ETL processes to extract, transform, and load data from diverse sources into a centralized data warehouse, streamlining data accessibility and reporting.
  • Analytical Skills: Leveraged statistical analysis and data modeling techniques to derive actionable insights, supporting data-driven decision-making.
  • Collaboration: Worked closely with cross-functional teams to align data strategies with business objectives, fostering a culture of data literacy.

Data Science Trainee

LoveForData
On-site (Karachi - Pakistan)

Oct 2018 - Mar 2019
( 6 months )

  • Web-Scraping: Engineered a robust web scraping application to efficiently extract and preprocess data from diverse online sources, enhancing data availability for analysis.
  • NLP: Designed and implemented an advanced NLP model for sentiment analysis, text summarization, and question answering, improving insights and decision-making capabilities.
  • Data Analysis: Conducted exploratory data analysis (EDA) to uncover trends and patterns, facilitating data-driven strategies.
  • Data Engineering: Collaborated on data pipeline development to ensure seamless data flow and integration for analytics projects.
  • Programming: Utilized Python and libraries such as Beautiful Soup, Pandas, and NLTK to optimize data extraction and analysis processes.

Data Science Intern

LoveForData
On-site (Karachi - Pakistan)

Aug 2018 - Sep 2018
( 2 months )

  • Assisted in data collection, cleaning, and organization.
  • Gained exposure to various data analysis tools and techniques.

Certifications

SQL

hackerrank
Issued: June 2023

Verify

SQL

hackerrank
Issued: June 2023

Verify

Fundamentals of Visualization with Tableau

University of California, Davis
Issued: May 2023

Verify

Essential Design Principles for Tableau

University of California, Davis
Issued: May 2023

Verify

Visual Analytics with Tableau

University of California, Davis
Issued: May 2023

Verify

Python

Hackerrank
Issued: Feb 2023

Verify

Introduction to Big Data with Spark and Hadoop

IBM Skills Network
Issued: Dec 2022

Verify

Python for Data Science, AI & Development

IBM Skills Network
Issued: Nov 2022

Verify

Python Project for Data Engineering

IBM Skills Network
Issued: Nov 2022

Verify

Databases and SQL for Data Science with Python

IBM Skills Network
Issued: Nov 2022

Verify

Hands-on Introduction to Linux Commands and Shell Scripting

IBM Skills Network
Issued: Nov 2022

Verify

Relational Database Administration (DBA)

IBM Skills Network
Issued: Nov 2022

Verify

ETL and Data Pipelines with Shell, Airflow and Kafka

IBM Skills Network
Issued: Nov 2022

Verify

Getting Started with Data Warehousing and BI Analytics

IBM Skills Network
Issued: Nov 2022

Verify

Introduction to NoSQL Databases

IBM Skills Network
Issued: Nov 2022

Verify

Introduction to Statistics

Stanfor University
Issued: Oct 2022

Verify

How Google does Machine Learning

Google Cloud
Issued: Oct 2022

Verify

Google Cloud Big Data and Machine Learning Fundamentals

Google Cloud
Issued: Oct 2022

Verify

Introduction to Data Engineering

IBM Skills Network
Issued: Oct 2022

Verify

Introduction to Relational Databases (RDBMS)

IBM Skills Network
Issued: Oct 2022

Verify

Google Data Analytics (Professional Certificate)

Google
Issued: Sep 2022

Verify

Foundations: Data, Data, Everywhere

Google
Issued: Aug 2022

Verify

Share Data Through the Art of Visualization

Google
Issued: Aug 2022

Verify

Ask Questions to Make Data-Driven Decisions

Google
Issued: Aug 2022

Verify

Prepare Data for Exploration

Google
Issued: Aug 2022

Verify

Data Analysis with R Programming

Google
Issued: Aug 2022

Verify

Google Data Analytics Capstone: Complete a Case Study

Google
Issued: Aug 2022

Verify

Process Data from Dirty to Clean

Google
Issued: Aug 2022

Verify

Python Project: pillow, tesseract, and opencv

University of Michigan
Issued: Oct 2021

Verify

Python 3 Programming (Specialization)

University of Michigan
Issued: Oct 2021

Verify

Introduction to Probability and Data with R

Duke University
Issued: Oct 2021

Verify

Using Python to Access Web Data

University of Michigan
Issued: Sep 2021

Verify

Using Databases with Python

University of Michigan
Issued: Sep 2021

Verify

Capstone: Retrieving, Processing, and Visualizing Data with Python

University of Michigan
Issued: Sep 2021

Verify

Python

University of Michigan
Issued: Sep 2021

Verify

Data Collection and Processing with Python

University of Michigan
Issued: Sep 2021

Verify

Python Functions, Files, and Dictionaries

University of Michigan
Issued: Sep 2021

Verify

Python Classes and Inheritance

University of Michigan
Issued: Sep 2021

Verify

Python for Everybody (Specialization)

University of Michigan
Issued: Sep 2021

Verify

Programming for Everybody (Getting Started with Python)

University of Michigan
Issued: Aug 2021

Verify

Python Data Structures

University of Michigan
Issued: Aug 2021

Verify

Linux for Developers

The Linux Foundation
Issued: Nov 2020

Verify

Linux Server Management and Security

University of Colorado System
Issued: Oct 2020

Verify

Analyzing Police Activity with pandas

datacamp
Issued: Sep 2020

Verify

Introduction to Applied Machine Learning

Alberta Machine Intelligence Intitute
Issued: Jan 2020

Verify

Machine Learning Algorithms: Supervised Learning Tip to Tail

Alberta Machine Intelligence Intitute
Issued: Jan 2020

Verify

Data for Machine Learning

Alberta Machine Intelligence Intitute
Issued: Jan 2020

Verify

R Programming

John Hopkins University
Issued: Aug 2019

Verify

Exploratory Data Analysis

John Hopkins University
Issued: Aug 2019

Verify

Getting and Cleaning Data

John Hopkins University
Issued: Jul 2019

Verify

The Data Scientist’s Toolbox

John Hopkins University
Issued: Jun 2019

Verify

Excel Skills for Business: Essentials

Macquarie University
Issued: Aug 2018

Verify

Excel Skills for Business: Intermediate I

Macquarie University
Issued: Aug 2018

Verify

Introduction to Data Science in Python

University of Michigan
Issued: Apr 2018

Verify

Learn to Program: The Fundamentals

University of Toronto
Issued: Dec 2017

Verify

Skills

  • Data Science: Experience in data analysis, data mining, modeling, and deploying data pipelines.
  • Machine Learning: Developing predictive models and implementing machine learning algorithms.
  • LLM Development & Collaboration: Formulating complex questions (EDA, statistics, ML, logic), developing code solutions with LLMs, and collaborating to improve model performance through feedback and validation.
  • AI / LLM Evaluation: Evaluating LLM-generated code with rubrics, designing bug-planting scenarios in production codebases, and benchmarking debugging capabilities (e.g. SWE-Bench).
  • Data Synthesis for AI: Generating and curating large-scale datasets (e.g. 100k+ visualizations) to enhance LLM proficiency in data visualization and analysis.
  • Data Analysis: Gathering, cleaning, and analyzing data to extract insights and support decision-making.
  • Python Programming: Proficient in Python for data analysis, scripting, and automation tasks.
  • R Programming: Good in R for data analysis, and ML modeling.
  • SQL: Skilled in querying and managing relational databases.
  • ETL Processes: Designing and implementing Extract, Transform, Load processes for data integration.
  • Linux: Experience working in Linux environments, including scripting and system management.
  • Django: Developing web applications using the Django framework.
  • Bash Scripting: Automating tasks and managing systems using Bash scripts.
  • Data Visualization: Creating visual representations of data to communicate insights effectively.
  • Web Scraping: Extracting data from websites using tools like BeautifulSoup.
  • Cloud Computing: Utilizing cloud services, including Azure Functions, for scalable computing solutions.
  • Version Control: Managing code versions and collaboration using Git.
  • Projects

    Let's Connect

    Location

    Islamabad, Pakistan