Amir Saleem

Data Scientist and Python Developer with 6+ years of experience in data analysis, data engineering, and machine learning. Proven ability to transform raw data into actionable insights using data wrangling, feature engineering, and predictive modeling. Proficient in Python and SQL. Passionate about solving complex problems using data-driven approaches. Certified in data analysis, programming, machine learning, and data engineering.

Experience

Python Data Scientist

Turing
Remote (USA)

Nov 2023 - Present

  • Currently working on an LLM with millions of users: Gather data, formulate complex questions, develop code solutions, and collaborate with the LLM to enhance its capabilities by identifying errors and providing feedback.
  • Engage in collaborative data analysis, independently explore datasets, and craft complex inquiries covering EDA, statistics, ML, logic, and open-ended questions.
  • Develop code solutions and collaborate with colleagues to compare and validate results. Identify and flag questions with code solutions to refine LLM training, enhancing accuracy and performance.

Data Engineer ( Freelancer )

Medusa
Remote (USA)

Jun 2023 - Feb 2024
(9 months)

  • Implemented ETL processes using Redshift, Athena, and MySQL on AWS Lambda functions within Docker containers.
  • Incorporated existing CI/CD pipelines via Bitbucket to enable smooth code deployment processes.
  • Executed code pushes to trigger Bitbucket pipelines for automated builds.
  • Managed Docker image deployments to AWS and monitored performance via CloudWatch.
  • Implemented reverse ETL processes for Monday.com utilizing GraphQL with their API.

Data Scientist

LoveForData
On-site (Karachi - Pakistan)

Jan 2023 - Nov 2023
(11 months)

  • Conducting data analysis, data mining, and modeling to extract insights and support decision-making processes.
  • Collaborating with cross-functional teams to define project objectives and deliver actionable recommendations.
  • Designing and deploying data pipelines and workflows to process and analyze datasets.

Data Analyst

LoveForData
On-site (Karachi - Pakistan)

Jan 2021 - Dec 2022
( 2 years )

  • Gathered, cleansed, and analyzed data from various sources to identify patterns and trends.
  • Conducted statistical analysis and hypothesis testing to derive meaningful conclusions.
  • Collaborated with team members to develop data-driven strategies and recommendations.
  • Assisted in the development and maintenance of data analytics tools and infrastructure.

Junior Data Analyst

LoveForData
On-site (Karachi - Pakistan)

Apr 2019 - Dec 2020
( 1 year, 9 months )

  • Assisted in data collection, cleaning, and validation processes.
  • Conducted basic data analysis and generated descriptive reports.
  • Supported senior analysts in data visualization and report creation.
  • Collaborated with team members to enhance data quality and accuracy.
  • Assisted in the implementation and maintenance of data management systems.

Data Science Trainee

LoveForData
On-site (Karachi - Pakistan)

Oct 2018 - Mar 2019
( 6 months )

  • Participated in hands-on training in data analysis, machine learning, and data visualization.
  • Assisted in data preprocessing and feature engineering tasks.
  • Contributed to exploratory data analysis and insights generation.
  • Learned and applied best practices in data science and analytics.

Data Science Intern

LoveForData
On-site (Karachi - Pakistan)

Aug 2018 - Sep 2018
( 2 months )

  • Assisted in data collection, cleaning, and organization.
  • Gained exposure to various data analysis tools and techniques.

Certifications

55- SQL (Intermeidate)

hackerrank
Issued: June 2023

Verify

54- SQL (Basic)

hackerrank
Issued: June 2023

Verify

53- Fundamentals of Visualization with Tableau

University of California, Davis
Issued: May 2023

Verify

52- Essential Design Principles for Tableau

University of California, Davis
Issued: May 2023

Verify

51- Visual Analytics with Tableau

University of California, Davis
Issued: May 2023

Verify

50- Python (Basic)

Hackerrank
Issued: Feb 2023

Verify

49- Introduction to Big Data with Spark and Hadoop

IBM Skills Network
Issued: Dec 2022

Verify

48- Python for Data Science, AI & Development

IBM Skills Network
Issued: Nov 2022

Verify

47- Python Project for Data Engineering

IBM Skills Network
Issued: Nov 2022

Verify

46- Databases and SQL for Data Science with Python

IBM Skills Network
Issued: Nov 2022

Verify

45- Hands-on Introduction to Linux Commands and Shell Scripting

IBM Skills Network
Issued: Nov 2022

Verify

44- Relational Database Administration (DBA)

IBM Skills Network
Issued: Nov 2022

Verify

43- ETL and Data Pipelines with Shell, Airflow and Kafka

IBM Skills Network
Issued: Nov 2022

Verify

42- Getting Started with Data Warehousing and BI Analytics

IBM Skills Network
Issued: Nov 2022

Verify

41- Introduction to NoSQL Databases

IBM Skills Network
Issued: Nov 2022

Verify

40- Introduction to Statistics

Stanfor University
Issued: Oct 2022

Verify

39- How Google does Machine Learning

Google Cloud
Issued: Oct 2022

Verify

38- Google Cloud Big Data and Machine Learning Fundamentals

Google Cloud
Issued: Oct 2022

Verify

37- Introduction to Data Engineering

IBM Skills Network
Issued: Oct 2022

Verify

36- Introduction to Relational Databases (RDBMS)

IBM Skills Network
Issued: Oct 2022

Verify

35- Google Data Analytics (Professional Certificate)

Google
Issued: Sep 2022

Verify

34- Foundations: Data, Data, Everywhere

Google
Issued: Aug 2022

Verify

33- Share Data Through the Art of Visualization

Google
Issued: Aug 2022

Verify

32- Ask Questions to Make Data-Driven Decisions

Google
Issued: Aug 2022

Verify

31- Prepare Data for Exploration

Google
Issued: Aug 2022

Verify

30- Data Analysis with R Programming

Google
Issued: Aug 2022

Verify

29- Google Data Analytics Capstone: Complete a Case Study

Google
Issued: Aug 2022

Verify

28- Process Data from Dirty to Clean

Google
Issued: Aug 2022

Verify

27- Python Project: pillow, tesseract, and opencv

University of Michigan
Issued: Oct 2021

Verify

26- Python 3 Programming (Specialization)

University of Michigan
Issued: Oct 2021

Verify

25- Introduction to Probability and Data with R

Duke University
Issued: Oct 2021

Verify

24- Using Python to Access Web Data

University of Michigan
Issued: Sep 2021

Verify

23- Using Databases with Python

University of Michigan
Issued: Sep 2021

Verify

22- Capstone: Retrieving, Processing, and Visualizing Data with Python

University of Michigan
Issued: Sep 2021

Verify

21- Python Basics

University of Michigan
Issued: Sep 2021

Verify

20- Data Collection and Processing with Python

University of Michigan
Issued: Sep 2021

Verify

19- Python Functions, Files, and Dictionaries

University of Michigan
Issued: Sep 2021

Verify

18- Python Classes and Inheritance

University of Michigan
Issued: Sep 2021

Verify

17- Python for Everybody (Specialization)

University of Michigan
Issued: Sep 2021

Verify

16- Programming for Everybody (Getting Started with Python)

University of Michigan
Issued: Aug 2021

Verify

15- Python Data Structures

University of Michigan
Issued: Aug 2021

Verify

14- Linux for Developers

The Linux Foundation
Issued: Nov 2020

Verify

13- Linux Server Management and Security

University of Colorado System
Issued: Oct 2020

Verify

12- Analyzing Police Activity with pandas

datacamp
Issued: Sep 2020

Verify

11- Introduction to Applied Machine Learning

Alberta Machine Intelligence Intitute
Issued: Jan 2020

Verify

10- Machine Learning Algorithms: Supervised Learning Tip to Tail

Alberta Machine Intelligence Intitute
Issued: Jan 2020

Verify

9- Data for Machine Learning

Alberta Machine Intelligence Intitute
Issued: Jan 2020

Verify

8- R Programming

John Hopkins University
Issued: Aug 2019

Verify

7- Exploratory Data Analysis

John Hopkins University
Issued: Aug 2019

Verify

6- Getting and Cleaning Data

John Hopkins University
Issued: Jul 2019

Verify

5- The Data Scientist’s Toolbox

John Hopkins University
Issued: Jun 2019

Verify

4- Excel Skills for Business: Essentials

Macquarie University
Issued: Aug 2018

Verify

3- Excel Skills for Business: Intermediate I

Macquarie University
Issued: Aug 2018

Verify

2- Introduction to Data Science in Python

University of Michigan
Issued: Apr 2018

Verify

1- Learn to Program: The Fundamentals

University of Toronto
Issued: Dec 2017

Verify

Skills

  • Data Science: Experience in data analysis, data mining, modeling, and deploying data pipelines.
  • Machine Learning: Developing predictive models and implementing machine learning algorithms.
  • Data Analysis: Gathering, cleaning, and analyzing data to extract insights and support decision-making.
  • Python Programming: Proficient in Python for data analysis, scripting, and automation tasks.
  • R Programming: Good in R for data analysis, and ML modeling.
  • SQL: Skilled in querying and managing relational databases.
  • ETL Processes: Designing and implementing Extract, Transform, Load processes for data integration.
  • Linux: Experience working in Linux environments, including scripting and system management.
  • Django: Developing web applications using the Django framework.
  • Bash Scripting: Automating tasks and managing systems using Bash scripts.
  • Data Visualization: Creating visual representations of data to communicate insights effectively.
  • Web Scraping: Extracting data from websites using tools like BeautifulSoup.
  • Cloud Computing: Utilizing cloud services, including Azure Functions, for scalable computing solutions.
  • Version Control: Managing code versions and collaboration using Git.
  • Professional Projects

    Coding Profiles

    Let's Connect