Anh Hoang Chu

Software Engineer at Microsoft

Location: Redmond, Washington, United States
E-mail: anhhchu12@gmail.com
Website: Website
Github: Github
LinkedIn: LinkedIn

I'm a Software Engineer who is passionate about working with data and bringing data insights closer to business users with the help of technology. I have experience in data engineering, big data, data science, data warehousing, and back-end databases for web applications on GCP, Azure, and AWS. My tech stack includes Python, SQL, Linux, PySpark, Kafka, Airflow, Tableau, Kubernetes, BigQuery, Redshift, and Azure Synapse Analytics.

Work

Microsoft

Software Engineer

Feb 2022 - Present

Software Engineer building, configuring, and managing back-end infrastructure on Azure and AWS for Flipgrid, a video-powered social learning platform

  • Ensure a highly available and performant application by managing Azure and AWS cloud services, including Kubernetes, storage, databases, data warehouse, content delivery network, monitoring, elastic search, and caching
  • Build and maintain a data lakehouse platform using Kafka streaming service, Azure PostgreSQL Citus Hyperscale, Azure Event Hubs, and Azure Synapse Analytics

Walmart Technology

Software Engineer

Jan 2020 - Feb 2022

Software Engineer building an end-to-end analytical Supply Chain web application to track inventory and transportation from Suppliers to Stores for international markets

  • Built and maintained an ETL data pipeline to calculate KPIs and load analytical datasets into MSSQL from multiple data sources (Teradata, BigQuery, Oracle Database, Informix Database, and flat files) using Linux shell scripts and PySpark
  • Led a team of 4 developers in the enterprise-wide effort to migrate the on-prem data warehouse (Teradata) to Google Cloud Platform for 10 markets using BigQuery, Dataproc, PySpark, and Airflow as a Service on Astronomer

NTT Data Services

Tableau Systems Analyst

Oct 2017 - Jan 2020

Led a team of 2 Tableau developers in gathering requirements and delivering analytical projects that provide data democratization to the healthcare account IT service team

  • Designed and distributed ~50 operations and financial KPI reports to executives and leaders using Excel, Tableau, SQL Server, and Alteryx, reducing outstanding IT tickets by 70%
  • Reduced time to deliver data insights to the operations team by 90% with a new reporting process that converts existing ad-hoc Excel reports into automated, interactive dashboards in Tableau

E2Open, Inc

Business Analyst Intern

Mar 2017 - Aug 2017

Worked with the Director of Business Value Delivery to develop financial and supply chain KPI dashboards to leverage product offerings to potential and existing clients

  • Collected, cleaned, and prepared data from the financial reports of 200 companies to calculate 20 different business and supply chain KPIs, ranging from profitability to efficiency indicators: Profit Margin, Cash Conversion Cycle, DIO (Days Inventory Outstanding), DSO (Days Sales Outstanding), DPO (Days Payable Outstanding)

Education

Harrisburg University

Master in Computer Science-Scientific Computing

Aug 2020 - Present

Udacity

Certification in Data Scientist NanoDegree

Aug 2019 - Feb 2020

University of Texas at Dallas

Master in Supply Chain Management

Aug 2015 - Aug 2017

Udacity

Certification in Data Analyst NanoDegree

Jul 2015 - Jan 2016

Skills

Data Engineering

PySpark, Kafka, Airflow, BigQuery, Redshift, Azure Synapse, Dataproc, Teradata, MSSQL

DevOps

Linux, Kubernetes, Azure, AWS, GCP

Data Analytics/Data Science

Python, Tableau, Machine Learning, NumPy, pandas, scikit-learn, PyTorch, TensorFlow, Jupyter Notebook

Web Development

HTML, CSS, JavaScript, Ruby on Rails, Django

Awards

Excellence Award

Walmart | 2022

Summa Cum Laude Graduate

UT Dallas | 2017

Fourth-place APICS Terra Grande Competition

APICS Houston | 2017

Third-place Operations Competition

UT Dallas | 2016

First-place in 2 Supply Chain Case Competitions

UT Dallas | 2016

Best Sales & Services Award

HSBC Vietnam | 2014