Biography

Zeynep Tugce Sahan is a Data Engineer at enFocus Inc. She is working cross-functionally to manage ETL processes, make improvements, code, test, debug, and maintain software.

Interests

  • Data Analytics
  • Decision Analysis
  • Data Visualization
  • Risk Communication

Education

  • MS in Industrial Engineering, 2020

    Purdue University

  • BSc in Industrial Engineering, 2017

    Antalya Bilim University

  • BSc in Computer Engineering, 2017

    Antalya Bilim University

Skills

R

Python

SQL

Data Visualization

Database Systems

Analytics

Experience

 
 
 
 
 

Data Engineer

enFocus Inc.

Feb 2021 – Present South Bend, IN
Responsibilities:

  • Build a data pipeline to monitor states of scheduled tasks and report results using Python, Sharepoint, Power Automate, and Azure DevOps.
  • Deploy an ETL pipeline on an Amazon EMR cluster that extracts data from S3, processes them using Spark, and loads the data back into S3 as a set of dimensional tables.
  • Manage stored procedures that pull in data from linked servers, make comparisons, and update the data in the local server.
  • Use various APIs to automate ETL processes from external applications to SQL Server, and vice versa.
  • Develop a custom Python module that tracks script failures, enables logging, and sends details in an email.
  • Create SSIS packages that utilize SQL and Python to identify voided invoices, increase collection rates, and save over $50K annually.
  • Work closely with a cross-functional team to troubleshoot issues and maintain automated Python and SQL scripts.
  • Leverage Git and Git repositories for version control, code reviews, and maintaining documentation.
 
 
 
 
 

Business Intelligence Developer Intern

enFocus Inc.

May 2020 – Aug 2020 South Bend, IN
Responsibilities:

  • Automated processes using SSIS, Python, T-SQL, and SQL Server to eliminate ~18 hours of manual work per week.
  • Prepared interactive dashboards using MS Power BI to visualize data for technical and non-technical audiences.
  • Designed custom reports that automatically update with user input in SAP Crystal Reports to save ~4 person-hours per month.
 
 
 
 
 

Data Analyst Intern

Purdue University Data Analytics and Information Office

Jun 2019 – Aug 2019 West Lafayette, IN
Responsibilities:

  • Designed a relational database to combine data from multiple data sources.
  • Wrote queries for extracting useful information and providing insights.
  • Created a user interface for data updates and anomaly detection using VBA.
 
 
 
 
 

Graduate Research Assistant

Purdue University

Aug 2017 – May 2020 West Lafayette, IN
Responsibilities:

  • Developed a decision support tool to help homeowners make better decisions about managing flood risks to their properties using JavaScript, HTML, CSS, ArcGIS API, and Google Maps API.
  • Performed data migration from PostgreSQL to SQL Server to meet stakeholder needs.
  • Combined data from SQL Server and ArcGIS Image Server; and analyzed it using Python.
  • Communicated complex quantitative analysis in a clear, precise, and actionable manner.
  • Collaborated with CPRA and USGS to make the product available to 2.3 million coastal Louisiana residents.
 
 
 
 
 

Software QA Intern

Interact.io Cloud Solutions GmbH

Jun 2015 – Sep 2015 Berlin, Germany
Responsibilities:

  • Coordinated API tests for a CRM platform using the Runscope API Monitoring tool.
  • Prepared API documentation of a CRM platform with examples of HTTP requests and JSON-format data.
  • Demonstrated strong organizational skills with attention to detail in a fast-paced work environment.

Accomplish­ments

Data Engineering Nanodegree

Relational and NoSQL data models, creating scalable and efficient data warehouses, working efficiently with massive datasets, building and interacting with a cloud-based data lake, automating and monitoring data pipelines, developing proficiency in Spark, Airflow, and AWS tools.

Data Science A-Z™

Cleaning and preparing data for analysis, performing visualizations and data mining in Tableau, modelling and curve-fitting data, presenting findings for audience.

Big Data Workshop

Fundamentals of Hadoop and Spark

UNIX 101

Introduction to Unix-based high-performance computing systems

Projects

Landlord Registration Dashboard

A Power BI dashboard to visualize data from Landlord Registration System.

Scheduled Tasks Monitoring in DevOps

An ETL pipeline to streamline scheduled tasks in Windows Task Scheduler and monitor their states in Azure DevOps.

Homeowner-Level Decision Support System for Mitigating Coastal Flood Risk in Louisiana

When developing policies for structure-level flood risk mitigation measures such as elevating home foundations, one of the first …

HR Analytics - Employee Turnover Prediction

A dashboard to predict turnover probability of an employee based on his/her characteristics using R.

Heuristics to Solve Multiple Objective NP-Hard Aircraft Gate Scheduling Problem

Evaluation of various scheduling methods for assigning flights to airport gates to minimize walking distance for the passengers while …

Airlines Revenue Management Decision Support Tool

An interface for revenue management including a mathematical model to maximize profit by finding number of seats to be allocated for …

Recent & Upcoming Talks

Digital Tools to Promote Nonstructural Mitigation

Contact

  • Nashville, TN