Company
Mastercard
Description
Our Purpose
- Work closely with business owners to understand business requirements and the performance metrics for data quality and model performance of customer-facing products
- Work with multiple disparate data sources and storage systems, and build processes and pipelines that provide cohesive datasets for analysis and modeling
- Build, maintain, and optimize data pipelines for model building and model performance evaluation
- Develop, test, and evaluate modern machine learning and A.I. models
- Oversee implementation of models
- Evaluate production models based on business metrics to drive continuous improvement
- Data engineering experience
- Experience with SQL and one or more of the following data technologies: PostgreSQL, Hadoop, Netezza, Spark
- Good knowledge of the Linux / Bash environment
- Python and one of the following machine learning libraries:
  - Spark ML
  - TensorFlow
  - scikit-learn
  - XGBoost
- Good communication skills
- Highly skilled problem solver
- Exhibits a high degree of initiative
- At least an undergraduate degree in CS or a STEM-related field
- Master’s or PhD in CS, Data Science, Machine Learning, AI or a related STEM field
- Experience with data engineering and model building in PySpark using Spark ML on petabyte-scale data
- Understands and implements methods to evaluate their own work and the work of others for bias, inaccuracy, and error
- Loves working with error-prone, messy, disparate, unstructured data
- Abide by Mastercard’s security policies and practices;
- Ensure the confidentiality and integrity of the information being accessed;
- Report any suspected information security violation or breach, and
- Complete all periodic mandatory security trainings in accordance with Mastercard’s guidelines.
Identifier
80a5c3d35091699dcc8af2deb8c5834c
Ready to join the team? We'd love to have you!