Data Engineer (Level 5), AOP
Job Description
Key job responsibilities
- Design, implement and support an analytical data infrastructure
- Manage AWS resources including EC2, EMR, S3, Glue, Redshift, etc.
- Interface with other technology teams to extract, transform, and load data from a wide variety of data sources using SQL and AWS big data technologies
- Explore and learn the latest AWS technologies to provide new capabilities and increase efficiency
- Collaborate with Data Scientists and Business Intelligence Engineers (BIEs) to recognize and help adopt best practices in reporting and analysis
- Help continually improve ongoing reporting and analysis processes, automating or simplifying self-service support for customers
- Maintain internal reporting platforms and tools, including troubleshooting and development. Interact with internal users to establish and clarify requirements in order to develop report specifications.
- Work with Engineering partners to help shape and implement the development of BI infrastructure including Data Warehousing, reporting and analytics platforms.
- Contribute to the development of the BI tools, skills, culture and impact.
- Write advanced SQL queries and Python code to develop solutions
A day in the life
This role requires you to live at the intersection of data, software, and analytics. We leverage a comprehensive suite of AWS technologies, with key tools including S3, Redshift, DynamoDB, Lambda, APIs, and Glue. You will drive the development process from design to release.
- Managing data ingestion from heterogeneous data sources, with automated data quality checks.
- Creating scalable data models for effective data processing, storage, retrieval, and archiving.
- Using scripting for automation and tool development that is scalable, reusable, and maintainable.
- Providing infrastructure for self-serve analytics and science use cases.
- Using industry best practices in building CI/CD pipelines.
About The Team
The AOP (Analytics Operations and Programs) team's mission is to standardize BI and analytics capabilities and to reduce repeat analytics, reporting, and BI workload for operations across the IN, AU, BR, MX, SG, AE, EG, and SA marketplaces.
AOP is responsible for providing visibility into operations performance and for implementing programs that improve network efficiency and reduce defects. The team is a diverse mix of strong engineers, analysts, and scientists who champion customer obsession.
We enable operations to make data-driven decisions by developing near-real-time dashboards, self-serve dive-deep capabilities, and advanced analytics capabilities.
We identify and implement data-driven metric improvement programs, co-owned with Operations teams.
Basic Qualifications
- 3+ years of data engineering experience
- 4+ years of SQL experience
- Experience with data modeling, warehousing and building ETL pipelines
- Experience building/operating highly available, distributed systems of data extraction, ingestion, and processing of large data sets
- Experience in at least one modern scripting or programming language, such as Python, Java, Scala, or Node.js
Preferred Qualifications
- Experience with AWS technologies like Redshift, S3, AWS Glue, EMR, Kinesis, Firehose, Lambda, and IAM roles and permissions
- Experience with non-relational databases / data stores (object storage, document or key-value stores, graph databases, column-family databases)