Pradeepit Consulting Services
Data Analytics-Python Developer
Bangalore ₹4-15 LPA Posted 22 Jul 2025
FULL TIME
Machine Learning
Data Visualization
Pandas
Numpy
Data Cleaning
Job Description
Job Summary:
The Python Data Engineer will be responsible for building, maintaining, and optimizing data pipelines using Dagster and Python for efficient ETL workflows. The role includes working with modern data libraries and modeling techniques, containerization through Docker, and integration with visualization tools like Power BI. The candidate must have strong analytical and communication skills with a focus on clean, scalable architecture and data quality.
Key Responsibilities:
- Pipeline Development
  - Develop and maintain Dagster pipelines that connect to various data sources and convert raw data into Parquet format
  - Optimize ETL processes for robust, scalable data workflows
- Data Analysis & Transformation
  - Analyze different source systems to design efficient integration strategies
  - Modify and manage Parquet datasets to ensure data quality and performance
- Technical Implementation
  - Use Python libraries such as Pandas, Polars, and DuckDB for data processing and analysis
  - Containerize and deploy applications using Docker
- Data Modeling
  - Design data models from source tables to staging, capturing time-series data for advanced analytics
  - Implement dimensional modeling concepts such as star schema and snowflake schema
- Collaboration
  - Work closely with cross-functional teams in an Agile environment to understand data needs and deliver scalable solutions