Data Engineer

IDESLABS
Pune | 4-8 LPA | Posted 15 May 2025
FULL TIME
Docker
Spark
Kafka
Presto
SAP HANA

Job Description

  • 5+ years of relevant experience in the field of Data Engineering
  • Advanced skills in big data technologies such as Hadoop, Python, Spark, and SQL.
  • Must have experience building data APIs.
  • Bachelor's degree in Computer Science or a related discipline.
  • Knowledge of data structures and algorithms.
  • Strong Python programming skills, with the ability to apply object-oriented and functional programming. Knowledge of Scala/Java would be a plus.
  • Strong knowledge of RDBMS and NoSQL databases, with the ability to implement them from scratch. Knowledge of graph databases would be a plus.
  • Strong expertise in building & optimizing data pipelines, architectures, and data sets.
  • Experience working with file formats such as Parquet, ORC, Avro, RC, etc.
  • Experience with big data infrastructure, including MapReduce, Hive, HDFS, YARN, HBase, Oozie, etc.
  • Knowledge of and experience with orchestration frameworks such as Airflow, Oozie, Luigi, etc.
  • Experience using Spark and building jobs in Python/Scala/Java.
  • Experience with or knowledge of building stream processing platforms using Spark Streaming, Storm, etc. Knowledge of Kafka/Flink+Beam would be a plus.
  • Knowledge of building REST API endpoints for data consumption.
  • Experience building scalable data pipelines for both real-time and batch workloads, using best practices in data modeling and ETL/ELT processes, with various technologies such as Spark, Kafka, Presto, SAP HANA, Airflow, and Informatica.
  • Perform data analysis using Python, complex SQL queries, and other tools.
  • Perform root cause analysis of issues from a platform standpoint on Kubernetes, containers, Hadoop, Spark, Hive, and Presto.
  • Excellent oral and written communication is a must.

Preferred

  • Master's degree in Computer Science or a related discipline.
  • Experience building self-service tools for analytics would be a plus.
  • Knowledge of ELK stack would be a plus.
  • Knowledge of implementing CI/CD for data pipelines is a plus.
  • Knowledge of containerization (Docker/Kubernetes) is a plus.
  • Experience working with one of the popular public cloud platforms is preferred.
