Job Description
Inviting applications for the role of Principal Consultant, Databricks!
In this role, In this role, the Databricks Lead is responsible for providing technical direction and lead a group of one or more developer to address a goal
Responsibilities
.Implement Databricks Lakehouse solutions (Delta Lake, Unity Catalog, SQL Endpoints/Photon, Workflows).
.Design, build, and optimize batch & streaming pipelines using Spark and Python/Scala with strong SQL.
.Define standards for CI/CD, data governance, security, and platform operations on Databricks.
.Provide technical leadership review solution designs and guide delivery teams.
Evangelize re-use through the implementation of shared assets.
.Enforce adherence to architectural standards/principles, global product-specific guidelines, usability design standards, etc.
Proactively guide engineering methodologies, standards, and leading practices.
Identify, communicate and mitigate Risks, Assumptions, Issues, and Decisions throughout the full lifecycle.
Demonstrate strong analytical and technical problem-solving skills.
Ability to analyse and operate at various levels of abstraction.
Ability to balance what is strategically right with what is practically realistic.
. Supporting and developing our people, including learning & development, certification & career development plans
. Providing technical governance and oversight for solution design and implementation
. Should have technical foresight to understand new technology and advancement.
. Leading team in the definition of best practices & repeatable methodologies in Cloud Data Engineering, including Data Storage, ETL, Data Integration & Migration, Data Warehousing and Data Governance
.Should have Technical Experience in Azure, AWS & GCP Cloud Data Engineering services and solutions.
.Contributing to Sales & Pre-sales activities including proposals, pursuits, demonstrations, and proof of concept initiatives
. Development of Whitepapers, blogs, webinars and other though leadership material
.. Working with Learning & Development teams to establish appropriate learning & certification paths for their domain.
. Build new Data capabilities, solutions, assets, accelerators, and team competencies.
. Manage multiple opportunities through the entire business cycle simultaneously, working with cross-functional teams as necessary.
Qualifications we seek in you!
Minimum Qualifications
. Excellent technical architecture skills, enabling the creation of future-proof, complex global solutions.
. Excellent interpersonal communication and organizational skills are required to operate as a leading member of global, distributed teams that deliver quality services and solutions.
. Ability to rapidly gain knowledge of the organizational structure of the firm to facilitate work with groups outside of the immediate technical team.
. Knowledge and experience in IT methodologies and life cycles that will be used.
. Managed the end to end project delivery worked with stakeholders - client, leadership and team.
. Must have strong hands-on experience on various cloud services like Azure/ADF, ADLS/S3, Security, Monitoring, Governance
. Must have experience to design platform on Databricks.
. Hands-on Experience to design and build Databricks based solution on any cloud platform.
. Hands-on experience to design and build solution powered by DBT models and integrate with databricks.
. Must be very good designing End-to-End solution on cloud platform.
. Must have good knowledge of Data Engineering concept and related services of cloud.
. Must have good experience in Python ,PySpark and SQL.
. Must have good experience in setting up development best practices.
. Intermediate level knowledge is required for Data Modelling.
. Good to have knowledge of docker and Kubernetes.
. Extensive, real-world experience designing technology components for enterprise solutions and defining solution architectures and reference architectures with a focus on cloud technologies.
. Exposure to infrastructure and application security technologies and approaches
. Familiarity with requirements gathering techniques.
Preferred Qualifications/ Skills
Must have designed the E2E architecture of unified data platform covering all the aspect of data lifecycle starting from Data Ingestion, Transformation, Serve and consumption.
. Must have excellent coding skills either Python or Scala,Pyspark preferably Python and Pyspark
. Should have experience in Insurance domain
. Must have designed and implemented at least 2-3 project end-to-end in Databricks.
. Must have experience on databricks which consists of various components as below
o Delta lake
o dbConnect
o db API 2.0
o SQL Endpoint - Photon engine
o Unity Catalog
o Databricks workflows orchestration
o Security management
o Platform governance
o Data Security
. Must have knowledge of new features available in Databricks and its implications along with various possible use-case.
. Must have followed various architectural principles to design best suited per problem.
. Must be well versed with Databricks Lakehouse concept and its implementation in enterprise environments.
. Must have strong understanding of Data warehousing and various governance and security standards around Databricks.
. Must have knowledge of cluster optimization and its integration with various cloud services.
. Must have good understanding to create complex data pipeline.
. Must be strong in SQL and sprak-sql.
. Must have strong performance optimization skills to improve efficiency and reduce cost.
. Must have worked on designing both Batch and streaming data pipeline.
. Must have extensive knowledge of Spark and Hive data processing framework.
. Must have worked on any cloud (Azure, AWS, GCP) and most common services like ADLS/S3, ADF/Lambda, CosmosDB/DynamoDB, ASB/SQS, Cloud databases.
. Must be strong in writing unit test case and integration test.
. Must have strong communication skills and have worked with cross platform team.
. Must have great attitude towards learning new skills and upskilling the existing skills.
. Responsible to set best practices around Databricks CI/CD.
. Must understand composable architecture to take fullest advantage of Databricks capabilities.
. Good to have Rest API knowledge.
. Good to have understanding around cost distribution.
. Good to have if worked on migration project to build Unified data platform.
. Good to have knowledge of DBT.
. Experience around DevOps including docker and Kubernetes.
. Software development full lifecycle methodologies, patterns, frameworks, libraries, and tools
. Knowledge of programming and scripting languages such as JavaScript, PowerShell, Bash, SQL, Java, Python, etc.
. Experience with data ingestion technologies such as Azure Data Factory, SSIS, Pentaho, Alteryx
. Should have experience with visualization tools such as Tableau, Power BI
. Experience in distilling complex technical challenges to actionable decisions for stakeholders and guiding project teams by building consensus and mediating compromises when necessary.
. Experience coordinating the intersection of complex system dependencies and interactions
. Experience in solution delivery using common methodologies especially SAFe Agile but also Waterfall, Iterative, etc.
Demonstrated knowledge of relevant industry trends and standards