Design and implement Cloudera-based data platforms, including cluster sizing, configuration, and optimization.
Install, configure, and administer Cloudera Manager and CDP clusters, managing all aspects of the cluster lifecycle.
Monitor and troubleshoot platform performance, identifying and resolving issues in a timely manner.
Review and maintain the data ingestion and processing pipelines on the Cloudera platform.
Collaborate with data engineers and data scientists to design and optimize data models, ensuring efficient data storage and retrieval.
Implement and enforce security measures for the Cloudera platform, including authentication, authorization, and encryption.
Manage platform user access and permissions, ensuring compliance with data privacy regulations and internal policies.
Create technology roadmaps for the Cloudera platform. Stay up to date with the latest Cloudera and big data technologies, and recommend and implement relevant updates and enhancements to the platform.
Plan, test, and execute upgrades of Cloudera components while ensuring platform stability and security.
Document platform configurations, processes, and procedures, and provide training and support to other team members as needed.
Requirements:
Proven experience as a Cloudera platform engineer or in a similar role, with a strong understanding of Cloudera Manager and CDH clusters.
Expertise in designing, implementing, and maintaining scalable, high-performance data platforms using Cloudera technologies such as Hadoop, Spark, Hive, and Kafka.
Strong knowledge of big data concepts and technologies, data modeling, and data warehousing principles.
Familiarity with data security and compliance requirements, and experience implementing security measures for Cloudera platforms.
Proficiency in Linux system administration and scripting languages (e.g., Shell, Python).
Strong troubleshooting and problem-solving skills, with the ability to diagnose and resolve platform issues quickly.
Excellent communication and collaboration skills, with the ability to work effectively in cross-functional teams.
Experience with Azure Data Factory, Azure Databricks, or Azure Synapse is a plus.