Data Quality and Reliability :
- Implement methods to improve data reliability and quality.
- Combine raw data from different sources to create consistent and machine-readable formats.
Structural Development :
Develop and test data structure that enable data extraction and transformation for predictive or prescriptive modeling.Process Definition :
Define and set development, test, release, update, and support processes for data engineering operations. Troubleshoot and fix code bugs.Big Data Models :
Develop big data models / use cases based on the data structure and prepare them for data operations end users.Query Execution :
Create and execute queries on structured and unstructured data sources to identify process issues or perform mass updates.Feature Layer :
Develop and implement the feature zone with the required features / KPIs required by different stakeholders and that supports building robust machine learning models.Batch Scheduling and Reporting :
Ensure that batch production scheduling and report distribution are accurate and timely.Ad Hoc Requests :
Perform ad hoc requests from users such as data research, file manipulation / transfer, and research of process issues.Requirements
Core Competencies (Level 1,2) :
Performance ExcellenceCollab. & Creating SynergyAgility & ResiliencePeople CentricityTechnical Competencies :
Basic to Intermediate knowledge in Java, C#, and Python for developing robust applicationsBasic to Moderate Experience with Cloudera or any Big data platform with the complementary service like Apache Hive, Apache Scala, BizSpark, Impala, Apache Spark, Data Security, Kafka, HBase, Sqoop, NiFi, Python for Programming, Python for Data Analysis & ML, …etc.Proficiency in handling and processing large datasets using distributed computing frameworks.Moderate knowledge of SQL for querying and managing relational databases.Understanding of data warehousing principles and best practices.Proficiency in handling and processing large datasets using distributed computing frameworksModerate experience in data analysis and visualization toolsExpertise in writing shell scripts using Bash, KornShell (ksh), or Bourne Shell (sh)Domain Expertise :
Bachelor Degree in Computer Science / Engineering is preferredMaster / Certificate needed is preferredBanking experience is preferred