MUST HAVES:
- Data engineering (pandas, pyspark)
- Data lakehouse solutions such as Databricks (Delta Lake).
- Working with XLSX, CSV, JSON files, relational databases, cloud storage, structured and unstructured data
- Experience using AWS Services such as Glue, StepFunctions, Lambda, S3
- Extract/Transform/Load data using tools such as Informatica IDMC
Experience and Skill Set Requirements
Data engineering Experience
- Programming & scripting: Python, SQL, Linux shell, PowerShell.
- Data manipulation/analysis using pandas and pyspark.
- Working with XLSX, CSV, JSON files, relational databases, cloud storage, structured and unstructured data
40%
Cloud experience
- AWS and/or Azure services.
- Cloud data warehouse solutions such as AWS Redshift.
- Data lakehouse solutions such as Databricks (Delta Lake).
- Data processing o...