Experience: 6+ years
Remote
We are seeking an experienced and detail-oriented AWS Cloud Data Engineer to design, build, and optimize scalable cloud-based data solutions. The ideal candidate will have strong data engineering fundamentals with deep expertise in AWS data services, enabling reliable, secure, and high-performance data platforms that support business and analytics needs.
Responsibilities
· Design and implement scalable data storage solutions on AWS, including Amazon S3-based data lakes.
· Build, manage, and optimize cloud data warehouses including Amazon Redshift.
· Tune performance and keep cloud resource utilization cost-efficient.
· Develop, maintain, and optimize ETL/ELT pipelines using AWS Glue and cloud-native tools.
· Automate data ingestion and transformation from multiple structured and unstructured data sources.
· Ensure high availability, scalability, and reliability of data pipelines.
· Implement workflow orchestration using Apache Airflow or AWS Step Functions.
· Automate infrastructure provisioning and deployments using CI/CD pipelines.
· Use Infrastructure-as-Code (IaC) tools such as Terraform or AWS CloudFormation.
· Implement cloud security best practices including IAM roles, encryption, and access controls.
· Monitor, troubleshoot, and optimize data pipelines using Amazon CloudWatch and logging tools.
· Ensure data compliance, quality, and reliability across systems.
· Work closely with data scientists, software engineers, and business stakeholders to gather requirements.
· Provide technical documentation and operational support for data platforms.
· Participate in code reviews and knowledge-sharing sessions.
· Stay current with emerging AWS services and data engineering best practices.
· Proactively recommend enhancements to improve scalability, performance, and reliability.
Required Qualifications
· Bachelor’s degree in Computer Science, Engineering, Mathematics, or equivalent experience.
· 5+ years of hands-on experience with the AWS cloud platform, particularly its data services.
· 3+ years of experience building cloud-native data pipelines (e.g., AWS Glue).
· Strong understanding of AWS networking and security concepts (VPC, IAM, encryption).
· Proficiency in Python, Java, or Scala.
· Strong SQL skills for data querying and modeling.
· Experience with Apache Spark, Kafka, or similar distributed data processing frameworks.
· Working knowledge of Git and CI/CD pipelines.
Preferred Qualifications
· Advanced experience with Terraform or AWS CloudFormation.
· Exposure to Docker and Kubernetes.
· Knowledge of cloud cost optimization strategies and tools (e.g., AWS Cost Explorer).
· Experience with ML pipelines or MLOps workflows in the cloud.