We're looking for an experienced and detail-oriented AWS DevOps Engineer to join our team and lead the deployment and optimization of scalable infrastructure and AI/ML solutions in a cloud-native environment. The ideal candidate will have a strong background in AWS services, DevOps practices, and cloud-based AI/ML deployments.
Requirements
Key Responsibilities:
Design, implement, and manage cloud infrastructure using a broad range of AWS services such as EC2, Lambda, API Gateway, EKS, Auto Scaling, S3, RDS, CloudFront, VPC, CloudWatch, and more.
Lead deployment and lifecycle management of AI/ML models using AWS Bedrock, SageMaker, and other relevant tools.
Apply DevOps best practices including CI/CD pipelines (e.g., Jenkins) and infrastructure as code.
Monitor infrastructure and optimize performance, availability, and cost efficiency using tools like ECS and CloudWatch.
Implement robust logging and monitoring solutions to ensure traceability and operational insight for deployed models.
Manage and preprocess data to ensure high-quality inputs for machine learning workflows.
Perform data validation and cleansing to uphold data integrity across pipelines.
Ensure all systems comply with security and data protection regulations; proactively manage risks and vulnerabilities.
Ideal Candidate Will Have:
Minimum 5 years of hands-on experience in DevOps, SysOps, and cloud architecture within AWS environments.
Deep expertise in AWS services and infrastructure management.
Proven experience with CI/CD tools and methodologies.
AWS Certification (Associate or Professional level) is required.
Strong analytical and troubleshooting skills with a high attention to detail.
Excellent communication and collaboration skills in cross-functional team environments.
Familiarity with cost and performance optimization for AI/ML workloads on AWS.
Strong grasp of data security, compliance, and governance in cloud environments.
Key Skills:
AWS DevOps, AWS Bedrock, SageMaker, CI/CD, Jenkins, Infrastructure as Code, AI/ML Deployment, Cloud Monitoring, Data Management, Security Compliance