Job Title:
Site Reliability Engineer (Junior)
Company: CES
Location: Kannur, Kerala
Created: 2025-08-23
Job Type: Full Time
Job Description:
We’re looking for a highly skilled Site Reliability Engineer to help us build, manage, and scale modern infrastructure systems for high-availability applications. If you're passionate about automation, cloud platforms, and solving tough operational challenges, we would love to hear from you.Key Skills and Competencies3+ years of extensive experience with Infrastructure as Code (IaC) and Desired State Configuration (DSC) tools like Terraform, CDK, and ChefExperience in packaging, deploying, and managing containerized workloads on Docker and KubernetesExpertise in managing AWS infrastructure at scale – EC2, S3, ELB, Lambda, Route 53, ECS, SQS, CloudWatchPrior experience working in DevOps or SRE environmentsStrong automation/scripting skills using PowerShell, Ruby, Go, Python, and BashHands-on with monitoring and reporting tools – ELK Stack, Dynatrace, New Relic, NagiosExperience with IIS management, performance monitoring, and troubleshootingBackground in web farm management for high-traffic SaaS applicationsStrong problem-solving and root-cause analysis skillsExperience working with .NET application architectures – caching, content delivery, high availability, load balancingFamiliarity with CI/CD pipelines and tools – TeamCity, Octopus Deploy, GitHub, Jenkins, Codefresh, etcResponsibilities:Drive initiatives to improve platform scalability and operational efficiencyLead standardization efforts across engineering and infrastructure teamsIdentify opportunities to improve and automate deployments, visibility, and managementApply cloud security best practices to ensure infrastructure safetyProvide full-stack diagnostics and resolve complex infrastructure issuesTrack performance metrics and make data-backed improvement decisionsProactively suggest infrastructure or process changes for system reliabilityEnsure disaster recovery readiness and implement high availability systemsBuild support workflows and assist with incident responseOwn and improve the customer experience through system reliability and uptimePersonal Attributes:Passionate about learning and applying new technologiesA strong collaborator who believes in team successExcellent communicator – verbal, written, and virtualHigh integrity and commitment to ethical standardsSelf-motivated, driven, and detail-orientedAble to work independently on short-term projects