Site Reliability Engineer (SRE) Job at Cloudious LLC, Ontario, CA

TVR4NjE5TjJPWk9EdytmYlNBR0ZzWFFDK1E9PQ==
  • Cloudious LLC
  • Ontario, CA

Job Description

$60/hr CAD

Glider MUST

Job Summary:

We are seeking an experienced Site Reliability Engineer (SRE) with advanced DevOps expertise to help build, scale, and maintain our infrastructure and services.

You will play a critical role in ensuring high availability, performance, scalability, and security of our production systems, while enabling continuous deployment and rapid delivery of features to our customers.

Key Responsibilities:

  • Design, build, and maintain reliable, scalable, and secure cloud-based infrastructure (AWS, Azure, or GCP).
  • Develop and improve observability using monitoring, ing, logging, and tracing tools (e.g., Prometheus, Grafana, ELK, Datadog, etc.).
  • Automate repetitive tasks and infrastructure using Infrastructure-as-Code (Terraform, CloudFormation, Pulumi).
  • Create and maintain CI/CD pipelines (GitHub Actions, GitLab CI, Jenkins, ArgoCD, etc.) to support fast and safe delivery.
  • Lead incident response, root cause analysis, and postmortems to ensure high uptime and rapid recovery.
  • Optimize system performance, reliability, and cost-effectiveness through proactive monitoring and tuning.
  • Collaborate with software engineering teams to define SLAs/SLOs and improve service reliability.
  • Implement and maintain security best practices across environments (e.g., secrets management, IAM, firewalls, etc.).
  • Maintain disaster recovery plans, backups, and high-availability strategies.

Qualifications: Required:

  • 5+ years of experience as an SRE, DevOps Engineer, or similar role.
  • Proficiency in scripting and automation (Bash, Python, Go, etc.).
  • Strong experience with containerization and orchestration (Docker, Kubernetes, Helm).
  • Solid understanding of Linux systems administration and networking fundamentals.
  • Experience with cloud platforms (AWS, Azure, or GCP).
  • Experience with IaC tools like Terraform or CloudFormation.
  • Familiarity with GitOps and modern deployment practices.
  • Hands-on experience with observability tools (e.g., Prometheus, Grafana, Datadog).
  • Strong troubleshooting and incident response skills.

Preferred:

  • Experience in a high-traffic, microservices-based architecture.
  • Exposure to service meshes (Istio, Linkerd).
  • Certifications (AWS Certified DevOps Engineer, CKA, etc.)
  • Experience with security automation and compliance (e.g., SOC2, ISO27001).

Soft Skills:

  • Strong communication and collaboration abilities.
  • Ability to thrive in a fast-paced, agile environment.
  • Analytical mindset and proactive approach to problem-solving.
  • A passion for automation, performance, and system design.

Skills

Azure, Prometheus, Terraform

Job Tags

Similar Jobs

Network Adjusters, Inc.

Bodily Injury Claims Adjuster Job at Network Adjusters, Inc.

 ...Network Adjusters is seeking skilled bodily injury insurance claims adjusters for a liability claims adjuster position. Serving the insurance industry for almost...  ...expectations based on Network's Best Practices Ability to work autonomously, maintaining accurate and up-to-date... 

A-Nu Virtual Solutions, LLC

Remote Customer Support Specialist- Be Part of our team Job at A-Nu Virtual Solutions, LLC

 ...people who are empowered, equipped, and trusted. If you enjoy working from home with the right toolsand want to join a supportive, mission...  ...teamthis position is for you. Youll have the flexibility of remote work, the stability of regular hours, and a real opportunity to... 

Hillwood Property Trust

Corporate Flight Attendant Job at Hillwood Property Trust

Hill Air Corporation operates as a Part 91 air operation, serving as the private flight department dedicated to...  ...and experienced Corporate Flight Attendant to join our Corporate Flight Department...  ...situations.Must be willing to work in a time-sensitive environment in a self-... 

Medical Solutions Allied

Travel Physical Therapy Assistant - $1,794 per week Job at Medical Solutions Allied

 ...Medical Solutions Allied is seeking a travel Physical Therapy Assistant for a travel job in Anchorage, Alaska. Job Description & Requirements ~ Specialty: Physical Therapy Assistant ~ Discipline: Therapy ~ Duration: 13 weeks ~40 hours per week ~ Shift:... 

Carle Health

Social Worker LSW - Co Responder Social Service Unit Job at Carle Health

 ...This position will be a part of the City of Peoria Social Services Unit (SSU). SSU personnel are tasked with assisting those who come into contact with, or are...  ...will consist of supervision of Licensed Social Workers and Licensed Clinical Social Workers, clinical oversight...