Welcome To Precision IT Tech - Applying For The Site Reliability Engineer (SRE) Position

Site Reliability Engineer (SRE) Job Description

Site Reliability Engineer (SRE)

Job Overview

We are seeking a highly motivated and experienced Site Reliability Engineer (SRE) to join our dynamic engineering team. As an SRE, you will be responsible for ensuring the reliability, performance, and scalability of our core platform and services. You will work collaboratively with development, operations, and security teams to proactively identify and resolve issues, automate operational tasks, and drive continuous improvement in our systems. This role requires a strong understanding of infrastructure, automation, and monitoring, coupled with a passion for building resilient and highly available systems.

Key Responsibilities

  • Design, implement, and maintain automated monitoring and alerting systems to proactively identify and respond to system issues.
  • Develop and maintain infrastructure-as-code (IaC) using tools like Terraform, CloudFormation, or similar.
  • Automate operational tasks such as deployments, scaling, and configuration management.
  • Participate in on-call rotations to provide 24/7 support for production systems.
  • Troubleshoot and resolve complex system issues, collaborating with development teams to implement effective solutions.
  • Conduct root cause analysis of incidents to identify underlying problems and prevent future occurrences.
  • Contribute to the development and maintenance of our SRE practices and methodologies.
  • Work closely with development teams to integrate reliability considerations into the software development lifecycle (SDLC).
  • Perform capacity planning and performance tuning to optimize system performance.
  • Document system architecture, configurations, and operational procedures.

Required Qualifications

  • Bachelor’s degree in Computer Science, Engineering, or a related field.
  • 3+ years of experience in a systems administration, DevOps, or SRE role.
  • Strong understanding of Linux operating systems and command-line tools.
  • Experience with cloud platforms such as AWS, Azure, or Google Cloud Platform.
  • Proficiency in scripting languages such as Python, Bash, or Go.
  • Experience with containerization technologies such as Docker and Kubernetes.
  • Experience with monitoring tools such as Prometheus, Grafana, or Datadog.
  • Solid understanding of networking concepts and protocols.
  • Excellent problem-solving and analytical skills.
  • Strong communication and collaboration skills.

Preferred Qualifications

  • Experience with Infrastructure as Code (IaC) tools (e.g., Terraform, CloudFormation).
  • Experience with configuration management tools (e.g., Ansible, Chef, Puppet).
  • Experience with CI/CD pipelines and automation tools.
  • Certifications in AWS, Azure, or Google Cloud Platform.
  • Experience with security best practices and compliance standards (e.g., SOC 2, HIPAA).
  • Contributions to open-source projects.

Benefits

We offer a comprehensive benefits package including:

  • Competitive Salary
  • Medical, Dental, and Vision Insurance
  • Paid Time Off (PTO) – Including Vacation and Sick Leave
  • Paid Holidays
  • 401(k) Retirement Plan with Company Match
  • Stock Options
  • Professional Development Opportunities
  • Employee Assistance Program
  • Life Insurance
  • Disability Insurance

Start Your Application

©Fit First® Technologies International, Inc. All Rights Reserved, Worldwide. Patented.