Site Reliability Engineer Job at Covetus, Overland Park, KS

  • Covetus
  • Overland Park, KS

Job Description

Job Title: Lead SRE Engineer

Location: Overland Park, KS / Seattle, WA

Duration: Long-term contract

Job Overview:

  • Client is looking for a Lead SRE Engineer
  • Experience leading SRE triage calls
  • Ability to resolve tickets and communicate resolutions to the team
  • Ability to work across stakeholders and manage cross-stakeholder relationships

Roles And Responsibilities

  • System Monitoring and Incident Response: Implement monitoring solutions to track system health, performance, and availability. Proactively monitor systems, identify issues, and respond to incidents promptly to minimize downtime and mitigate impact.
  • Post-Incident Analysis: Lead incident response efforts, coordinate with cross-functional teams, and conduct post-incident analysis to identify root causes and implement preventive measures.
  • Continuous Improvement and Reliability Engineering: Drive continuous improvement by identifying areas for enhancement, implementing best practices, and fostering a culture of reliability engineering. Participate in post-mortems, conduct blameless retrospectives, and drive initiatives to improve system reliability, stability, and maintainability.
  • Collaboration and Knowledge Sharing: Collaborate closely with software engineers, operations teams, and other stakeholders to ensure smooth coordination and effective communication. Share knowledge, provide technical guidance, and contribute to a strong engineering culture.
  • Support and maintain configuration management for various applications and systems
  • Implement comprehensive service monitoring, including dashboards, metrics, and alerts
  • Define, measure, and meet key service level objectives, such as uptime, performance, incidents, and chronic problems
  • Partner with application and business stakeholders to ensure high quality product development and release
  • Collaborate with the development team to enhance system reliability and performance.

Qualifications

  • Bachelor’s degree in Information Technology, Computer Science, or related field.
  • Strong knowledge of software development processes and procedures.
  • Strong problem-solving abilities.
  • Excellent understanding of computer systems, servers, and network systems.
  • Ability to work under pressure and manage multiple tasks simultaneously.
  • Strong communication and interpersonal skills.
  • Strong knowledge of programming languages such as Python, Java, and Go.

  • Experience with cloud computing platforms such as AWS, Azure, or Google Cloud
  • Experience with DevOps tools such as Git, Jenkins, Ansible, Terraform, Docker, etc.
  • Experience with monitoring tools such as Splunk and Prometheus

Skills: Problem solving, post-incident analysis, incident response, site reliability engineering, reliability engineering, continuous improvement, key service level objectives, system monitoring, service monitoring, monitoring tools (Splunk, Prometheus), configuration management, software development processes, DevOps practices, DevOps tools (Git, Jenkins, Ansible, Terraform, Docker), Kubernetes, cloud computing (AWS, Azure, Google Cloud), programming (Python, Java, Go, C/C++, Ruby, JavaScript).

Job Tags

Contract work
