Site Reliability Engineer – Triage (Remote) at GovCIO in Boise, Idaho, United States Job Description Overview GovCIO is currently hiring for a Site Reliability Engineer – Triage (Remote). This position will be fully remote. Responsibilities + Support design and implementation of improvement planning, data analysis, assessments, and organizational strategies. + Understanding the nature of incident troubleshooting processes using SRE best practices and voicing identified impacts to a larger audience during triaging events + Support and provide guidance for tracking complex business procedures to achieve goals and overcome barriers in the collection of technical information from the relevant stakeholders, or in support of content for white papers and other communication devices; and assessing and evaluating the effectiveness of executive communication to effect process improvement. + Must be a critical thinker and the ability to effectively communicate during triage events. + Proactive approach in identifying system vulnerabilities and driving to resolution during triaging events. + Must be able to effectively communicate to Executive leadership. + Organizational skills and Documentational skills are a MUST – Attention to detail. + Support Triage efforts during Major Incidents by deconstructing application performance, interoperability, instrumentation, and human factors to facilitate resolution and development of resilient solutions. + Leverage use of Monitoring tools during Triaging incidents and perform analysis. Ex. Splunk, DynaTrace, SolarWinds, AppD, + Support coordination and ensure all High Priority Incident (HPI) and Critical Priority Incident (CPI) are triaged properly and routed to the appropriate and correct groups for immediate resolution. + Provide support to Problem Management’s enterprise root cause analysis (RCA) processes in collaboration with appropriate OI&T organizations. + Demonstrate proficiency with DevOps tools, JIRA, ServiceNow, MS Project and perform tasks using the tools. + On Call Rotation availability. Qualifications Required Skills and Experience + Bachelor’s with 5 years (or commensurate experience) + Should be well versed in the concepts of DevOps and have a full understanding of Site Reliability Engineering (SRE) principles. + IT background and ability to understand technical content with expertise across multiple technology areas and the ability to diagnose complex issues throughout many technologies. + Must be able to identify and mitigate risks to the product. + Must be able to provide oral and written discussion of analytical findings using narrative and graphic forms. + Must be able to use qualitative and quantitative analytical skills to assess the effectiveness of the operations, identifying symptoms for process improvement. Preferred Skills and Experience + Certifications in relevant UX software plus 3-5 years of relevant experience + 8 to 10 years of relevant experience may be substituted for education (13-15 years total) + Analytical, investigation, and organization skills. + Attention to detail and data accuracy. + Critical thinker with a proactive approach to lead Incident troubleshooting efforts + Ability to recognize Incident triaging flow and ability to drive call to resolution. + Communications including being able to craft content for executive-level presentations. + Experience in issue tracking tools and project management software (i.e., ServiceNow, JIRA, Microsoft Office). + Utilization of monitoring tools like Splunk, DynaTrace, AppD, SolarWinds. + Clearance Required: obtain and maintain a Suitability/Public Trust clearance. Company Overview GovCIO is a team of transformers-people who are passionate about transforming government IT. Every day, we make a positive impact by delivering innovative IT services and solutions that improve how government agencies operate and serve our citizens. But we can’t do it alone. We need great people to help us do great things – for our customers, our culture, and our ability to attract other great people. We are changing the face of government IT and building a workforce that fuels this mission. Are you ready to be a transformer? We are an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, gender, gender identity or expression, sexual orientation, national origin, disability, or status as a protected veteran. EOE, including disability/vets. Posted Pay Range The posted pay range, if referenced, reflects the range expected for this position at the commencement of employment, however, base pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, education, experience, and internal equity. The total compensation package for this position may also include other compensation elements, to be discussed during the hiring process. If hired, employee will be in an ‘at-will position’ and the GovCIO reserves the right to modify base salary (as well as any other discretionary payment or compensation program) at any time, including for reasons related to individual performance, GovCIO or individual department/team performance, and market factors. Posted Salary Range USD $90,000.00 – USD $105,000.00 /Yr. Submit a referral to this job (Location US-Remote ID 2024-3533 Category IT Infrastructure & Network Engineering & Operations Position Type Full-Time To view full details and how to apply, please login or create a Job Seeker account
Microsoft Project DevOps Jira Splunk ServiceNow monitoring Site Reliability Engineering (SRE) dynatrace