Position Summary
Consultant – Site Reliability Engineer
Are you someone who likes to tackle complex technical challenges? As a Consultant at Deloitte, you can help build and deliver technology solutions to meet clients’ business needs. A Consultant gains exposure to multiple technologies while delivering engagement specifics under the full breadth of services offered by Deloitte Consulting LLP.
Cloud Engineering
Our Cloud Engineering team focuses on enabling our client’s end to end journey from On-Premise to Cloud, with opportunities in the areas of: Cloud Strategy, Op Model Transformation, Cloud Development, Cloud Integration & APIs, Cloud migration, Cloud Infrastructure & Engineering, and Cloud Managed Services. We help our clients see the transformational capabilities of Cloud as an opportunity for business enablement and competitive advantage.
Cloud Engineering supports our clients as they improve agility and resilience and identifies opportunities to reduce IT operations spend through automation by enabling Cloud. We accelerate our clients towards a technology-driven future, leveraging vendor solutions and Deloitte-developed software products, tools, and accelerators.
Job Summary: We are seeking a highly skilled and experienced Site Reliability Engineer (SRE) to join our dynamic team. The ideal candidate will have a strong background in Linux system administration, networking, production debugging and troubleshooting, shell scripting, and experience with monitoring tools such as Grafana and Prometheus. This role will be pivotal in ensuring the reliability, availability, and performance of mission critical production systems.
Work you will do:
- Monitoring & Performance Management using SRE principles:
- Set up and manage monitoring tools such as Grafana and Prometheus.
- Create and maintain dashboards to monitor system health and performance.
- Implement alerting mechanisms to proactively identify and address issues.
- Select metrics for SLIs, set SLOs, and track error budgets to mitigate risk for your service.
- Reduce MTTD & MTTR using SRE principles.
- Scripting & Automation:
- Develop and maintain scripts using Bash, Go, and Python to automate routine tasks.
- Implement infrastructure as code (IaC) practices to manage and provision resources.
- System Administration & Networking:
- Manage and maintain Linux-based systems and network infrastructure.
- Ensure system security, performance, and reliability.
- Implement and manage network configurations, including firewalls, VPNs, and load balancers.
- Production Debugging & Troubleshooting:
- Diagnose and resolve production issues promptly to minimize downtime.
- Perform root cause analysis of incidents and implement preventive measures.
- Collaborate with development teams to improve system reliability and performance.
- Collaboration & Communication:
- Work closely with cross-functional teams to ensure seamless deployment and operation of services.
- Document processes, procedures, and system configurations.
- Provide technical guidance and mentorship to junior team members.
Required Qualifications:
- Bachelor’s degree in computer science, Information Technology, or a related field.
- 3+ Years of strong hands-on experience working as an SRE.
- Advanced knowledge of SRE.
- Linux System Administration experience.
- Proficiency in shell scripting (Bash) and programming languages such as Go and Python.
- Direct experience with monitoring tools like Grafana and Prometheus.
- Strong knowledge of networking principles and protocols.
- Ability to work on site in San Francisco or San Jose a minimum of 3 days a week.
- Ability to travel up to 50% of the time, depending on project requirements.
- Must be legally authorized to work in the United States without the need for employer sponsorship, now or at any time in the future.
Preferred Qualifications:
- On – call escalation support experience.
- Experience with cloud platforms (AWS, Azure, GCP).
- Familiarity with containerization and orchestration tools (Docker, Kubernetes).
- Knowledge of CI/CD pipelines and tools (Jenkins, GitLab CI).
- Excellent problem-solving and troubleshooting skills.
- Strong communication and collaboration skills.
From developing a stand out resume to putting your best foot forward in the interview, we want you to feel prepared and confident as you explore opportunities at Deloitte. Check out recruiting tips from Deloitte recruiters.
At Deloitte, we know that great people make a great organization. We value our people and offer employees a broad range of benefits. Learn more about what working at Deloitte can mean for you.
Our diverse, equitable, and inclusive culture empowers our people to be who they are, contribute their unique perspectives, and make a difference individually and collectively. It enables us to leverage different ideas and perspectives, and bring more creativity and innovation to help solve our client most complex challenges. This makes Deloitte one of the most rewarding places to work. Learn more about our inclusive culture.
From entry-level employees to senior leaders, we believe there’s always room to learn. We offer opportunities to build new skills, take on leadership opportunities and connect and grow through mentorship. From on-the-job learning experiences to formal development programs, our professionals have a variety of opportunities to continue to grow throughout their career.