Grainger is a leading broad line distributor with operations primarily in North America, Japan, and the United Kingdom. We achieve our purpose, We Keep the World Working, by serving more than 4.5 million customers with a wide range of products that keep their operations running and their people safe. Grainger also delivers services and solutions, such as technical support and inventory management, to save customers time and money.
We're looking for self-motivated people who can move our company forward. We have a welcoming workplace where you can build a career for yourself while fulfilling our purpose to keep the world working. We embrace new ways of thinking and recognize everyone is an individual. Find your way with Grainger today.
Grainger is seeking a Software Engineer to join our Site Reliability Engineering team. You have 3-5 years of relevant industry experience (Site Reliability Engineer (SRE), Systems Engineer, Software Engineer, DevOps Engineer, Network Engineer, Systems Administrator, Linux Administrator, Platform Engineer or similar role). Additionally, you will have experience working with application performance monitoring (APM) tools to collect telemetry about production systems and optimize software delivery.
You will be based in downtown Chicago and will report to the Senior Manager, Product Engineering.
You Will:
- Improve observability across the entire eco-system.
- Administer observability tools and solve observability related issues.
- Ensure system stability and reduce MTTR.
- Define and implement modern incident management process to support DevOps.
- Drive technical initiatives to improve automation, self healing/remediation.
- Help mentor and develop junior team members.
- Grow and evangelize the capabilities of Datadog (and other observability tools) within Grainger.
You Have:
- Bachelors' degree in Engineering or Computer Science or Information Technology, or a related technical discipline, or equivalent work experience.
- 3-5+ years of experience with an APM tool (such as Datadog, Dynatrace, AppDynamics, or Honeycomb). Demonstrated experience within the observability domain (traces, metrics, and logs).
- Ability to define the strategies, patterns, solutions to improve the reliability of the system.
- Have a knack for automation and a passion for reducing toil.
- Expertise in designing, analyzing, and troubleshooting large-scale distributed systems and conducting blameless postmortems.
- 1-2 years of experience in at least one of the following languages: Java, Python, or Go.
- Experience with cloud native technologies such as containers, Kubernetes, Istio. Familiarity with EKS is a plus.
- Understanding of Unix/Linux operating systems.
- Ability to debug, optimize code, and automate routine tasks.
- Experience implementing CI/CD pipelines.
- Experience working with microservices.
Rewards and Benefits:
With benefits starting day one, Grainger is committed to your safety, health and wellbeing. Our programs provide choice and flexibility to meet our team members' individual needs. Check out some of the rewards available to you at Grainger:
- Medical, dental, vision, and life insurance plans.
- Generous paid time off (PTO) and 6 company holidays per year.
- Automatic 6% 401(k) company contribution each pay period.
- Employee discounts, parental leave, 3:1 match on donations and tuition reimbursement.
- A comprehensive set of emotional, financial, physical and social wellbeing programs.
DEI Statement:
We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender, gender identity or expression, or veteran status. We are proud to be an equal opportunity workplace.
#J-18808-Ljbffr