Our brands bridge the gaps we see in the world. Old Navy democratizes style to ensure everyone has access to quality fashion at every price point. Athleta unleashes the potential of every woman, regardless of body size, age or ethnicity. Banana Republic believes in sustainable luxury for all. And Gap inspires the world to bring individuality to modern, responsibly made essentials.
This simple idea—that we all deserve to belong, and on our own terms—is core to who we are as a company and how we make decisions. Our team is made up of thousands of people across the globe who take risks, think big, and do good for our customers, communities, and the planet. Ready to learn fast, create with audacity and lead boldly? Join our team.
About the Role
The Site Reliability Engineering (SRE) team is part of the IT Service Management (ITSM) group at GapTech and serves engineering teams across all brands and markets. The mission of our Site Reliability team is to operate an always-on, self-healing, fault resilient, customer-centered, proactive set of systems that deliver optimal customer experience, across digital experience & commerce applications, fulfillment & distribution centers, databases, platforms, and technology stacks.
As Director of SRE, you will report to the Sr. Director of Service Management, Asset Management & Enterprise Metrics and be responsible for strategic design & development of site reliability solutions that drive resiliency and reliability across the enterprise. You will be managing and directing high performing teams and talented engineers in an Agile work environment responsible for the production support of customer-facing eCommerce applications, stores & fulfillment, infrastructure, and digital platforms across all brands and markets. In addition to the production support for Hybrid Cloud and On-Premises hosted systems and applications, you will be expected to work with management, peers, and customers to define and implement the technical vision of the team and to adopt SRE practices. You are expected to design a system of reactive problem solving as well as proactive preventative solutions removing TOIL and decreasing overall incidents by establishing roadmaps and blueprints.
What You’ll Do
Site Reliability Engineers are hybrid systems and software engineers who are responsible and take ownership for reliability, scalability, automation, and other issues related to uptime and availability of our platforms. Our goal is to build, scale and guard the systems that service the customers.
This work group demands to be available 24*7 for business-critical incidents and must be agreeable to work a flexible schedule to meet the needs of the business, including holiday, evening, overnight and weekend shifts.
Who You Are
Benefits at Gap Inc.
*For eligible employees
At A Glance A supervisory role with 70% in the field and 30% spent in the office, leading and coaching...Apply For This Job
PCC Network Solutions is a leading data network infrastructure contractor serving large clients nationally since 1985. We’re seeking a detail-oriented...Apply For This Job
It’s your turn to take the lead and deliver the future before anyone else. You?ll introduce our customers to AT&T’s...Apply For This Job
Client: Texas Children’s Hospital (TCH) Location: Houston, TX (onsite) Type: Long-Term Contract with potential to convert to FTE Project: TCH...Apply For This Job
Technical Account Manager /Technical Client Manager/Implementation Consultant /Implementation Project Manager/ Deployment Engineer/ Technical Consultant/ Solutions Engineer/Technical Customer Success Manager-remote, MarTech...Apply For This Job
Location: Akron, OH Please use Google Chrome or Mozilla Firefox when accessing Candidate Home. By joining the American Red Cross...Apply For This Job