18 years of work experience. I've been technically leading and managing organizations at Google for a total of 10+ years.
I've also recruited, mentored and grown technical managers, functional managers, and senior leaders including software engineers, systems engineers and technical program managers.
My teams have been responsible for the reliability and scalability of product launches such as Cloud Identity, Compute Engine, Hangouts Chat and Hangouts Meet.
I've proposed, reorganized and grown shared infrastructure teams responsible for software development related to datacenter services/cluster turnups, incident management and chaos engineering.
I've been mostly recently in charge of assisting GCP customers with bootstrapping and growing their SRE practices and organizations.
SRE, DevOps and Dev
SRE is an implementation of DevOps.
Google "dev" as in Software Engineers focusing on feature [dev]elopment (as opposed to security, privacy or reliability) work "de facto" as DevOps. In the absence of SRE support, devs at Google get to operate their systems themselves. SREs are generally in high demand.
Keeping me Busy
I'm a member of the CRE team within Google SRE. CREs are responsible for bringing SRE best practices including processes and software to the world. While we've been prioritizing working with Google Cloud Platform customers, we are also very much interested in enabling SRE everywhere.
I'd highly recommend reading the second SRE book, The Site Reliability Workbook which contains concrete implementation examples. Google SRE's first book was a broader reference title without as many concrete examples of how to implement our recommendations.