[Jobposts] Site Reliability Engineer for Google.com Engineering Group

Laura Grantham lgrantham at google.com
Wed Sep 29 23:07:32 UTC 2010


Who/What/Why/When?

Google.com (SRE or Site Reliability Engineering) is comprised of software
and systems engineering teams worldwide who are specialists in
troubleshooting, tools development, and production systems automation. SRE
Launch and Deployment also consults with software engineering teams during
the development cycle to help software engineers configure new services
across our infrastructure and comply with our architectural guidelines for
reliability, speed, and scalability. SRE is responsible for ongoing capacity
planning to handle Google's rapid traffic growth and global expansion. At
the launch point that a new service (product) is deployed and after services
are in production. When ongoing optimization, load balancing, and
enhancements are required, SRE is there to manage and maintain services
keeping Google.com and these services reliable and available for 100 of
millions of users around the World!



Who… We Are?

Google.com (SRE or Site Reliability Engineering) is comprised of hundreds of
engineers and technologists worldwide who are generalists in software
design, architecture, systems administration, and machines/data center
operations.



What… We Do?

Google.com/SRE is also accountable for all mission-critical Google products
(as well as the Google site) ensuring availability, performance and capacity
– these are engineers that are mobilized and deployed to troubleshoot and
‘fix’ both user-facing and internal services. The Google.com launch team is
responsible for ensuring new services meet our code and architecture
standards during launch. We deploy, maintain, and optimize a subset of
services that get huge traffic or generate revenue.



Why… We Are Needed?

Software development teams rely on us to guide their launch and deployment
of new services. Customers, like advertisers and website owners, rely on us
for reliability and performance. Users rely on us for availability and quick
response. Google relies on us for scalability and maintaining our brand
image for uptime and speed, and our engineers ensure software is designed
correctly before launching; they perform troubleshooting, reliability
planning, and performance optimization to maintain these world-class
standards.



When… Are We Needed?

Needed during the development cycle to help developers understand and comply
with our architectural guidelines for reliability, speed, and scalability.
For ongoing capacity planning to handle Google's rapid traffic growth and
global expansion. At the point that a new service (product) is deployed and
after services are in production. When ongoing optimization, load balancing,
and enhancements are required. When Google services or the .com site require
an immediate response (24/7/365) to mobilize engineers to troubleshoot,
identify threats, and bring services online.



Where… We Are?

Primarily concentrated in Mt. View, Dublin, Zurich, with distributed teams
in San Francisco, Santa Monica, Boston, London, Pittsburgh, Kirkland,
Seattle, New York, London, and Sydney.

Qualified and interested parties can email or call me directly at the
contact information listed below!

-- 


Laura Grantham
Google Staffing- Engineering
650-214-1950
lgrantham at google.com

http://www.linkedin.com/pub/laura-grantham/2/255/815

http://www.google.com/support/jobs/bin/static.py?page=gettingintogoogle.html
http://www.youtube.com/watch?v=w887NIa_V9w
www.youtube.com/lifeatgoogle


More information about the Jobposts mailing list