DigitalGov Search: Cache Me If You Can
Have you ever been frustrated when visiting a Web page that doesn’t load quickly? Have you ever left a slow Web page before it finished loading? You’re not alone.
Several recent studies have quantified customers’ frustration with slow Web pages. Customers now expect results in the blink of an eye. This expectation means that your customers are won or lost in one second. A one second delay in loading a Web page equals 11% fewer page views, 16% decrease in customer satisfaction, and 7% loss in conversions.
Slowness Kills Search Results Pages
As little time as websites have to keep users on their pages, search engines have even less time to keep searchers on their results pages. Speed is the primary factor in determining customers’ satisfaction with search results.
Google, Microsoft, and Yahoo garner 95% of the search market. Google garners two-thirds of the search market. The company’s Gospel of Speed motto is one reason why Google garners the majority of the market.
This gospel has also set a high bar for all search engines. Searchers expect results pages to load very, very quickly.
How We’ve Made Our Result Pages Load Faster
So, when we established the service’s open source architecture in 2010, the first thing we tackled was how to deliver our search results in under one second.
At around the same time, Github was experiencing exponential growth and the company’s engineers were blogging about what they did to make Github fast. To get up to speed quickly (yes, bad pun intended), we read their posts.
Leveraging some of Github’s best practices, we succeeded in delivering our results in under 700 milliseconds, on average. This was a significant accomplishment and improvement from the previous vendor-owned and -operated iterations of our service.
Over the past three years, we’ve dug in and improved our response time even more. We now deliver our results in under 380 milliseconds, on average.
We already had an architecture optimized for speed. So, how have we sped it up by 320 milliseconds?
We Cache When We Can
When a searcher enters a query, we go out to our various indexes, pull the information relevant to the searcher’s request, and put that information together on the results page.
Most queries (such as jobs, obama, unclaimed money, forms) aren’t unique and are asked by thousands of searchers each day.
We cache these so-called short head queries and store them on our servers. Caching helps us speed up the above process because searchers don’t have to wait for us to pull the information from its original source.
We Use an Asset Pipeline
We also use fingerprinting—a technique that makes a file’s name dependent on its content—within our asset pipeline. When the content changes, the name changes. For content that is static or that changes infrequently, this naming helps us tell whether two versions of a file are identical. When a filename is unique, browsers keep their own copy of the content. When the content is updated, the fingerprint changes so browsers request a new copy of the content. This approach allows us to maximize our content delivery network.
We Use a Content Delivery Network
Our static content (such as scripts and stylesheets) gets served through our content delivery network provider, currently Akamai. Akamai serves our static content from its server that is geographically closest to the searcher. The closer, the faster.
Using a content delivery network also allow us to optimize our service’s speed by:
- Directing non-cached traffic between our two data centers to create a multihomed environment. Multihoming allows us to make full use of all of our servers. By contrast, in 2010, our disaster recovery data center often sat idle.
- Reducing our need to add bandwidth or servers to handle short-term traffic spurts, such as spurts related to natural disasters.
- Protecting against denial of service attacks by spotting them before they reach our servers.
We’ve worked hard over the past three years to speed up the delivery of our results by optimizing each link in the chain.
We use several monitoring tools to measure our system’s performance. The quality of these tools is improving at a rapid pace, which in turn, shows us where and how we can improve our service.
We regularly ask ourselves, “Will this shave some time off and help us deliver our results in under 380 milliseconds?”