
Google Percolator Google has rolled out its new search platform Caffeine in June, but the company has been a little tight lipped about the implementation details. We were told that Caffeine can incrementally update the search index and produce "50% fresher" results, but Google did not go into further details. All of that has changed with two Google search engineers, Daniel Peng and Frank Dabek publishing a paper describing the technology that drives Caffeine, called Percolator. Up until now Google used a system called MapReduce to build their search index. Many crawlers would gather content from all over the web and feed it into the MapReduce system which then generates the search index in one large batch operation. Given the size of the index (more than 100 million gigabytes) this process can take many days to complete. The delay between a page being crawled and it showing up in the index used to be about 2-3 days on average, with sites being recrawled every one or two weeks. To be able to index breaking news stories Google split the index into layers, with the fastest layer being updated every 10 seconds. But the majority of the web is in the 'slow' layer which means we are stuck with a multi-day latency.
CommentRelated tweets
|
Last Comments
EventsUpcoming event
Last event
Bloggers
![]() Columns
Tagcloudppc google maps yahoo analytics maps london website street view europe seo business social event internet mobile android apple tools youtube bing realtime russia privacy google wave adwords gmail searchcowboys marketing google earth spain app search a4uexpo interview ads video facebook blog streetview newsMost Commented
Agenda |
Search
My BlogLogBlogroll |
© 2012 Searchcowboys.com - All Rights Reserved - All views and opinions expressed are those of the authors of Searchcowboys.
All trademarks, slogans, text or logo representation used or referred to in this website are the property of their respective owners. Sitemap
Comments (3)
Obviously Percolator is better for those who use google search and hence make more sense for white-hat SEO
Wed 13 Oct 2010, 01:05
See also my blog post on it http://bigdatacraft.com/archives/240
Wed 13 Oct 2010, 01:06
Yup you are right. Keyword rankings are coming faster than earlier and losing the rankings also in the same speed :) To keep the rankings we should have good content and quality back-links.
Mon 1 Nov 2010, 14:42