The Working of the Google Search Engine

Asked By 20 points N/A Posted on -

Google is the most popular search engine out there. How does it work? How does it manage to carry out searches in such a short time?

Answered By 60 points N/A #186922

The first step is crawling: Google follows links from one page to another. It sorts every page by its content and keeps track of all this information in the Index, which is over a million gigabytes in size. The second part of the process is the algorithms. Google has highly efficient keyword search algorithms that put the words of a query together to understand what the user is looking for. Based on this, they pull the relevant documents out of the Index, and the results are then ranked according to a number of factors. The whole process takes about 1/8th of a second. Any spam is dealt with appropriately.

Answered By 590495 points N/A #186924

The Google search engine works by first crawling websites. Crawling, or spidering, is the process by which Googlebot finds new and updated web pages to add to the Google index. Google uses an enormous set of computers to fetch (crawl) billions of pages on the web. The program that performs the fetching is called Googlebot; it is also known as a spider, bot, or robot.

Googlebot's fetching is driven by an algorithmic process: programs decide which websites to crawl, how often, and how many pages to fetch from each site. The crawl starts from a list of web page URLs generated by earlier crawls and augmented with sitemap data supplied by webmasters. As Googlebot crawls websites, the indexing process begins.
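To make the crawl step more concrete, here is a minimal sketch in Python. It is not Googlebot's actual logic. The `TOY_WEB` dictionary, the `crawl` function, and the `max_pages_per_site` limit are all made up for illustration, and an in-memory dictionary stands in for real HTTP fetches. It only shows the idea described above: start from a seed list (earlier crawl results plus sitemap URLs), follow links, and cap how many pages are taken from each site.

```python
from collections import deque
from urllib.parse import urlparse

# Hypothetical in-memory "web": each URL maps to the links found on that page.
TOY_WEB = {
    "http://a.example/": ["http://a.example/about", "http://b.example/"],
    "http://a.example/about": ["http://a.example/", "http://a.example/team"],
    "http://a.example/team": [],
    "http://b.example/": ["http://b.example/news", "http://a.example/"],
    "http://b.example/news": [],
}


def crawl(previous_urls, sitemap_urls, max_pages_per_site=2):
    """Breadth-first crawl from a seed list, with a per-site page cap."""
    # Seed list: URLs from earlier crawls, augmented with sitemap URLs
    # (dict.fromkeys removes duplicates while keeping the order).
    frontier = deque(dict.fromkeys(list(previous_urls) + list(sitemap_urls)))
    seen = set(frontier)
    pages_per_site = {}
    fetched = []
    while frontier:
        url = frontier.popleft()
        site = urlparse(url).netloc
        if pages_per_site.get(site, 0) >= max_pages_per_site:
            continue  # this site's page budget is already used up
        if url not in TOY_WEB:
            continue  # stands in for a failed fetch
        pages_per_site[site] = pages_per_site.get(site, 0) + 1
        fetched.append(url)
        # Queue every link on the page that has not been seen yet.
        for link in TOY_WEB[url]:
            if link not in seen:
                seen.add(link)
                frontier.append(link)
    return fetched


if __name__ == "__main__":
    print(crawl(["http://a.example/"], ["http://b.example/news"]))
```

With the default cap of two pages per site, `http://a.example/team` is discovered but never fetched, which is the kind of "how many pages per site" decision the crawl scheduler makes.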

Googlebot processes each page it crawls in order to compile an immense index of all the words it sees and their location on each page. In addition, Google processes information contained in key content tags and attributes, such as title tags and ALT attributes. Googlebot can process many content types, but not all.
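An index of "all words and their location on every page" is commonly built as an inverted index. The sketch below is only an illustration of that idea, not Google's implementation. The `PageText` class, `tokenize`, and `build_index` are hypothetical names, and the parsing uses Python's standard `html.parser` module. Text inside the title tag is picked up as ordinary page text, and ALT attribute values are added as well.

```python
import re
from collections import defaultdict
from html.parser import HTMLParser


class PageText(HTMLParser):
    """Collects the text of a page plus the values of ALT attributes."""

    def __init__(self):
        super().__init__()
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        # Keep the ALT text of images and similar elements.
        for name, value in attrs:
            if name == "alt" and value:
                self.chunks.append(value)

    def handle_data(self, data):
        # Text between tags, including the contents of <title>.
        self.chunks.append(data)


def tokenize(text):
    """Lowercase the text and split it into simple alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(pages):
    """pages: dict of url -> HTML string. Returns word -> {url: [positions]}."""
    index = defaultdict(dict)
    for url, html in pages.items():
        parser = PageText()
        parser.feed(html)
        parser.close()
        words = tokenize(" ".join(parser.chunks))
        for position, word in enumerate(words):
            index[word].setdefault(url, []).append(position)
    return index


if __name__ == "__main__":
    pages = {
        "p1": '<title>Red Bikes</title><p>Bikes for sale</p>'
              '<img src="x.png" alt="red bike">',
    }
    for word, postings in build_index(pages).items():
        print(word, postings)
```

Storing positions rather than just page names is what later makes it possible to look at where, and how often, a word appears on a page.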

For example, it cannot process the content of some dynamic pages or rich media files. The last step is serving the results. When a user enters a query in Google Search, Google's computers search the index for matching pages and return the results Google considers most relevant to the user.
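The serving step can be sketched as a lookup against a word index followed by a ranking pass. The example below is a toy under stated assumptions: the `INDEX` data and the `search` function are invented for illustration, and the score (total number of occurrences of the query words on a page) is a stand-in, since real relevance ranking weighs many more factors than this.

```python
# Hypothetical prebuilt index: word -> {url: [positions of the word on that page]}
INDEX = {
    "red": {"p1": [0, 5], "p2": [3]},
    "bikes": {"p1": [1, 2], "p3": [0]},
    "sale": {"p1": [4], "p2": [1]},
}


def search(index, query):
    """Return pages containing every query word, most occurrences first."""
    words = query.lower().split()
    if not words:
        return []
    # One postings dict per query word; unknown words match nothing.
    postings = [index.get(word, {}) for word in words]
    # A page matches only if it appears in the postings of every word.
    matching = set(postings[0])
    for posting in postings[1:]:
        matching &= set(posting)

    def score(url):
        # Toy relevance score: how many times the query words occur on the page.
        return sum(len(posting[url]) for posting in postings)

    # Highest score first; ties are broken by URL so the order is stable.
    return sorted(matching, key=lambda url: (-score(url), url))


if __name__ == "__main__":
    print(search(INDEX, "red sale"))
```

Because the work at query time is a few dictionary lookups and a set intersection rather than a scan of every page, answering from a prebuilt index is fast, which is the main reason the index is built ahead of time.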

Note that Google doesn't accept payment to crawl a site more frequently. It keeps the search side of its business separate from its revenue-generating side, the AdWords service.
