by Cyrus Taraporvala
Posted on January 20, 2004
The Google search engine can undoubtedly be classified as one of the best, in terms of the relevancy of the search made, for which Google determines the results shown by a system for ranking web pages with their software, known as PageRank.
As Google updates their index fairly frequently, approximately once a month, the ranking of web page results change as often, due to new sites being included, and some sites being dropped from the index, due to the possibility of your site not being accessible when the robots tried to crawl it, due to network or hosting problems.
Google may also permanently remove your site from their index if they feel that you are trying to get an unfair advantage in attempting to beat their system due to cloaking, including pages and links to your site with the sole intention of hoodwinking the robots, or including text in your pages that can only be seen by the robots, and not the user.
Google follows links from one page to the next, building and rebuilding their index. This is the biggest factor in determining which pages are indexed by the Googlebots, as it traverses the web from page to page, via hyperlinks. It is thereby extremely important that your web pages are linked from other sites, to determine the importance of your page ranking within the Google index.
The Google PageRank software uses this link structure to your site to determine the value of each individual page, and combines PageRank with sophisticated text-matching techniques to find pages that are both important and relevant to your search.
The only problem that I find with the Googlebot is that they limit the amount of crawling of dynamically generated pages. Their explanation for this is that their web crawlers can easily overwhelm and crash sites serving dynamic content, and that is why they limit the amount of dynamic pages they index. The file types that Google can index are pdf, asp, jsp, hdml, shtml, xml, cfml, doc, xls, ppt, rtf, wks, lwp, wri.
All said and done I rate Google as a top class search engine. When you perform a search, Google only produces results that match all of your search terms, either in the text of the page or in the text of the links pointing to the page, and shows a "snippet" of the text that matches your search query. You may have noticed the "I'm Feeling Lucky" button, which takes you directly to the site of the highest ranked result of your search.
Well, I certainly am a little GaGa over Google.
As Google updates their index fairly frequently, approximately once a month, the ranking of web page results change as often, due to new sites being included, and some sites being dropped from the index, due to the possibility of your site not being accessible when the robots tried to crawl it, due to network or hosting problems.
Google may also permanently remove your site from their index if they feel that you are trying to get an unfair advantage in attempting to beat their system due to cloaking, including pages and links to your site with the sole intention of hoodwinking the robots, or including text in your pages that can only be seen by the robots, and not the user.
Google follows links from one page to the next, building and rebuilding their index. This is the biggest factor in determining which pages are indexed by the Googlebots, as it traverses the web from page to page, via hyperlinks. It is thereby extremely important that your web pages are linked from other sites, to determine the importance of your page ranking within the Google index.
The Google PageRank software uses this link structure to your site to determine the value of each individual page, and combines PageRank with sophisticated text-matching techniques to find pages that are both important and relevant to your search.
The only problem that I find with the Googlebot is that they limit the amount of crawling of dynamically generated pages. Their explanation for this is that their web crawlers can easily overwhelm and crash sites serving dynamic content, and that is why they limit the amount of dynamic pages they index. The file types that Google can index are pdf, asp, jsp, hdml, shtml, xml, cfml, doc, xls, ppt, rtf, wks, lwp, wri.
All said and done I rate Google as a top class search engine. When you perform a search, Google only produces results that match all of your search terms, either in the text of the page or in the text of the links pointing to the page, and shows a "snippet" of the text that matches your search query. You may have noticed the "I'm Feeling Lucky" button, which takes you directly to the site of the highest ranked result of your search.
Well, I certainly am a little GaGa over Google.
COMMENT ON THIS ARTICLE...
No comments yet. Be the first one to comment.
It's all about Google
What Google Said When You Weren't Listening
How To Stay Out Of Google's Supplemental Index
What Google Said When You Weren't Listening
How To Stay Out Of Google's Supplemental Index
SEO Articles
Internet Marketing Articles
Development Articles
General Articles
And also in our Archives
Internet Marketing Articles
Development Articles
General Articles
And also in our Archives
Drive traffic to your business and get recognized as an industry leader by sharing your knowledge on Site-Reference. Authors are given a wide range of exclusive benefits here at SR; so checkout what we can offer to those that…

We’re always on the lookout for new writing talent so even if haven’t written for the web yet, feel free to contact us anytime
We’re always on the lookout for new writing talent so even if haven’t written for the web yet, feel free to contact us anytime





