![]() |
|
| >> Web Hosting Geeks // Web Hosting Articles // SEO - Search Engine Optimization |
|
|
Release from Google Sandbox Only to Search the Playground
The Google Sandbox Effect has been discussed at length in our case study of a new website first crawled in May by Googlebot. We can now further the case study with indexing comparisons and discuss interesting Googlebot crawler behavior after release, at the 75 day mark, of the study website from that very confining Sandbox. This case study is not for the faint of heart - those just launching a new web business on a new domain name with hopes of instant indexing and immediate traffic may find their website very lonely for two and a half months - if it is in a competitive market segment. You may as well plan to stay in the Google Sandbox for at least 45 days on average. If some early release stories are to be believed, search phrases nobody wants to play with are taken pity on by Google and sent home for early release. Those non-competitive or obscure search phrases seem to be seen as good, quiet little children, playing by themselves in Sandbox playground and are sent home early on good behavior. Googlebot probably sees good behavior as playing well with others, like a good little baby domain and NOT being competitive as some young domains can be. Throwing sand in other childrens' faces and insisting on having your site indexed, throwing sand out of the Sandbox with your bright plastic toy shovel and bucket will not be allowed. Now that the site discussed in this study is out of the Sandbox, it still lingers on the playground, unable to escape the community park and leave for the business world to play with the big boys in the outside world. It does indeed take time to grow up and be the model citizen in this new search playground. Though on the first full day after this first week of being released from the sandbox, the site has gotten 68 visitors referred by searches done at Google, the first referred search traffic coming into the site. MSN has sent 8 visitors, Yahoo has sent 6, 4 came from AOL searches, 2 from Netscape and 1 from Dogpile. The indexing behavior of Yahoo and MSN has been nothing short of bizarre with numbers of indexed pages increasing rapidly over the first two months to reflect 6,941 pages indexed until 8 weeks into this study and we outlined previously how numbers changed as you click through results pages first upward, then downward to about half the total of highest numbers listed along the top of the results pages. It appears that Yahoo and MSN are playing on the 'slippery slide' in this playground, climbing to the top of the ladder of results at about 10 week mark showing 8,210 and 6,941 pages respectively indexed, then sliding down again to 3,510 for Yahoo and 373 for MSN, as of this writing two weeks later on August 6. Still, Yahoo will show you only 1,000 (100 pages) of those results and MSN will show you only 250 results, or 25 pages, no matter how many they claim to index. MSNbot is crawling the site faster and more consistently than any of the engines, yet shows by far fewer pages indexed than the others. One of the interesting comparisons between Google and MSN in our Sandbox study is that Google will show you most of what they claim to have indexed after you click that link at the bottom of the first page showing only 3 or 4 results when you use the "site:Publish101.com" query operator then go to the bottom of the page and click the link under the line reading, "In order to show you the most relevant results, we have omitted some entries very similar to the 3 already displayed. If you like, you can repeat the search with the omitted results included." Go ahead and click that link, then you'll be presented with the claimed total of indexed pages. That number has very steadily increased since Sandbox release after 75 days from first crawling of this Sandbox study site. The timing and numbers of indexed pages at Google goes upward, and ONLY upward with VERY distinct patterns noted from raw log files. Crawling schedules seem to have been established for this site by Google and indexing changes occur on a very regular schedule. The first observation of Sandbox release was at noon on Thursday July 28, seventy-five days from first crawling by Googlebot when a search turned up 379 pages indexed with a "site:Publish101.com" query. That number increased later the same evening to 3,660 pages at a search done around the dinner hour Pacific time. Oddly, the next day, Friday July 29, the number took a slight hop upward to 3,700 pages and on the following Monday, showed 3,770 pages indexed. That schedule and pattern have repeated on the second week of Sandbox release when a "site:Publish101.com" query produced 5,660 results from from Google for the site on Thursday August 4 at just after noon and then nearly doubled at around the dinner hour to 10,700 pages on that same query. A final check just now on Saturday shows it at 12,100 pages indexed by Google. It should be pointed out to those who wonder about the total number of pages that this is a dynamic site with a very large archive of articles that increases daily as new submissions are contributed by member authors at the site. Those articles are added through a content management system on a daily basis by an editor who reviews submissions and processes them for approvals or rejections. Those approved are made live from the home page nightly. We've started doing this on the crawler's schedules as we've noted very regular visits by Yahoo's Slurp crawler to the site home page just once daily at around 5pm each evening and Googlebot visiting the home page only once, at near 11pm nightly, so we've instituted a midnight activation of each day's new article submissions on the home page of the site so that none of the new pages are missed by those crawlers. MSNbot seems to hit the home page multiple times through the day, so timing is less important for MSN. Crawler activity has been heated, with Yahoo crawling the least and the slowest, barely seeming to attempt any updates and the total of indexed pages has not changed for over three weeks since it peaked at 8,210 pages indexed and then dropped to it's current level of 3,510. As previously stated, Slurp seems to be unhindered by any form of consistency in indexing or crawling behavior. MSNbot has crawled extensively and fairly regularly for weeks, but that odd indexing behavior is a serious flaw in their utility as a search tool. It should be mentioned here that AskJeeves had been noted to crawl the site extensively early in this case study and displayed a very regular and consistent crawl, but stopped abruptly three weeks ago on july 13, after hitting most of the pages then available on the site. Teoma, their spider, has been absent ever since and they have not indexed this domain at all since first crawling on May 23, over 10 weeks ago. Clearly, Teoma appears to have the longest Sandbox of all the search engines. Much has been learned in this Sandbox case study about crawler behavior, indexing delays, robots.txt requirements and index updates at each of the top three search engines. Where that knowledge leads will, of course, change as algorithms and crawling schedules are adjusted by MSN, Yahoo and Google. But valuable information has been shared that may help other webmasters to better understand each of the factors that determine the success of any website. "Further findings in follow-up articles at the 3, 6 and 9 month marks, explore search referrals gained as Google adds more pages and rankings fluctuations begin to level. Meanwhile, we'd like to encourage others to publicly review their crawler traffic through logs to compare behavior on new domains to verify findings and disclose indexing behavior and timing for new domains and further document SE indexing as well as crawling behavior. Copyright © August 6, 2005 Previous Sandbox Case Study Articles: http://Publish101.com/Sandbox2 http://Publish101.com/Sandbox3 http://Publish101.com/Sandbox4 Mike Banks Valentine is a search engine optimization specialist
MORE RESOURCES:
Google News |
RELATED ARTICLES
Expert Help From Google Answers Web users turn to search engines for answers to their questions. This is usually done through various levels of searching the engine's database. The Business Case for SEO It's interesting how potential clients have preconceived notions about which aspects of search engine marketing have the most value. In fact, they tend to fall into two camps that are 180° apart. Google Loosing Fan Base? "Nothing last forever but the Earth and sky." - from Dust in the Wind by Kansas. PPC Advertising - The First Step In A SEO Marketing Campaign Often, sites view seo and PPC marketing as exclusive marketing techniques. Each marketing method has its advocates. Designing a Better System for Search Engines Designing a Better System for Search Engines and Information Distribution, information and knowledge are power. Having run out of things to study today, I thought about Researching and Research. What are My Chances to Get the First Place in Search Engine Listings? You must have heard the stories how people became rich and famous with their websites. How could they achieve this? Their websites took a first position in search engine listings targeting popular keywords. Work With The Search Engines - Dont try to Outsmart the Search Engines Contrary to the claims of high-priced SEO firms, optimizing your web site for search engines is not brain surgery. But you must first accept the fact that "spiders" - the search engine programs that read web pages - run away from non-HTML code. Promoting Home Business: Tips to Increase Web Site Sales You've selected an appropriate Online Business Opportunity. That is not ALL!To run a successful online ecommerce Home business, getting the targetted users to visit your site and converting them into customers is the first and foremost thing. Top 2 Ways To Get Higher Rankings in Major Search Engines Top 10 search engine rankings. Everybody wants it but a few achieve it. Complete Web-Site Optimization For Search Engines (Part 1) SEO or search engine optimization strategy now becomes widely popular among online business operators. Nothing strange about it as it allows to substantially increase your gross income, as a result of growing traffic or visitors flow. Page Rank Purgatory - Simple Things You can Do to Keep Your Web Site Out of Search Engine Hell! Are Meta Tags Really Dead?Right in their Guidelines Yahoo Tells You that Meta-Tags are Not Totally Dead and Buried(The Below Information was Taken Directly From the Yahoo Help File http://help.yahoo. Google, Adsense, SEO, and How It All Works Google uses an algorithm to determine the search engine results (SERPS). The algorithm is based upon certain factors that include keyword density, Meta Tags, anchor tags, image tags, back links, etc etc etc. Offpage Optimization: Does Article Marketing Cut the Mustard? For those who haven't heard: article marketing is the new offpage optimization strategy that works like magic and won't cost you a dime. What's the strategy? You create an arsenal of short, well-written keyword articles aimed at your target customer and include your URL at the bottom. Ten Steps To A Well Optimized Website - Step 3: Site Structure Welcome to part three in this search engine positioning series. Last week we discussed the importance and considerations that much be made while creating the content that will provide the highest ROI for your optimization efforts. Use Keyword Articles for Search Engine Optimization Search engine optimization is very important for your online business if you are interested in staying in business. The reason for this is most of the time the large search engines direct traffic to your website, however your website must include what the search engine is looking for in order for your web page to be returned as a result. Maximize Your Search Engine Traffic - 13 Ways to Pull in More Visitors From the Search Engines Maximizing traffic from the search engines to your web site is not a difficult task but it does require you to think ahead and plan your Search engine optimization strategy carefully. If you have not yet built your web site and are still in the initial planning stages then you may have an easier time of it. Googles Good-Writing Filter I was recently struck by the fact that the top-ranking web pages on Google are consistently much better written than the vast majority of what one reads on the web. Yet traditional SEO wisdom has little to say about good writing. Its Not Just All About Google Anymore Those webmasters that stick to the old ways and focus entirely on Google are missing out on a lot of search traffic these days if they are not also well ranked by Yahoo and MSN.For the first few months after Yahoo decided to go their own way with natural search (and MSN decided to get serious about the search business), the search results provided by those two could only be described as bizarre. Developing A List Of Keywords For Marketing Keywords aren't just some words that allow search engines, like Google, to find your web site. They are also key elements for creating attractive language to use in your marketing or advertising material. Search Engine Spiders Lost Without Guidance - Post This Sign! The robots.txt file is an exclusion standard required by all web crawlers/robots to tell them what files and directories that you want them to stay OUT of on your site. |
|
|
|
|
| © 2004 - 2008 "Web Hosting Geeks" | Web Hosting Reviews | Customer Reviews | RealMetrics Reviews | Hosting Articles | Directory | Partners | Contacts Over 7000 articles: web hosting, web development, domain names, ecommerce, web design, site promotion, ppc advertising, seo, site promotion and many others. Web hosting reviews, ratings and awards are not based on any incentives or commissions. Names and trademarks are the properties of their respective owners. A direct link to Web Hosting Geeks (http://webhostinggeeks.com) must be provided in order to use any of the above information. Contact us for more info. Partners: Hosts by speed, Cheap Website Hosting, Free Website Hosting, Cheap Web Hosting, Top 10 Web Hosts, Top 10 Web Hosting Deals, Best Website Hosting, Free Web Hosting, Free Web Hosting, Dedicated Server Hosting, Adult Web Hosting, Web Hosting Discussions, Dedicated Server Reviews, Best Web Hosting, Web Hosting Discounts, HostProfessor.com, rsuog, halyava, PHP Website Hosting Services, Web Hosting Reviews, Hosting Uptime, Best Web Hosting Reviews, Cheap Webhosting, Web Hosting, Flash Templates, CMS Templates, Web Hosting Reviews, Website Hosting Reviews, Web Hosting Providers, Best Web Hosting, Top Web Hosting, RSUOG Web Hosting |