Home » Web Marketing
Web Design

The SEO History

  Features Overview
  Compare Plans
  Free Reports

Webmasters and content providers began optimizing sites for search engines in the mid-1990s, Initially, all a webmaster needed to do was submit a page, or URL, to the various engines which would send a spider to "crawl" that page, extract links to other pages from it, and return information found on the page to be indexed.

Site owners started to recognize the value of having their sites highly ranked and visible in search engine results. The earliest known use of the phrase "search engine optimization" was a spam message posted on Usenet on July 26, 1997.

Early versions of search algorithms relied on webmaster-provided information such as the keyword meta tag, or index files in engines. Web content providers also manipulated a number of attributes within the HTML source of a page in an attempt to rank well in search engines.

By relying so much on factors exclusively within a webmaster's control, early search engines suffered from abuse and ranking manipulation. To provide better results to their users, search engines had to adapt to ensure their “results pages” showed the most relevant search results, rather than unrelated pages stuffed with numerous keywords by unscrupulous webmasters. Search engines responded by developing more complex ranking algorithms, taking into account additional factors that were more difficult for webmasters to manipulate.

Graduate students at Stanford University, Larry Page and Sergey Brin developed "backrub", a search engine that relied on a mathematical algorithm to rate the prominence of web pages. The number calculated by the algorithm, PageRank, is a function of the quantity and strength of inbound links. PageRank estimates the likelihood that a given page will be reached by a web user who randomly surfs the web, and follows links from one page to another. In effect, this means that some links are stronger than others, as a higher PageRank page is more likely to be reached by the random surfer.
How Indexing works

Getting indexed

The leading search engines, Google, Yahoo! and Microsoft, use crawlers to find pages for their algorithmic search results. Pages that are linked from other search engine indexed pages do not need to be submitted because they are found automatically.

Some search engines, notably Yahoo!, operate a paid submission service that guarantee crawling for either a set fee or cost per click. Such programs usually guarantee inclusion in the database, but do not guarantee specific ranking within the search results. Yahoo's paid inclusion program has drawn criticism from advertisers and competitors.

Google offers Google Sitemaps, for which an XML type feed can be created and submitted for free to ensure that all pages are found, especially pages that aren't discoverable by automatically following links.

Search engine crawlers may look at a number of different factors when crawling a site. Not every page is indexed by the search engines. Distance of pages from the root directory of a site may also be a factor in whether or not pages get crawled

Precautions to avoid getting de-indexed

To avoid undesirable content in the search indexes, webmasters can instruct spiders not to crawl certain files or directories through the standard “robots.txt” file in the root directory of the domain. Additionally, a page can be explicitly excluded from a search engine's database by using a meta tag specific to robots. When a search engine visits a site, the robots.txt located in the root directory is the first file crawled. The robots.txt file is then parsed, and will instruct the robot as to which pages are not to be crawled. As a search engine crawler may keep a cached copy of this file, it may on occasion crawl pages a webmaster does not wish crawled. Pages typically prevented from being crawled include login specific pages such as shopping carts and user-specific content such as search results from internal searches.

In March 2007, Google warned webmasters that they should prevent indexing of internal search results because those pages are considered search spam

Sign up for FREE website & Competitive Analysis Reports

 
Zirez News
 
  • 9/24/2008
    Custom Website for MotorsportsInsuranceAdvisors.com Recently Launched by Zirez Design
  • 9/22/2008
    Custom Website for Striping Unlimited Recently Launched by Zirez Design
  • 9/10/2008
    The International Coach Federation - Charlotte Area Chapter (ICF-CAC) Introduces Their New Custom Website
 
  more