Search Engine Optimization

Site Map


Site Map

 

Site Map - Site Map or Sitemap refers to 1. A web page listing all the pages in a Web site or 2. An XML or text format file that contains all links or a group of links that can be submitted to search engines to ensure that pages in your are registered.

Web Page Site Maps

HTML Site Maps were originally used simply to provide visitors with a convenient navigation tool, and to make sure that all pages were accessible without cluttering up the main page. Search engines soon began to look for such pages and to use the information in them to quickly gather information about the Web site.

Some Web page site maps use fancy Javascript menus, but since these are usually not accessible to search engines, it may defeat the purpose of the site map from the standpoint of optimization, though it may be useful or attractive to visitors.

A site map should use Absolute Path (full path) links to each page in the Web site, and the code should be as uncluttered as possible. The page should be named Site_Map.htm or site_map.html and there must be a full path link from the main page of the Web site to the site map page so that the search engines can find it. The site map for this site is here:

The site map page for this Web site is here: site map. If a site map is larger than 100 or 200 links, it is recommended that you start a new sitemap, linked to the first. However, the actual capacities or limitations of major search engine spiders are probably much larger,

Site Map Files for Search Engines

Recently a standard (the ROR standard) was introduced for XML site map files that can be submitted to different search engines. These allow you to append various information such as the frequency with which the pages might change, You can now tell search engine spiders where the XML sitemap is with this directive in your Robots.txt file:

Sitemap: http://www.mysite.com/sitemap.xml

You can and should submit site maps to Google Webmaster. Site map submissions should be consistent. Always use one form of the URL for pages. That is, list every page as

http://www.mysite.com/sitemap.xml or http://mysite.com/sitemap.xml but never both.

www.sitemaps.org/  explains the standard.

There are various free or partly free tools that supposedly help you create an XML standard site map automatically, such as this one: http://www.xml-sitemaps.com/. Many of the ones I tried are of limited capacity or do not work at all.

A Google explanation of XML site maps is here:

http://www.google.com/support/webmasters/bin/answer.py?hl=en&answer=40318

 

Google explains that Site Maps are especially important for special materials such as flash files that might not otherwise be intexed properly. Fair enough.

 

Google has a tool for generating sitemaps, but it is a python script you need to have installed on your Web server. It's OK if you have access to the server and are up for that. Read about the tool here:

http://www.google.com/support/webmasters/bin/answer.py?answer=34634

 

A word of caution: Be careful what you tell Google and other search engines about your pages in an XML site map and make sure it is correct. It is better to give no information than bad information sometimes. For example, marking pages "Dynamic" (sounds good) is a bad idea, as it may degrade the listings of those pages. The search engine algorithm "decides" that if the page content is going to change all the time, it is not worthwhile giving that page a high ranking for a particular key word. That's logical, but it can mess you up.

 

Text Site Map Files

Google and some other search engines can read text site map files. You can prepare these by hand in various ingenious ways and without too much trouble. They are just a list of URLS at the site. This page tells you how to create a text file site map:

http://www.google.com/support/webmasters/bin/answer.py?hl=en&answer=71450

Google wants you to submit the text files to their site map generate, which can then turn them into XML sitemaps, but you do not have to do that unless you need to give the search engine the special information needed in the map.

 

Usefulness of XML and Text Site Maps

If you have a site that is not being indexed properly by Google and/or other search engines then you should certainly try submitting a site map. "Indexed properly" means that a new page is indexed after a day or two for sites with Google Pagerank 6 on the main page, or it can mean that a page is indexed after a month for a small new site that has pagerank 3 or less. If a page doesn't show up for 6 months there might be something wrong with the page or the links to that page, or there just might be some glitch in the search engine that prevents it from being listed.  Search engines do not guarantee to list pages submitted by site map, but in practice, if the pages are not junk, they do seem to get listed within a few hours of submitting the site map. For small, new Web sites, this can be really important.

However, before resorting to site maps, you should try propagating the new links to Web logs and forums and directories. Web logs have RSS feeds that Google and other search engines can check to pick up new links quickly.

Obviously, if you have thousands or tens of thousands of pages at your Web site, it is impossible to put them all in a single XML site map, and you will need to submit several maps. It is usually unnecessary to submit such maps, since Google and other search engines will get around to indexing all your pages eventually especially if you have a large site. Sites with a high pagerank tend to be crawled very frequently

 

Ami Isseroff

December 1, 2008

 

Note - Definitions of Search Engine Optimization terms are based on inferences from common usage and definitions given by other sources. Conclusions about search engine behavior are based on understanding of the behavior of the most popular search engines. Both are subject to error or may change. Search engine company management may define or use a term or set or change any policy in any way they see fit, and may make these definitions and specifications public or not. These decisions and definitions are beyond our control.  

Notice: Copyright

All materials are copyright 2008 by Ami Isseroff. All rights reserved. These pages may not be reproduced in any form in electronic or printed media without express written permission from the author.

SEO Glossary

Note - Definitions of Search Engine Optimization terms are based on inferences from common usage and definitions given by other sources. Conclusions about search engine behavior are based on understanding of the behavior of the most popular search engines. Both are subject to error or may change. Search engine company management may define or use a term or set or change any policy in any way they see fit, and may make these definitions and specifications public or not. These decisions and definitions are beyond our control.  

Notice: Copyright

All materials are copyright 2008, 2009 by Ami Isseroff. All rights reserved. These pages may not be reproduced in any form in electronic or printed media without express written permission from the author.

SEO Glossary

SEO

SEO Basics

The SEO Book

SEO Articles

SEO Blog

More Links

MidEastWeb: Middle East

Zionism

SEO - Web Site Search Engine Optimization Contact: Webmaster(at)Yu-hu.com
site map
Site Map