<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>redsauce.com &#187; spidering</title>
	<atom:link href="http://www.redsauce.com/tag/spidering/feed" rel="self" type="application/rss+xml" />
	<link>http://www.redsauce.com</link>
	<description>Chips, eggs, websites - they all need redsauce</description>
	<lastBuildDate>Thu, 29 Jul 2010 14:15:48 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0</generator>
		<item>
		<title>XML Sitemap Basics &#8211; How To Get Your Site Crawled More Efficiently</title>
		<link>http://www.redsauce.com/xml-sitemap-basics-how-to-get-your-site-crawled-more-efficiently-688?utm_source=rss&amp;utm_medium=rss&amp;utm_campaign=xml-sitemap-basics-how-to-get-your-site-crawled-more-efficiently</link>
		<comments>http://www.redsauce.com/xml-sitemap-basics-how-to-get-your-site-crawled-more-efficiently-688#comments</comments>
		<pubDate>Wed, 04 Nov 2009 16:36:49 +0000</pubDate>
		<dc:creator>Amy</dc:creator>
				<category><![CDATA[Programming]]></category>
		<category><![CDATA[crawling]]></category>
		<category><![CDATA[sitemaps]]></category>
		<category><![CDATA[spidering]]></category>
		<category><![CDATA[XML]]></category>
		<category><![CDATA[XML sitemaps]]></category>

		<guid isPermaLink="false">http://www.redsauce.com/?p=688</guid>
		<description><![CDATA[As a developer at redsauce.com, I am always looking for the most efficient way of doing things, whether it be a PHP script or a database schema. Working on a truly huge site recently, it became obvious that google&#8217;s crawling was causing performance issues with the server the site was hosted on. If google doesn&#8217;t [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignleft" title="XML sitemaps" src="http://www.spycomponents.com/images/xml_at_work.gif" alt="" width="293" height="180" />As a developer at <a title="redsauce.com" href="http://www.redsauce.com/">redsauce.com</a>, I am always looking for the most efficient way of doing things, whether it be a PHP script or a database schema. Working on a truly huge site recently, it became obvious that google&#8217;s crawling was causing performance issues with the server the site was hosted on. If google doesn&#8217;t already know what pages your site contains (i.e. if its a new site), the google bot will have to go through every link on your site, possibly visiting certain pages several times.</p>
<p><span id="more-688"></span>Of course this is inefficient but we can solve this problem by helping google and other search engines to find our content with an XML sitemap. An XML sitemap can tell a search engine more information about a page such when it was last modified, how important the page is as well as how often it is updated. By providing this information to the search engines, they can crawl your site more efficiently, neither wasting their resources or yours.</p>
<p>Creating an XML sitemap is extremely simple. Here is an example:</p>

<div class="wp_syntax"><div class="code"><pre class="xml" style="font-family:monospace;"><span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;?xml</span> <span style="color: #000066;">version</span>=<span style="color: #ff0000;">'1.0'</span> <span style="color: #000066;">encoding</span>=<span style="color: #ff0000;">'UTF-8'</span><span style="color: #000000; font-weight: bold;">?&gt;</span></span>
<span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;urlset</span> <span style="color: #000066;">xmlns</span>=<span style="color: #ff0000;">&quot;http://www.sitemaps.org/schemas/sitemap/0.9&quot;</span></span>
<span style="color: #009900;">        <span style="color: #000066;">xmlns:xsi</span>=<span style="color: #ff0000;">&quot;http://www.w3.org/2001/XMLSchema-instance&quot;</span></span>
<span style="color: #009900;">        <span style="color: #000066;">xsi:schemaLocation</span>=<span style="color: #ff0000;">&quot;http://www.sitemaps.org/schemas/sitemap/0.9</span>
<span style="color: #009900;">                            http://www.sitemaps.org/schemas/sitemap/0.9/sitemap.xsd&quot;</span><span style="color: #000000; font-weight: bold;">&gt;</span></span>
    <span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;url<span style="color: #000000; font-weight: bold;">&gt;</span></span></span>
        <span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;loc<span style="color: #000000; font-weight: bold;">&gt;</span></span></span>http://www.redsauce.com/<span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;/loc<span style="color: #000000; font-weight: bold;">&gt;</span></span></span>
        <span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;lastmod<span style="color: #000000; font-weight: bold;">&gt;</span></span></span>2009-11-04<span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;/lastmod<span style="color: #000000; font-weight: bold;">&gt;</span></span></span>
        <span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;changefreq<span style="color: #000000; font-weight: bold;">&gt;</span></span></span>daily<span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;/changefreq<span style="color: #000000; font-weight: bold;">&gt;</span></span></span>
        <span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;priority<span style="color: #000000; font-weight: bold;">&gt;</span></span></span>1<span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;/priority<span style="color: #000000; font-weight: bold;">&gt;</span></span></span>
    <span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;/url<span style="color: #000000; font-weight: bold;">&gt;</span></span></span>
<span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;/urlset<span style="color: #000000; font-weight: bold;">&gt;</span></span></span></pre></div></div>

<p>For each individual page, you can specify the URL between the &lt;loc&gt;&lt;/loc&gt; tags, when the page was last updated between the &lt;lastmod&gt;&lt;/lastmod&gt; tags, how often the page is updated between the &lt;changefreq&gt;&lt;/changefreq&gt; tags and how important the page is between the &lt;priority&gt;&lt;/priority&gt; tags.</p>
<p>The priority of a page can be between 0.1 and 1; 1 being your most important page, e.g. your home page and the deeper you go into your site the less the priority score should be. By providing all this information, search engines can crawl your pages more intelligently without wasting your server&#8217;s resources.</p>
<p>Once you have finished creating your XML file, save it to the root of your server and name it sitemap.xml for example. You can let the search engines know it exists by adding this line to a robots.txt file (placed at the root of your domain):</p>
<pre>Sitemap: http://www.<span>redsauce.com/sitemap.xml</span></pre>
<p>or by notifying google in webmaster tools about the location of your XML sitemap file.</p>
<p>An XML sitemap file can contain a maximum of 50,000 URLs and cannot exceed a maximum of 10MB file size. You can compress XML sitemap files to save your bandwidth but still mustn&#8217;t exceed 10MB once uncompressed.</p>
<p>You can write a script to create your sitemap from the pages on your site, or if your site doesn&#8217;t contain too many pages, you can use an online service such as <a title="xml-sitemaps.com" href="http://www.xml-sitemaps.com/">xml-sitemaps.com</a> to generate the sitemap file for you.</p>
<!-- Social Bookmarks BEGIN -->
<div class="social_bookmark">
<a title="Click me to see the sites." href="#" onclick="$$('div.d688').each( function(e) { e.visualEffect('slide_down',{duration:2.5}) }); return false;"><strong><em>Bookmark It</em></strong></a>
<br />
<div class="d688" style="overflow:hidden">
<br />
<a onclick="window.open(this.href, '_blank', 'scrollbars=yes,menubar=no,height=600,width=750,resizable=yes,toolbar=no,location=no,status=no'); return false;" href="http://buzz.yahoo.com/submit?submitUrl=http%3A%2F%2Fwww.redsauce.com%2Fxml-sitemap-basics-how-to-get-your-site-crawled-more-efficiently-688&amp;submitHeadline=XML+Sitemap+Basics+%26%238211%3B+How+To+Get+Your+Site+Crawled+More+Efficiently&amp;submitSummary=" rel="nofollow" title="Add to&nbsp;Buzz"><img class="social_img" src="http://www.redsauce.com/wp-content/plugins/social-bookmarks/images/buzz.png" title="Add to&nbsp;Buzz" alt="Add to&nbsp;Buzz" /></a>
<a onclick="window.open(this.href, '_blank', 'scrollbars=yes,menubar=no,height=600,width=750,resizable=yes,toolbar=no,location=no,status=no'); return false;" href="http://del.icio.us/post?url=http%3A%2F%2Fwww.redsauce.com%2Fxml-sitemap-basics-how-to-get-your-site-crawled-more-efficiently-688&amp;title=XML+Sitemap+Basics+%26%238211%3B+How+To+Get+Your+Site+Crawled+More+Efficiently" rel="nofollow" title="Add to&nbsp;Del.icio.us"><img class="social_img" src="http://www.redsauce.com/wp-content/plugins/social-bookmarks/images/delicious.png" title="Add to&nbsp;Del.icio.us" alt="Add to&nbsp;Del.icio.us" /></a>
<a onclick="window.open(this.href, '_blank', 'scrollbars=yes,menubar=no,height=600,width=750,resizable=yes,toolbar=no,location=no,status=no'); return false;" href="http://digg.com/submit?phase=2&amp;url=http%3A%2F%2Fwww.redsauce.com%2Fxml-sitemap-basics-how-to-get-your-site-crawled-more-efficiently-688&amp;title=XML+Sitemap+Basics+%26%238211%3B+How+To+Get+Your+Site+Crawled+More+Efficiently" rel="nofollow" title="Add to&nbsp;digg"><img class="social_img" src="http://www.redsauce.com/wp-content/plugins/social-bookmarks/images/digg.png" title="Add to&nbsp;digg" alt="Add to&nbsp;digg" /></a>
<a onclick="window.open(this.href, '_blank', 'scrollbars=yes,menubar=no,height=600,width=750,resizable=yes,toolbar=no,location=no,status=no'); return false;" href="http://www.facebook.com/sharer.php?u=http%3A%2F%2Fwww.redsauce.com%2Fxml-sitemap-basics-how-to-get-your-site-crawled-more-efficiently-688" rel="nofollow" title="Add to&nbsp;Facebook"><img class="social_img" src="http://www.redsauce.com/wp-content/plugins/social-bookmarks/images/facebook.png" title="Add to&nbsp;Facebook" alt="Add to&nbsp;Facebook" /></a>
<a onclick="window.open(this.href, '_blank', 'scrollbars=yes,menubar=no,height=600,width=750,resizable=yes,toolbar=no,location=no,status=no'); return false;" href="http://www.google.com/bookmarks/mark?op=edit&amp;output=popup&amp;bkmk=http%3A%2F%2Fwww.redsauce.com%2Fxml-sitemap-basics-how-to-get-your-site-crawled-more-efficiently-688&amp;title=XML+Sitemap+Basics+%26%238211%3B+How+To+Get+Your+Site+Crawled+More+Efficiently" rel="nofollow" title="Add to&nbsp;Google Bookmarks"><img class="social_img" src="http://www.redsauce.com/wp-content/plugins/social-bookmarks/images/google.png" title="Add to&nbsp;Google Bookmarks" alt="Add to&nbsp;Google Bookmarks" /></a>
<br />
<a onclick="window.open(this.href, '_blank', 'scrollbars=yes,menubar=no,height=600,width=750,resizable=yes,toolbar=no,location=no,status=no'); return false;" href="http://www.mister-wong.com/index.php?action=addurl&amp;bm_url=http%3A%2F%2Fwww.redsauce.com%2Fxml-sitemap-basics-how-to-get-your-site-crawled-more-efficiently-688&amp;bm_description=XML+Sitemap+Basics+%26%238211%3B+How+To+Get+Your+Site+Crawled+More+Efficiently" rel="nofollow" title="Add to&nbsp;Mister Wong"><img class="social_img" src="http://www.redsauce.com/wp-content/plugins/social-bookmarks/images/misterwong.png" title="Add to&nbsp;Mister Wong" alt="Add to&nbsp;Mister Wong" /></a>
<a onclick="window.open(this.href, '_blank', 'scrollbars=yes,menubar=no,height=600,width=750,resizable=yes,toolbar=no,location=no,status=no'); return false;" href="http://www.netscape.com/submit/?U=http%3A%2F%2Fwww.redsauce.com%2Fxml-sitemap-basics-how-to-get-your-site-crawled-more-efficiently-688&amp;T=XML+Sitemap+Basics+%26%238211%3B+How+To+Get+Your+Site+Crawled+More+Efficiently" rel="nofollow" title="Add to&nbsp;Netscape"><img class="social_img" src="http://www.redsauce.com/wp-content/plugins/social-bookmarks/images/netscape.png" title="Add to&nbsp;Netscape" alt="Add to&nbsp;Netscape" /></a>
<a onclick="window.open(this.href, '_blank', 'scrollbars=yes,menubar=no,height=600,width=750,resizable=yes,toolbar=no,location=no,status=no'); return false;" href="http://reddit.com/submit?url=http%3A%2F%2Fwww.redsauce.com%2Fxml-sitemap-basics-how-to-get-your-site-crawled-more-efficiently-688&amp;title=XML+Sitemap+Basics+%26%238211%3B+How+To+Get+Your+Site+Crawled+More+Efficiently" rel="nofollow" title="Add to&nbsp;reddit"><img class="social_img" src="http://www.redsauce.com/wp-content/plugins/social-bookmarks/images/reddit.png" title="Add to&nbsp;reddit" alt="Add to&nbsp;reddit" /></a>
<a onclick="window.open(this.href, '_blank', 'scrollbars=yes,menubar=no,height=600,width=750,resizable=yes,toolbar=no,location=no,status=no'); return false;" href="http://www.stumbleupon.com/submit?url=http%3A%2F%2Fwww.redsauce.com%2Fxml-sitemap-basics-how-to-get-your-site-crawled-more-efficiently-688&amp;title=XML+Sitemap+Basics+%26%238211%3B+How+To+Get+Your+Site+Crawled+More+Efficiently" rel="nofollow" title="Add to&nbsp;Stumble Upon"><img class="social_img" src="http://www.redsauce.com/wp-content/plugins/social-bookmarks/images/stumbleupon.png" title="Add to&nbsp;Stumble Upon" alt="Add to&nbsp;Stumble Upon" /></a>
<a onclick="window.open(this.href, '_blank', 'scrollbars=yes,menubar=no,height=600,width=750,resizable=yes,toolbar=no,location=no,status=no'); return false;" href="http://www.technorati.com/faves?add=http%3A%2F%2Fwww.redsauce.com%2Fxml-sitemap-basics-how-to-get-your-site-crawled-more-efficiently-688" rel="nofollow" title="Add to&nbsp;Technorati"><img class="social_img" src="http://www.redsauce.com/wp-content/plugins/social-bookmarks/images/technorati.png" title="Add to&nbsp;Technorati" alt="Add to&nbsp;Technorati" /></a>
<br />
<a onclick="window.open(this.href, '_blank', 'scrollbars=yes,menubar=no,height=600,width=750,resizable=yes,toolbar=no,location=no,status=no'); return false;" href="http://tipd.com/submit.php?url=http%3A%2F%2Fwww.redsauce.com%2Fxml-sitemap-basics-how-to-get-your-site-crawled-more-efficiently-688" rel="nofollow" title="Add to&nbsp;Tip'd"><img class="social_img" src="http://www.redsauce.com/wp-content/plugins/social-bookmarks/images/tipd.png" title="Add to&nbsp;Tip'd" alt="Add to&nbsp;Tip'd" /></a>
<a onclick="window.open(this.href, '_blank', 'scrollbars=yes,menubar=no,height=600,width=750,resizable=yes,toolbar=no,location=no,status=no'); return false;" href="http://twitter.com/home/?status=Check+out+XML+Sitemap+Basics+%26%238211%3B+How+To+Get+Your+Site+Crawled+More+Efficiently+@+http%3A%2F%2Fwww.redsauce.com%2Fxml-sitemap-basics-how-to-get-your-site-crawled-more-efficiently-688" rel="nofollow" title="Add to&nbsp;Twitter"><img class="social_img" src="http://www.redsauce.com/wp-content/plugins/social-bookmarks/images/twitter.png" title="Add to&nbsp;Twitter" alt="Add to&nbsp;Twitter" /></a>
<a onclick="window.open(this.href, '_blank', 'scrollbars=yes,menubar=no,height=600,width=750,resizable=yes,toolbar=no,location=no,status=no'); return false;" href="http://myweb2.search.yahoo.com/myresults/bookmarklet?u=http%3A%2F%2Fwww.redsauce.com%2Fxml-sitemap-basics-how-to-get-your-site-crawled-more-efficiently-688&amp;t=XML+Sitemap+Basics+%26%238211%3B+How+To+Get+Your+Site+Crawled+More+Efficiently" rel="nofollow" title="Add to&nbsp;Yahoo My Web"><img class="social_img" src="http://www.redsauce.com/wp-content/plugins/social-bookmarks/images/yahoo.png" title="Add to&nbsp;Yahoo My Web" alt="Add to&nbsp;Yahoo My Web" /></a>
<br />
<a style="font-size:90%;text-align: right; " title="Click me to hide the sites." href="#" onclick="$$('div.d688').each( function(e) { e.visualEffect('slide_up',{duration:0.5}) }); return false;">Hide Sites</a>
</div>
</div>
<!-- Social Bookmarks END -->
<script type="text/javascript">$$('div.d688').each( function(e) { e.visualEffect('slide_up',{duration:0.5}) }); </script>]]></content:encoded>
			<wfw:commentRss>http://www.redsauce.com/xml-sitemap-basics-how-to-get-your-site-crawled-more-efficiently-688/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
