<?xml version="1.0" encoding="utf-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	>
<channel>
	<title>Comments on: Managing lots of servers</title>
	<atom:link href="http://www.snell-pym.org.uk/archives/2008/08/27/managing-lots-of-servers/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.snell-pym.org.uk/archives/2008/08/27/managing-lots-of-servers/</link>
	<description>Sarah and Alaric Snell-Pym living in interesting times</description>
	<pubDate>Thu, 09 Feb 2012 02:27:03 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.7</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: @ndy</title>
		<link>http://www.snell-pym.org.uk/archives/2008/08/27/managing-lots-of-servers/comment-page-1/#comment-78153</link>
		<dc:creator>@ndy</dc:creator>
		<pubDate>Thu, 28 Aug 2008 09:53:35 +0000</pubDate>
		<guid isPermaLink="false">http://www.snell-pym.org.uk/?p=810#comment-78153</guid>
		<description>&lt;p&gt;Hi,&lt;/p&gt;

&lt;p&gt;As David points out, the "Congruent Infrastructure" stuff at http://www.infrastructures.org/ is a good read.&lt;/p&gt;
</description>
		<content:encoded><![CDATA[<p>Hi,</p>

<p>As David points out, the "Congruent Infrastructure" stuff at <a href="http://www.infrastructures.org/" rel="nofollow">http://www.infrastructures.org/</a> is a good read.</p>]]></content:encoded>
	</item>
	<item>
		<title>By: David McBride</title>
		<link>http://www.snell-pym.org.uk/archives/2008/08/27/managing-lots-of-servers/comment-page-1/#comment-78124</link>
		<dc:creator>David McBride</dc:creator>
		<pubDate>Wed, 27 Aug 2008 22:02:20 +0000</pubDate>
		<guid isPermaLink="false">http://www.snell-pym.org.uk/?p=810#comment-78124</guid>
		<description>&lt;p&gt;You may find &lt;a href="http://www.infrastructures.org/papers/bootstrap/bootstrap.html" rel="nofollow"&gt;"Bootstrapping an Infrastructure"&lt;/a&gt; interesting reading.  It dates from 1998, so some of the tech examples are out of date, but it describes what you're doing quite closely.&lt;/p&gt;

&lt;p&gt;The idea of having several basic -- a VM hosting layer, a distributed, reliable storage layer, and a higher layer of auto-installed VMs that provide all your core functionality -- is very attractive.  (It's what DoC's been doing for years, for various degrees of storage distribution and virtualization.)&lt;/p&gt;

&lt;p&gt;The idea that individual chunks of server hardware are expendable, however, is a useful one -- when building on unreliable commodity hardware, its vital!  Key to this is the ability to either run up a clone -- or regenerate from first principles -- the machine image that's providing a particular service, and keeping its persistent datasets on a separate reliable storage infrastructure.&lt;/p&gt;

&lt;p&gt;Cluster filesystems -- GFS, GFS2, Lustre, etc -- are starting to look exciting and usable, though I haven't had a chance to play with them yet.  They look particularly important for IO-intensive workloads.&lt;/p&gt;

&lt;p&gt;Control layers for your VMs start to become important.  Most of the time, you want scripts on the work machines to pull changes from central repositories -- see DoC's Sys::Maint infrastructure, triggered by Cron.  Their function is to take a (possibly broken, or minimally bootstrapped) machine image, and to make it good and ready for normal operations.  This can include everything from setting up NTP to installing packages to making sure the correct system configuration files have been installed.&lt;/p&gt;

&lt;p&gt;The other important control layer is the ad-hoc "push" mechanism -- for when you need to execute an important change to a (total) subset of machines &lt;em&gt;right now&lt;/em&gt;.  Batch tools that connect over SSH using some means of automated authentication is good for this; for example, see my &lt;a href="http://www.doc.ic.ac.uk/~dwm/git/remote.git" rel="nofollow"&gt;remote&lt;/a&gt; script.&lt;/p&gt;

&lt;p&gt;It sounds like you're definitely thinking along the right lines, though how much of this you need to implement may vary depending on how far up you need to scale (and with what resources).&lt;/p&gt;
</description>
		<content:encoded><![CDATA[<p>You may find <a href="http://www.infrastructures.org/papers/bootstrap/bootstrap.html" rel="nofollow">"Bootstrapping an Infrastructure"</a> interesting reading.  It dates from 1998, so some of the tech examples are out of date, but it describes what you're doing quite closely.</p>

<p>The idea of having several basic -- a VM hosting layer, a distributed, reliable storage layer, and a higher layer of auto-installed VMs that provide all your core functionality -- is very attractive.  (It's what DoC's been doing for years, for various degrees of storage distribution and virtualization.)</p>

<p>The idea that individual chunks of server hardware are expendable, however, is a useful one -- when building on unreliable commodity hardware, its vital!  Key to this is the ability to either run up a clone -- or regenerate from first principles -- the machine image that's providing a particular service, and keeping its persistent datasets on a separate reliable storage infrastructure.</p>

<p>Cluster filesystems -- GFS, GFS2, Lustre, etc -- are starting to look exciting and usable, though I haven't had a chance to play with them yet.  They look particularly important for IO-intensive workloads.</p>

<p>Control layers for your VMs start to become important.  Most of the time, you want scripts on the work machines to pull changes from central repositories -- see DoC's Sys::Maint infrastructure, triggered by Cron.  Their function is to take a (possibly broken, or minimally bootstrapped) machine image, and to make it good and ready for normal operations.  This can include everything from setting up NTP to installing packages to making sure the correct system configuration files have been installed.</p>

<p>The other important control layer is the ad-hoc "push" mechanism -- for when you need to execute an important change to a (total) subset of machines <em>right now</em>.  Batch tools that connect over SSH using some means of automated authentication is good for this; for example, see my <a href="http://www.doc.ic.ac.uk/~dwm/git/remote.git" rel="nofollow">remote</a> script.</p>

<p>It sounds like you're definitely thinking along the right lines, though how much of this you need to implement may vary depending on how far up you need to scale (and with what resources).</p>]]></content:encoded>
	</item>
</channel>
</rss>

