<?xml version="1.0" encoding="utf-8"?>
<feed xmlns="http://www.w3.org/2005/Atom">
    <title>Xen hosting: Lessons from the Trenches</title>
    <link rel="alternate" type="text/html" href="http://wiki.xen.prgmr.com/xenophilia/" />
    <link rel="self" type="application/atom+xml" href="http://wiki.xen.prgmr.com/xenophilia/atom.xml" />
    <id>tag:wiki.xen.prgmr.com,2008-03-02:/xenophilia/2</id>
    <updated>2010-03-09T22:31:11Z</updated>
    
    <generator uri="http://www.sixapart.com/movabletype/">Movable Type Open Source 4.1</generator>

<entry>
    <title>Horn (and the VMs on it) will be rebooted shortly</title>
    <link rel="alternate" type="text/html" href="http://wiki.xen.prgmr.com/xenophilia/2010/03/horn-and-the-vms-on-it-will-be-1.html" />
    <id>tag:wiki.xen.prgmr.com,2010:/xenophilia//2.191</id>

    <published>2010-03-09T21:29:36Z</published>
    <updated>2010-03-09T22:31:11Z</updated>

    <summary><![CDATA[we've got to replace a disk.&nbsp; update:&nbsp; it is coming back up now, restoring domains.&nbsp; those of you running debian might need another reboot (I've had problems with debian save/restore) update:&nbsp; it looks like all domains but one came back up...]]></summary>
    <author>
        <name>luke</name>
        <uri>http://prgmr.com</uri>
    </author>
    
    
    <content type="html" xml:lang="en-US" xml:base="http://wiki.xen.prgmr.com/xenophilia/">
        <![CDATA[we've got to replace a disk.&nbsp; <br /><br /><br />update:&nbsp; it is coming back up now, restoring domains.&nbsp; those of you running debian might need another reboot (I've had problems with debian save/restore)<br /><br />update:&nbsp; it looks like all domains but one came back up successfully (with a save/restore, no reboot)&nbsp; <br /><br />we'll be rebuilding the RAID now, so expect disk performance to suck for a while.&nbsp; <br /><br /><br />]]>
        
    </content>
</entry>

<entry>
    <title>Horn (and the VMs on it) will be rebooted shortly</title>
    <link rel="alternate" type="text/html" href="http://wiki.xen.prgmr.com/xenophilia/2010/03/horn-and-the-vms-on-it-will-be.html" />
    <id>tag:wiki.xen.prgmr.com,2010:/xenophilia//2.190</id>

    <published>2010-03-09T21:29:36Z</published>
    <updated>2010-03-09T21:30:21Z</updated>

    <summary><![CDATA[in order to replace a bad disk &nbsp;...]]></summary>
    <author>
        <name>luke</name>
        <uri>http://prgmr.com</uri>
    </author>
    
    
    <content type="html" xml:lang="en-US" xml:base="http://wiki.xen.prgmr.com/xenophilia/">
        <![CDATA[in order to replace a bad disk &nbsp; ]]>
        
    </content>
</entry>

<entry>
    <title>the next server will be limited to 512M accounts and below</title>
    <link rel="alternate" type="text/html" href="http://wiki.xen.prgmr.com/xenophilia/2010/02/the-next-server-will-be-limite.html" />
    <id>tag:wiki.xen.prgmr.com,2010:/xenophilia//2.188</id>

    <published>2010-02-24T11:22:56Z</published>
    <updated>2010-02-24T11:48:12Z</updated>

    <summary><![CDATA[we will do our best to put anyone who is in the preorder queue who needs something larger on an older server.&nbsp; There are two reasons why I am doing this.&nbsp; First, I want to move to a system where...]]></summary>
    <author>
        <name>luke</name>
        <uri>http://prgmr.com</uri>
    </author>
    
    
    <content type="html" xml:lang="en-US" xml:base="http://wiki.xen.prgmr.com/xenophilia/">
        <![CDATA[we will do our best to put anyone who is in the preorder queue who needs something larger on an older server.&nbsp; <br /><br />There are two reasons why I am doing this.&nbsp; First, I want to move to a system where the small accounts are segregated from the large accounts;&nbsp; ultimately, I'd like to put all the 64MiB customers on one server, all the 128MiB customers on one server, etc... but I don't have enough servers for that, so I'm starting this way.&nbsp; <br /><br />The reason to segregate in this manner is to isolate performance problems that might be caused by the smaller domains using swap more, ah, vigorously than the larger domains.&nbsp; As you all know, the SATA disk I use is by far the weakest link in my setup, and I hope this change will help people 'get what they pay for'.&nbsp; (I will be implementing other procedures to see to it that everyone gets a fair shake at the disk on these smaller servers.)<br /><br />The other (and perhaps larger) reason why I want to do this is financial.&nbsp; Right now, prgmr.com has more available labor than capital, and we are bottlenecking pretty hard on capital, as evidenced by the fact that the 'we are out of servers' sign is up more often than not.&nbsp; Now, there are several ways this could be solved:<br /><br />1. I could raise prices<br /><br />2. I could charge a setup fee whenever I got below a certain capacity (like 1, but temporary)<br /><br />3. I could see to it that my more profitable customers have access to new capacity before my less profitable customers do.<br /><br />(Yes, I could also get investors or a loan, but those both come with their own irritations.&nbsp; I'm considering doing contracting, but that has its own irritations as well.)&nbsp; <br /><br />I am trying 3, mostly because I don't like the idea of raising prices.&nbsp; (In this industry, you raise your prices by keeping your prices the same.)&nbsp; My pricing model is $4 per month per account, plus $1 per month for every 64MiB of ram, so a 64MiB guest is $5, a 128MiB guest is $6, a 256MiB guest is $8, etc...&nbsp; so obviously, for any given amount of ram, I make more money the more small guests I sell.&nbsp; <br /><br /><br /><br />So, here is the plan.&nbsp; Knife, the next server, will host 64MiB, 128MiB, 256MiB, and 512MiB guests.&nbsp; This server should be up within a day or two.&nbsp; (If I haven't made any more mistakes, it will be done tonight.)&nbsp; Note, I don't know if it will get filled by the waiting list or not.&nbsp; It very well might.&nbsp; <br /><br />Then, within the next two weeks (I have all the parts; ram is in the mail)&nbsp; I plan on setting up a 16GiB/single socket server for 64MiB guests only.&nbsp; <br /><br />Also within the next two weeks I will have another 32GiB/8 core server returned from a dedicated server customer who is leaving.&nbsp; If knife fills up with 512MiB and below domains, it will be another 512MiB and below server.&nbsp; Otherwise, it will service 1024MiB and above domains.&nbsp; <br /><br />Beyond that, I need to buy more hardware, something that this new scheme will hopefully facilitate.&nbsp; <br /><br /><br /> ]]>
        
    </content>
</entry>

<entry>
    <title>bad disk on robe; expect performance problems</title>
    <link rel="alternate" type="text/html" href="http://wiki.xen.prgmr.com/xenophilia/2010/02/bad-disk-on-robe-expect-perorm.html" />
    <id>tag:wiki.xen.prgmr.com,2010:/xenophilia//2.187</id>

    <published>2010-02-22T16:04:51Z</published>
    <updated>2010-02-22T16:05:57Z</updated>

    <summary>expect slow I/O while I rebuild the RAID on robe....</summary>
    <author>
        <name>luke</name>
        <uri>http://prgmr.com</uri>
    </author>
    
    
    <content type="html" xml:lang="en-US" xml:base="http://wiki.xen.prgmr.com/xenophilia/">
        <![CDATA[expect slow I/O while I rebuild the RAID on robe.<br /><br /><br /><br /> ]]>
        
    </content>
</entry>

<entry>
    <title>(short) network outage at SVTIX, midnight, 2010-02-13</title>
    <link rel="alternate" type="text/html" href="http://wiki.xen.prgmr.com/xenophilia/2010/02/short-network-outage-at-svtix.html" />
    <id>tag:wiki.xen.prgmr.com,2010:/xenophilia//2.186</id>

    <published>2010-02-14T09:46:11Z</published>
    <updated>2010-02-14T09:47:38Z</updated>

    <summary><![CDATA[uh, happened just a bit ago.&nbsp; maybe 15 minutes of downtime.&nbsp; SVTIX warned me of emergency maintenance of a UPS maybe three hours before that.&nbsp; Nothing was rebooted; we just lost network for a bit. &nbsp;...]]></summary>
    <author>
        <name>luke</name>
        <uri>http://prgmr.com</uri>
    </author>
    
    
    <content type="html" xml:lang="en-US" xml:base="http://wiki.xen.prgmr.com/xenophilia/">
        <![CDATA[uh, happened just a bit ago.&nbsp; maybe 15 minutes of downtime.&nbsp; SVTIX warned me of emergency maintenance of a UPS maybe three hours before that.&nbsp; Nothing was rebooted; we just lost network for a bit. &nbsp; ]]>
        
    </content>
</entry>

<entry>
    <title>Chris and I will be speaking at the local ACCU meeting tonight.</title>
    <link rel="alternate" type="text/html" href="http://wiki.xen.prgmr.com/xenophilia/2010/02/chris-and-i-will-be-speaking-a.html" />
    <id>tag:wiki.xen.prgmr.com,2010:/xenophilia//2.185</id>

    <published>2010-02-11T01:28:28Z</published>
    <updated>2010-02-11T01:29:17Z</updated>

    <summary>http://accu.org/index.php/accu_branches/accu_usa/next...</summary>
    <author>
        <name>luke</name>
        <uri>http://prgmr.com</uri>
    </author>
    
    
    <content type="html" xml:lang="en-US" xml:base="http://wiki.xen.prgmr.com/xenophilia/">
        http://accu.org/index.php/accu_branches/accu_usa/next 
        
    </content>
</entry>

<entry>
    <title>sloooow disk I/O on horn due to bad disk</title>
    <link rel="alternate" type="text/html" href="http://wiki.xen.prgmr.com/xenophilia/2010/02/sloooow-disk-io-on-horn-due-to.html" />
    <id>tag:wiki.xen.prgmr.com,2010:/xenophilia//2.184</id>

    <published>2010-02-07T21:32:32Z</published>
    <updated>2010-02-07T21:32:57Z</updated>

    <summary><![CDATA[I'm heading out to fix it right now.&nbsp;...]]></summary>
    <author>
        <name>luke</name>
        <uri>http://prgmr.com</uri>
    </author>
    
    
    <content type="html" xml:lang="en-US" xml:base="http://wiki.xen.prgmr.com/xenophilia/">
        <![CDATA[I'm heading out to fix it right now.&nbsp; ]]>
        
    </content>
</entry>

<entry>
    <title>IPv6 router upgrade at SVTIX</title>
    <link rel="alternate" type="text/html" href="http://wiki.xen.prgmr.com/xenophilia/2010/01/ipv6-router-upgrade-at-svtix.html" />
    <id>tag:wiki.xen.prgmr.com,2010:/xenophilia//2.183</id>

    <published>2010-01-31T20:33:42Z</published>
    <updated>2010-01-31T20:35:26Z</updated>

    <summary><![CDATA[(there are only a few of you on it) &nbsp; we're rebooting our experimental IPv6 router for testing... it shouldn't be more than a few minutes downtime for IPv6.&nbsp;...]]></summary>
    <author>
        <name>luke</name>
        <uri>http://prgmr.com</uri>
    </author>
    
    
    <content type="html" xml:lang="en-US" xml:base="http://wiki.xen.prgmr.com/xenophilia/">
        <![CDATA[(there are only a few of you on it) &nbsp; <br /><br />we're rebooting our experimental IPv6 router for testing... it shouldn't be more than a few minutes downtime for IPv6.&nbsp; <br /> ]]>
        
    </content>
</entry>

<entry>
    <title>partial network outage last night</title>
    <link rel="alternate" type="text/html" href="http://wiki.xen.prgmr.com/xenophilia/2010/01/partial-network-outage-last-ni.html" />
    <id>tag:wiki.xen.prgmr.com,2010:/xenophilia//2.182</id>

    <published>2010-01-04T01:39:22Z</published>
    <updated>2010-01-04T01:43:22Z</updated>

    <summary><![CDATA[my provider tells me there was an intermittent network outage at my Fremont he.net location (my reseller, not he.net)&nbsp; from 11pm to 1am PST. &nbsp;...]]></summary>
    <author>
        <name>luke</name>
        <uri>http://prgmr.com</uri>
    </author>
    
    
    <content type="html" xml:lang="en-US" xml:base="http://wiki.xen.prgmr.com/xenophilia/">
        <![CDATA[my provider tells me there was an intermittent network outage at my Fremont he.net location (my reseller, not he.net)&nbsp; from 11pm to 1am PST. &nbsp; ]]>
        
    </content>
</entry>

<entry>
    <title>stables and birds going down for update and move</title>
    <link rel="alternate" type="text/html" href="http://wiki.xen.prgmr.com/xenophilia/2009/12/stables-and-birds-going-down-f.html" />
    <id>tag:wiki.xen.prgmr.com,2009:/xenophilia//2.181</id>

    <published>2009-12-31T06:04:55Z</published>
    <updated>2009-12-31T06:07:46Z</updated>

    <summary>they are in one of our supermicro 2 in 1u units...</summary>
    <author>
        <name>luke</name>
        <uri>http://prgmr.com</uri>
    </author>
    
    
    <content type="html" xml:lang="en-US" xml:base="http://wiki.xen.prgmr.com/xenophilia/">
        they are in one of our supermicro 2 in 1u units 
        
    </content>
</entry>

<entry>
    <title>hydra rebooting shortly</title>
    <link rel="alternate" type="text/html" href="http://wiki.xen.prgmr.com/xenophilia/2009/12/hydra-rebooting-shortly.html" />
    <id>tag:wiki.xen.prgmr.com,2009:/xenophilia//2.180</id>

    <published>2009-12-31T02:55:00Z</published>
    <updated>2009-12-31T03:10:31Z</updated>

    <summary><![CDATA[we're trying to see if we can xm save like we did on lion, unlike we did on boar, but it's a pretty old box, so we might be rebooting you.&nbsp; [root@hydra /]# uptime&nbsp;18:56:26 up 410 days, 15:44,&nbsp; 2 users,&nbsp;...]]></summary>
    <author>
        <name>luke</name>
        <uri>http://prgmr.com</uri>
    </author>
    
        <category term="hosting status" scheme="http://www.sixapart.com/ns/types#category" />
    
        <category term="outage" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en-US" xml:base="http://wiki.xen.prgmr.com/xenophilia/">
        <![CDATA[we're trying to see if we can xm save like we did on lion, unlike we did on boar, but it's a pretty old box, so we might be rebooting you.&nbsp; <br /><br />[root@hydra /]# uptime<br />&nbsp;18:56:26 up 410 days, 15:44,&nbsp; 2 users,&nbsp; load average: 0.09, 0.29, 0.25<br /><br /> ]]>
        
    </content>
</entry>

<entry>
    <title>boar.prgmr.com going down shortly</title>
    <link rel="alternate" type="text/html" href="http://wiki.xen.prgmr.com/xenophilia/2009/12/boarprgmrcom-going-down-shortl.html" />
    <id>tag:wiki.xen.prgmr.com,2009:/xenophilia//2.179</id>

    <published>2009-12-31T00:24:57Z</published>
    <updated>2009-12-31T00:27:10Z</updated>

    <summary><![CDATA[just like lion, save for that there are fewer customers on boar. [root@boar ~]# uptime&nbsp;16:24:55 up 410 days, 15:32,&nbsp; 3 users,&nbsp; load average: 0.00, 0.01, 0.00 [root@boar ~]# xm list |wc -l 18 starting upgrade now (service won't be impacted until we start the...]]></summary>
    <author>
        <name>luke</name>
        <uri>http://prgmr.com</uri>
    </author>
    
    
    <content type="html" xml:lang="en-US" xml:base="http://wiki.xen.prgmr.com/xenophilia/">
        <![CDATA[just like lion, save for that there are fewer customers on boar.<br /><br />[root@boar ~]# uptime<br />&nbsp;16:24:55 up 410 days, 15:32,&nbsp; 3 users,&nbsp; load average: 0.00, 0.01, 0.00<br /><br />[root@boar ~]# xm list |wc -l<br />18<br /><br /><br />starting upgrade now (service won't be impacted until we start the reboot)<br /><br /><br /> ]]>
        
    </content>
</entry>

<entry>
    <title>(short) downtime on hydra, stables, birds and boar - tomorrow</title>
    <link rel="alternate" type="text/html" href="http://wiki.xen.prgmr.com/xenophilia/2009/12/short-downtime-on-hydra-stable.html" />
    <id>tag:wiki.xen.prgmr.com,2009:/xenophilia//2.178</id>

    <published>2009-12-30T07:15:13Z</published>
    <updated>2009-12-30T07:18:29Z</updated>

    <summary><![CDATA[all servers will be rebooted (as lion was today)&nbsp; for some kernel upgrades, and to consolidate all my he.net servers to one rack. &nbsp;...]]></summary>
    <author>
        <name>luke</name>
        <uri>http://prgmr.com</uri>
    </author>
    
        <category term="hosting status" scheme="http://www.sixapart.com/ns/types#category" />
    
        <category term="outage" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en-US" xml:base="http://wiki.xen.prgmr.com/xenophilia/">
        <![CDATA[all servers will be rebooted (as lion was today)&nbsp; for some kernel upgrades, and to consolidate all my he.net servers to one rack. &nbsp; ]]>
        
    </content>
</entry>

<entry>
    <title>~20 min network outage today</title>
    <link rel="alternate" type="text/html" href="http://wiki.xen.prgmr.com/xenophilia/2009/12/20-min-network-outage-today.html" />
    <id>tag:wiki.xen.prgmr.com,2009:/xenophilia//2.177</id>

    <published>2009-12-30T04:25:23Z</published>
    <updated>2009-12-30T04:27:20Z</updated>

    <summary><![CDATA[from 11:58 to 12:19 PST&nbsp; -&nbsp; We suspect upstream network trouble as the cause.&nbsp;...]]></summary>
    <author>
        <name>luke</name>
        <uri>http://prgmr.com</uri>
    </author>
    
    
    <content type="html" xml:lang="en-US" xml:base="http://wiki.xen.prgmr.com/xenophilia/">
        <![CDATA[from 11:58 to 12:19 PST&nbsp; -&nbsp; We suspect upstream network trouble as the cause.&nbsp; ]]>
        
    </content>
</entry>

<entry>
    <title>lion rebooting for kernel refresh and a move</title>
    <link rel="alternate" type="text/html" href="http://wiki.xen.prgmr.com/xenophilia/2009/12/lion-rebooting-for-kernel-refr.html" />
    <id>tag:wiki.xen.prgmr.com,2009:/xenophilia//2.176</id>

    <published>2009-12-30T02:13:28Z</published>
    <updated>2009-12-30T02:15:13Z</updated>

    <summary><![CDATA[[root@lion ~]# uptime&nbsp;18:12:12 up 451 days, 17:42,&nbsp; 8 users,&nbsp; load average: 0.01, 0.03, 0.00 as usual, if we don't screw it up it will be 20 minutes downtime and no reboot for you, due to xm save/restore...]]></summary>
    <author>
        <name>luke</name>
        <uri>http://prgmr.com</uri>
    </author>
    
        <category term="hardware" scheme="http://www.sixapart.com/ns/types#category" />
    
        <category term="hosting status" scheme="http://www.sixapart.com/ns/types#category" />
    
        <category term="outage" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en-US" xml:base="http://wiki.xen.prgmr.com/xenophilia/">
        <![CDATA[[root@lion ~]# uptime<br />&nbsp;18:12:12 up 451 days, 17:42,&nbsp; 8 users,&nbsp; load average: 0.01, 0.03, 0.00<br /><br /><br />as usual, if we don't screw it up it will be 20 minutes downtime and no reboot for you, due to xm save/restore<br /><br /> ]]>
        
    </content>
</entry>

</feed>
