[Tfug] Server purchase (!)
Claude Rubinson
rubinson at u.arizona.edu
Thu Oct 12 16:21:27 MST 2006
Ahh... This is the type of information that I needed. Helps me to
make a more informed cost/benefit analysis. Thanks a lot!
C.
On Thu, Oct 12, 2006 at 09:17:44AM -0700, Bexley Hall wrote:
>
> --- Brian Masur <bcmasur at hotmail.com> wrote:
>
> > All electronics (especially ones with moving parts,
> > like hard drives) are subject to failure at any
> > time.
>
> Actually, most "electronics" (as distinct from
> electromechanical devices -- like hard drives)
> exhibit a bathtub curve of failure probabilities.
> They fail soon after manufacture (one edge of the
> bathtub) or "at maturity" (the other end of the
> bathtub).
>
> This assumes they are used in their proper
> environment (Vcc, Tambient, etc.) -- something that
> isn't always guaranteed in a "PC" case (you'll
> note that a "genuine" server case tends to have
> a sh*tload more fan noise associated with it -- since
> cooling is the number one cause of accelerated
> failures (I think +10C halves MTBF, as a rule of
> thumb)
>
> Some devices (notably, newer CPU's with sub micron
> geometries) will have a more predictable "wearout"
> point due to electromigration, etc. (I've not
> done the math but it intuitively *feels* like
> the number of clock cycles you'll get from a CPU
> is a constant, K, regardless of speed, over it's
> lifetime... this would be an amusing bit of
> research to pursue).
>
> Other devices (CD-R/W) have similar wearout points
> inherent in their designs (e.g., total PoHr's
> limits usable life). Remedies here tend to be
> either "difficult" (i.e. independantly control
> power to said device -- easier done for external
> devices) or "practical" (i.e. plan on replacing it!)
>
> > This is the reason for redundancy,
> > especially when you bring "backup" into the picture.
>
> Disks and power supplies are most often made
> redundant.
> Power supplies fail due to their greater stresses --
> they live in a warmer environment (since many are not
> well ventilated) and handle "extremes" (i.e. switching
> currents result in lots of thermal/mechanical shocks
> to these devices at the switching frequency... things
> that their low mass experience more dramatically
> than would be evident outside the package (where
> larger masses prevail). Capacitors (and connectors)
> tend to be a high failure rate item as well.
> Capacitors can degrade even when not in service
> (store a machine in a hot Tucson garage and you're
> not doing it any favors!).
>
> Disks nowadays have very high MTBF's (1,000,000
> PoHrs).
> But, most sit in poorly ventilated cases where they
> run a lot warmer than they would like. You either
> invest in loud fans or redundant data stores
> (note that a power supply/controller failure can
> render a RAID array useless. And, software bugs
> can hose redundant drives in an ohnosecond -- I
> lost two copies of an archive to a driver bug in
> an old version of FreeBSD :< )
>
> Bottom line, if you *really* want to save your data,
> keep it redundant *machines* (so a failure of one
> doesn't kill "both" copies of your data) -- and in
> different locations (so a fire or lightning strike
> doesn't do likewise!)
>
> I rely on *lots* of spindles (so I only lose a little
> at a time) plus three backups on different media:
> CD-R (which is really NOT very reliable, long term),
> magnetic tape and MO cartridges. Thankfully, the
> amount of *dynamic* data that I have that would be
> subject to this sort of backup is much less than
> the archive already built :>
>
> Last note on connectors... they lurk in often ignored
> locations! In a modern PC, one of the connectors
> most frequently ignored (for reliability) are those
> for the memory devices (SIMMs/DIMMs/etc.). The
> quality
> of these vary *immensely* -- and, there is no easy
> way to determine what you've got!
>
> In addition to tin/gold issues, many of these have
> very low insertion ratings -- some as few as *6*!
> Of course, that doesn't guarantee that it will
> fail on the 7th attempt... but, if you are pulling
> memory frequently, you have to wonder *when* you
> will cross the line from "reliable" to "marginal".
>
> Tinkerers beware :>
> --don
>
> __________________________________________________
> Do You Yahoo!?
> Tired of spam? Yahoo! Mail has the best spam protection around
> http://mail.yahoo.com
>
> _______________________________________________
> Tucson Free Unix Group - tfug at tfug.org
> Subscription Options:
> http://www.tfug.org/mailman/listinfo/tfug_tfug.org
More information about the tfug
mailing list