Cyberdemon crowd surfing

Why do you need a new CPU?

Well, isn't it obvious?

(See also here and here.)

Going up!

Large Lego crane

When I first saw this crane in the LUGnet news I, of course, immediately thought "what's Meccano doing on a Lego site?"

But this is, in fact, a Lego construction, built out of the studless beams that're now normal for all Lego Technic. It's got a long way to go before it can challenge the biggest cranes built with old-style studded beams, but it's an imposing creation nonetheless. (Note the Linear-Actuator-driven counterweight mechanism!)

Crane holding excavator detail

It is, of course, entirely traditional for a bigger toy to humiliate a smaller one.

Big R/C car dominates small R/C cars

I've been doing that for years.

It's not the size of the track, it's what you do with it

One of my readers was delighted to discover this Google ad on this very site:

Girl impressed by big thick masculine track.

I agree with him that it is completely awesome.

(I've linked the above image to the online store of "Radmeister", the people responsible. That's not a paid link, of course; you can click on it without costing Radmeister any money. If you happen to see the same image to the right of this page, then that'll be a real ad. Do tell me if something even better crops up.)

The ad was, no doubt, attracted by my recent series of posts about Lego tracks.

You wouldn't think the nice lady in the bikini would find Lego tracks very impressive. But quantity has a quality all its own, and after the last post I was as good as my word and did indeed buy yards of new-style tracks on BrickLink. The only reason why you haven't yet seen a picture of them lying there like Worf's spare baldric collection is that the BrickLink dealer accidentally sent me a mere 400 links instead of the 480 I paid for.

When the rest of them show up - giving a total length of 5.76 metres, versus the lousy 4.8 I've got now - I shall make them into a fly-curtain or something while I design a vehicle worthy of them.

He has trouble in sand traps

This little guy's hilarious.

(Yet another Crabfu creation, of course.)

Death of RAID predicted, film at 11

From a reader:

Love to hear your take on this: Why RAID 5 stops working in 2009.

That article by Robin Harris is more than a year old, but was suddenly linked hither and yon in the last few days. Its thesis is that because RAID array capacities are approaching unrecoverable read error rates, if one disk in your RAID fails, you'll very probably get a read error from one of the other disks in the course of rebuilding the array, and lose data.

This basic claim is true, but there are three reasons why this problem is not as scary as it sounds.

1: Losing one bit shouldn't lose you more than one file. Consumer RAID controllers may have a fit over small errors, but losing one file because a drive failed is seldom actually a serious problem.

2: Putting your valuable data on a RAID array and not backing it up is a bad idea even if disk errors never happen. One day, you are guaranteed to confidently instruct your computer to delete something that you actually want, and RAID won't protect you from that, or from housefires, theft of the drives, and so on. You still need proper backups.

3: If you're going to build a story on statistics, it helps a lot if you get the statistics right.

Robin Harris says it is "almost certain" that 12 terabytes of disk drives with a one-in-12Tb read error rate will have an error if you read the whole capacity.

This statement is wrong.

Actually, the probability of one or more errors, in this situation, is only 63.2%. When you know why this is, you discover that there's less to fear here than you'd think.

(Robin Harris is not the only sinner, here. This guy makes exactly the same mistake. This guy, on the other hand, says one error in ten to the fourteen reads gives you "a 56% chance" of an error in seven terabytes read; he's actually done the maths correctly.)

The mistake people make over and over again when figuring out stuff like this is saying that that if you've got (say) a one in a million chance of event Y occurring every time you take action X, and you do X a million times, the probability that Y will have happened is 1.

It isn't.

(If it were, then if you did X a million and one times, the probability that Y will have occured would now be slightly more than one. This is unacceptably weird, even by mathematicians' standards.)

What you do to figure out the real probabilities in this sort of situation is look at the probability that Y will never happen over your million trials.

(If it matters to you if Y happens more than once, then things get more complex. But usually the outcomes you're interested in are "Y does not happen at all" and "Y happens one or more times". That is the case here, and in many other "chance of failure" sorts of situations.)

To make this easier to understand, let's look at a version of the problem using numbers that you can figure out on a piece of paper, without having to do anything a million times.

Let's say that you're throwing an ordinary (fair!) six-sided die, and you don't want to get a one. The chance of getting a one is, of course, one in six, and let's say you're throwing the die six times.

For each throw, the probability of something other than one coming up is five in six. So the probability of something other than one coming up for all six throws is:

5/6 times 5/6 times 5/6 times 5/6 times 5/6 times 5/6.

This can more easily be written as five-sixths to the power of six, or (5/6)^6, and it's equal to (5^6)/(6^6), or 15625/46656. That's about 0.335, where 1 is certainty, and 0 is impossibility.

So six trials, in each of which an undesirable outcome has a one in six chance of happening, certainly do not make the undesirable outcome certain. You actually have about a one-third chance that the undesirable outcome will not happen at all.

It's easy to adjust this for different probabilities and different numbers of trials. If you intend to throw the dice 15 times instead of six, you calculate (5/6)^15, which gives you about a 0.065 chance that you'll get away with no ones. And if you decide to toss a coin ten times, and want to know how likely it is that it'll never come up tails, then the calculation will be (1/2)^10, a miserable 0.00098.

In the one-in-a-million, one-million-times version, you figure out (1 - 1/1000000)^1000000, which is about 0.368. So there's a 36.8% chance that the one-in-a-million event will never happen in one million trials, and a 63.2% chance that the event will happen one or more times.

OK, on to the disk-drive example.

Let's say that the chance of an unrecoverable read failure is indeed one in ten to the 14 - 1/100,000,000,000,000. I'll express this big number, and the other big numbers to come, in the conventional computer-y form of scientific notation that doesn't require little superscript numbers. One times ten to the power of 14, a one with 14 zeroes after it, is thus written "1E+14".

The chance of no error occurring on any given read, given this error probability, is 1 - 1/(1E+14), which is 0.99999999999999. Very close to one, but not quite there.

(Note that if you start figuring this stuff out for yourself in a spreadsheet or something, really long numbers may cause you to start hitting precision problems, where the computer runs out of digits to express a number like 0.99999999999999999999999999999999999999 correctly, rounds it off to one, and breaks your calculation. Fortunately, the mere fourteen-digit numbers we're working with here are well within normal computer precision.)

OK, now let's say we're reading the whole of a drive which just happens to have a capacity of exactly 1E+14 bits, at this error rate of one error in every 10^14 reads. So the chance of zero errors is:

(1 - 1/(1E+14))^1E+14

This equals about 0.368. Or, if you prefer, a 63.2% chance of one or more errors.

Note that the basic statement about the probability of an error remains true - overall, a drive with an Unrecoverable Read Error Rate of one in ten to the fourteen will indeed have such an error once in every ten to the fourteen reads. But that doesn't guarantee such an error in any particular ten to the fourteen reads, any more than the fact that a coin comes up evenly heads or tails guarantees that you'll get one of each if you throw it twice.

Now, a RAID that's 63.2% likely to have an error if one of its drives fails is still not a good thing. But there's a big difference between 63.2% and "almost certain".

(Note also that we're talking about a lot of data, here. At fifty megabytes per second, ten to the fourteen bits will take about 2.8 days to read.)

Getting the statistics right makes the numbers look proportionally better if the error rate can be reduced.

If drive manufacturers manage to reduce the error rate by a factor of ten, for instance, so now it's one in every ten to the fifteen reads instead of every 1E+14, the chance that you'll get no such errors in a given ten to the fourteen reads improves to about 90.5%.

If they reduce the error rate all the way to one in ten to the sixteen, then ten to the fourteen reads are 98.9% likely to all be fine.

I'm not saying it's necessarily easy to make such an improvement in the read error rate, especially in the marketing-bulldust-soaked hard-drive industry.

But neither is the situation as dire as the "almost certain" article says.

All who commit such crimes against mathematical literacy are hereby sentenced to read John Allen Paulos' classic Innumeracy.

(This is not a very severe sentence, since the book is actually rather entertaining.)

Big dozer, little dozer

Now that Lego has two kinds of track piece, it's easier to make similar tracked vehicles in very different scales.

Via TechnicBricks, a frequent link-target for me recently, behold the last unknown Technic Lego set for 2009:

2009 Technic sets

It's the one in the top-left corner, being dwarfed by the 8256 Super Kart and 8262 Quad-Bike. It's called the 8259 Mini Dozer, and it's...

Lego 8259 Mini Dozer

...completely anerable.

Compare and contrast with its (much) bigger brother...

8275 Motorized Bulldozer

...the 8275 Motorized Bulldozer. The Mini Dozer is, basically, a micro-scale version of 8275.

So now you can put 8259 on one end, 8275 on the other, and the classic 856 in the middle, and have flying-ducks bulldozers!

(I wonder how long it'll be before someone finds room for a Micro Motor inside an 8259?)

UPDATE: I know what you're thinking. You're thinking, "I won't believe this kit even exists unless I can fly around it using Microsoft Photosynth and see every tiny detail."

No problem.

UPDATE 2: It didn't take long for people to start building the set before you can buy one.

Track traction

It is, as mentioned in this post, not entirely easy to get Lego tracks - either the new big or old small ones, or ones improvised from other pieces - to grip the ground well. They're particularly lousy on smooth surfaces, like indoor floors or table-tops.

I am indebted to the excellent TechnicBricks blog once again for showing me that it's actually easy to increase the traction of the new-style treads. Just loop rubber bands around each link, or every second link if you want to avoid bands rubbing on each other.

OK, now I've definitely got to buy a bucketload of links, and make something that can climb stairs.

I don't remember voting for this

Australians will be unable to opt-out of the government's pending Internet content filtering scheme, and will instead be placed on a watered-down blacklist, experts say.

Under the government's $125.8 million Plan for Cyber-Safety, users can switch between two blacklists which block content inappropriate for children, and a separate list which blocks illegal material.

If this actually happens, then it'll be a considerable pain. I'd just switch to using Tor or something to avoid being unable to see "illegal" content (it should be for the courts to decide if a page is "illegal", of course...), but that's pretty bleeding slow.

(On account, of course, of the vast numbers of office workers downloading BlackBackdoorBimbosAndTheirBarnyardBeaus27.avi.)

But I'm pretty sure it's not going to happen.

Similar threats have been made here in the past, and they've always petered out into nothing. There are no votes to be won in actually filtering the Internet, after all. The people who vote based on Net filtering promises are unable to tell whether it's actually happening or not. And there are plenty of votes to be lost when everyone who doesn't call their browser "the Internet" discovers that they can't get to YouPorn or Mininova any more.

There's not even much money to be made in making filtering software that actually works. The big bucks in content filtering were and are based on arse-covering and plausible deniability, not actually stopping anybody from seeing anything in particular.

I confidently predict that this will just end up being another easily-circumvented waste of taxpayers' money.