KILO, MEGA, KIBI, MIBI…

In our discussion of the question of how 12 megapixels became 12.1, two commenters made the same suggestion: that a megapixel might really be measured in powers of two, the way a megabyte is. The base-2-and-exponents explanation was the first thing that occurred to Dr. Boli as well, but it seems to be universally admitted that a megapixel is a million pixels, not more, and not counting by powers of two. Dr. Boli went looking for any evidence of powers of two in the calculation of megapixels and came up with none. Furthermore, if it were true, it seems to Dr. Boli that 4,000 by 3,000 pixels would be less than 12 megapixels, since “pixels” are in fact counted individually, not in groups of kilopixels.

In fact, 12 million pixels is a number with odd mathematical properties in photography. It is very rare for a camera to make pictures with a number of pixels in either dimension that is divisible by a big round number like 1,000; usually the dimensions are something like 4,608 x 3,456. The reason has to do with the aspect ratios of rectangular pictures, which in most cameras are either 3 x 2 or 4 x 3. At an aspect ratio of 4 x 3, 4,000 by 3,000 pixels make up exactly 12 million pixels, which is intellectually satisfying; but you need some math to figure out what figures make up 10 million pixels or 16 million pixels at the same aspect ratio.

So looking for evidence that pixels were counted the way bytes are counted got us nowhere. But our search did lead us into an interesting demonstration of an Internet principle Dr. Boli has pointed out before, to which we may for the sake of convenience assign the name Boli’s Law of Internet Controversy: On the Internet, the victory goes to the most pedantic.

It used to be true that a kilobyte was 1,024 bytes, and a megabyte was 1,024 kilobytes. But the pedants have had their way with “kilobyte” and “megabyte,” insisting that they must be exact powers of ten. Therefore a unit of 1024 bytes is a kibibyte; and similarly, what you think of as a megabyte is a mibibyte. Wiktionary now classifies the usual meaning of “kilobyte” as a secondary “informal” definition; and we can see the moment the pedants invaded, because Wiktionary also preserves the discussions that surround its changes. Three years ago, User A said,

Switch def. 1 and 2 even if right (in the past?)

kilobyte (kB) for 1000 bytes might not be rare anymore? Is it standard in Operating systems now? Many computer hackers want to hang on to KB vs kB, but might be admitting defeat and accepting kB and use KiB (or KB, or Kilobyte with capital K, to distinguish).

Note, incidentally, the punctuation that indicates the use of the interrogative tone in discourse, which Dr. Boli keeps promising to write an essay about in his series on cultural neoteny, and he will probably get around to it eventually. To this indefinitely phrased statement, User B replied:

Absolutely. The binary definition of 1024 is officially deprecated, and not only should the numbers be switched it should be made clear that the binary definition is obsolete.

Now, here is an interesting glimpse into the mind of the pedant. Read that sentence again, and then ask yourself: What office has the authority to deprecate, officially, a common noun in English?

There is an answer to that question in many other languages. If we were speaking French, we might be able to refer to an official ruling of the French Academy on the meaning of “kilobyte.” But English never developed such an authority. We may trace the reason back to Samuel Johnson.

The preface to Johnson’s dictionary is a work that truly deserves to be called seminal, because it sowed the seeds for all the lexicographical thoughts that have sprouted in English since Johnson’s time. It is also one of the finest specimens of English prose ever written.

In this preface, Dr. Johnson explains that he had thought he might set the rules for correct English for all time. But then… Well, let us hear it from the Doctor himself:

Those who have been persuaded to think well of my design, require that it should fix our language, and put a stop to those alterations which time and chance have hitherto been suffered to make in it without opposition. With this consequence I will confess that I flattered myself for a while; but now begin to fear that I have indulged expectation which neither reason nor experience can justify. When we see men grow old and die at a certain time one after another, from century to century, we laugh at the elixir that promises to prolong life to a thousand years; and with equal justice may the lexicographer be derided, who being able to produce no example of a nation that has preserved their words and phrases from mutability, shall imagine that his dictionary can embalm his language, and secure it from corruption and decay, that it is in his power to change sublunary nature, or clear the world at once from folly, vanity, and affectation.

This seems like obvious truth to English-speakers, because we have grown up in a world where Johnson’s opinion is accepted as dogma. Yet it is not dogma for other languages, as Johnson himself points out.

With this hope, however, academies have been instituted, to guard the avenues of their languages, to retain fugitives, and repulse intruders; but their vigilance and activity have hitherto been vain; sounds are too volatile and subtile for legal restraints; to enchain syllables, and to lash the wind, are equally the undertakings of pride, unwilling to measure its desires by its strength.

Johnson’s opinion that language cannot be legislated has become the dogma of professional lexicographers. Merriam-Webster defines kilobyte as “a unit of information equal to 1024 bytes,” and then adds, “also: one thousand bytes.” The American Heritage Dictionary has a very similar definition, with a long usage note at gigabyte explaining that the first meaning is more common in most contexts, but the other more common for certain branches of the industry.

None of this satisfies the pedant, however. The professional lexicographers are wrong. The pedant knows, and his authority is indisputable. It is official. The official authority usually turns out to be a high-school English teacher who taught pedantry along with English, but the pedant has absolute faith in the irrefutability of his knowledge. If you want to know how Sisyphus felt, try starting a discussion on the Wiktionary page for kilobyte, saying, “Hey, I don’t think you’re really right about…” You will not win, because you are not the most pedantic person on the Internet.

Well, then, what have we learned? Nothing about megapixels. As far as Dr. Boli has been able to determine, “megapixel” has always meant a million pixels, not a power of two, and not a little bit less than a million pixels, which is what we would require for 4,000 x 3,000 to make 12.1 megapixels. But we have learned, once again, that the only way to win an argument on the Internet is to be more pedantic than the opposition and never to admit the possibility of error. There was a time when Dr. Boli would have called himself pedantic, but the Internet has taught him that he is underqualified for pedantry.

Comments

  1. Inquiring Mimes says:

    So 1,024 pieces of kibble would be kibikibble?

  2. Belfry Bat says:

    Congratulations

  3. Belfry Bat says:

    Then it occurred to me that there IS an office with the authority and audacity to tell us what words do mean (or don’t); the State Supreme Court of Ohio confidently ruled (some time ago) that, of course, “boneless” chicken wings might have bones in them and no reasonable customer (in Ohio, anyway) would expect otherwise.

  4. Richard A says:

    I was ready to accept your challenge on behalf of the correct, original, definition of kilobyte, but then realized I am cursed with too much situational awareness to carry it off.

  5. avwolf says:

    The introduction of the silly units happened when I was most of the way through my degree in Computer Engineering and it is one of those things that makes me unreasonably angry. We should never have let the hard drive manufacturers win. “If we use the *exact metric* meaning of these prefixes, instead of the common-use computer meaning, we can make our customers think they are getting more while we actually give them less!” It is an embarrassment to my profession and professional organization that we bowed to this scheme.

    More on topic, I do sometimes wish the lexicographers had not completely resigned themselves to being unable to restrain language at all. As embarrassed as I may be at computer science sort-of adopting mebibyte, I cannot imagine the degree lexicographers with any self-respect want to crawl under a rock for members of their profession declaring “6-7” to be the “word of the year for 2025.”

    • Belfry Bat says:

      Okay, here we go… “to what degree anywise self-respecting lexicographers would rather crawl under rocks, seeing how et.c.” I can pedant with the worst of them, and verb the choicest nouns of ever, though alas I’ve coined naught of catchiness all year neither—my daughter, all of five, defined “dishtribution centre” last week and I’m all five of envy for it!

Leave a Reply to avwolf Cancel reply

Your email address will not be published. Required fields are marked *