Edit: this post and the questions in it were poorly formulated, mostly because I assumed a connection between common word sizes in CPU architecture and the fact that I couldn’t find any decimal-to-signed-binary converters online that let me set the “word size”/number of bits I want to work with.

I am a complete beginner in the field of computers.

I am reading Code: The Hidden Language of Computer Hardware and Software by Charles Petzold (2009), and I just learned how we electronically express the logic of subtraction without using a minus sign or an extra bit to indicate positive/negative: we use two’s complement (yes, I realize that the most significant bit incidentally acts as the sign bit, but we don’t need an extra bit). Anyway, I experimented with converting both decimal and binary values into their signed counterparts, just as an exercise. To be sure that I wasn’t doing anything wrong, I wanted to double-check my calculations with some “decimal to signed binary calculators” on the Internet.

I was trying to express -255 in signed binary using 10 bits. I wanted to use only 10 bits to save on resources: 10 bits give 1024 unsigned values (0 to 1023), or -512 to 511 in two’s complement, which is more than enough for the 1000 values between -500 and 499 that I had in mind. I calculated -255 to be 1100000001 in 10-bit signed binary (255 is 0011111111, which you invert to get the one’s complement, and then you add 1).
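
To double-check that kind of conversion, here is a small Python sketch I put together (the function name and the default width of 10 bits are just my own choices for this example):

    def to_twos_complement(value, bits=10):
        """Return the two's-complement bit pattern of `value` using `bits` bits."""
        if not -(1 << (bits - 1)) <= value < (1 << (bits - 1)):
            raise ValueError("value does not fit in the given number of bits")
        # Masking with 2**bits - 1 has the same effect as "invert and add 1" for negatives.
        return format(value & ((1 << bits) - 1), "0{}b".format(bits))

    print(to_twos_complement(-255))  # 1100000001
    print(to_twos_complement(255))   # 0011111111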

I couldn’t find any converters on the Internet that allow me to set the bit length, in this case 10 bits. I found a few that are 8-bit and a few that are 16-bit, which made me think of gaming consoles, which to my knowledge evolved in increments of 8, 16, 32, and 64 bits.

I understand that we use binary to express Boolean logic and arithmetic in electronics because regulating voltage so that transistors are in one of two states matches the true/false values of Boolean logic, and because of the technical difficulties of maintaining stable voltages for ternary and higher bases.

But why didn’t I find any converters online that allow me to set the bit length? Why did the gaming consoles’ maximum bit length evolve in those specific increments? Are there no processor architectures with word sizes other than these?

    • emotional_soup_88@programming.devOP

      Thanks! Your answer led me to this, which kind of explains it:

      https://en.wikipedia.org/wiki/Word_(computer_architecture)

      Character size was in the past (pre-variable-sized character encoding) one of the influences on unit of address resolution and the choice of word size. Before the mid-1960s, characters were most often stored in six bits; this allowed no more than 64 characters, so the alphabet was limited to upper case. Since it is efficient in time and space to have the word size be a multiple of the character size, word sizes in this period were usually multiples of 6 bits (in binary machines). A common choice then was the 36-bit word, which is also a good size for the numeric properties of a floating point format.

      After the introduction of the IBM System/360 design, which uses eight-bit characters and supports lower-case letters, the standard size of a character (or more accurately, a byte) becomes eight bits. Word sizes thereafter are naturally multiples of eight bits, with 16, 32, and 64 bits being commonly used.

      So it has to do with character size: earlier six bits, today one byte/eight bits.

  • dhork@lemmy.world

    There are a few different concepts to address here.

    First, we’ve developed a convention of 8 bits constituting a byte, and I think that is because 8 is the most convenient power of 2 for general-purpose information. Historically, there have been attempts to encode useful information in 5 or 6 bits, and the original ASCII was a 7-bit code that encoded upper- and lower-case Latin characters, simple punctuation, and some control characters. But it was a simple extension to widen it to 8 bits, which is not only a power of 2 but can also be written exactly as 2 hexadecimal digits.

    Then, as 8-bit led to 16-bit and beyond, it was convenient to stick with powers of 2, and with hexadecimal notation. It also makes it easier to break larger integer values down into smaller chunks if those chunks are all 8 bits (as long as you understand endianness…).
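
    For example, here is a rough Python sketch (my own illustration, not tied to any particular machine) of the same 32-bit value broken into 8-bit chunks in the two byte orders:

        import struct

        value = 0x12345678
        # The same 32-bit integer split into four 8-bit bytes, in both byte orders.
        print(struct.pack("<I", value).hex())  # 78563412  (little-endian)
        print(struct.pack(">I", value).hex())  # 12345678  (big-endian)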

    But, as to your question of why many online calculators don’t let you change the bit length: it’s because extending these values is trivial. You can represent a signed 2’s complement value of a given length in a wider field simply by replicating (sign-extending) the MSB.
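
    As a hypothetical Python sketch of that sign extension (the helper name is mine):

        def sign_extend(bit_string, new_width):
            """Widen a two's-complement bit pattern by replicating its MSB."""
            return bit_string[0] * (new_width - len(bit_string)) + bit_string

        print(sign_extend("1100000001", 16))  # 1111111100000001  (-255 in 16 bits)
        print(sign_extend("0011111111", 16))  # 0000000011111111  ( 255 in 16 bits)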

    • emotional_soup_88@programming.devOP

      Thanks! I have no idea what endianness is, except for hearing “big endian” in some CS-related presentation a while back… I’ll read up on it!

      As for my questions and your answer, would it be correct to say then that it’s about scalability? That one byte being eight bits scales efficiently in binary?

      • dhork@lemmy.world

        would it be correct to say then that it’s about scalability? That one byte being eight bits scales efficiently in binary?

        Kind of, but in this case it’s all about human scalability. 8 bits turns out to be a convenient chunk to encode characters in. ASCII is 7 bits, but it turns out to be useful only for things in the Latin alphabet. System designers decided that it was worth retaining the 8th bit (even if it was unused in plain text files). There is an “extended” 8-bit ASCII standard, but the 7-bit standard was always more widespread. Why aren’t all of our bytes 7 bits, then? I stand by my personal theory that it is because it is very easy to represent the full range of 8 bits in hex.

        Later on, the Unicode folks brought some utility to that 8th bit. UTF-8 is an encoding that mirrors ASCII in the lower 7 bits but can be extended into multi-byte characters to represent other scripts too. An overwhelming amount of Internet content is actually encoded in UTF-8. These files will render correctly in an editor that only understands 7-bit ASCII, except for things like the Euro symbol, which are multi-byte constructs that need that 8th bit in order to be recognized.
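
        A quick illustration in Python (my own example, nothing specific to any particular editor or site):

            print("A".encode("utf-8"))  # b'A'             -- one byte, identical to ASCII
            print("€".encode("utf-8"))  # b'\xe2\x82\xac'  -- three bytes, each with the 8th bit set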

        So maybe in addition to looking into endianness, you should spend some time reading up on Unicode and its history to get to the answer you are looking for.

  • Roguelazer@lemmy.world

    To answer your specific question: no. There have been and continue to be lots of CPUs that have things that could plausibly be called a “bit size” that aren’t a power of 2. Note that the “bit size” can refer to a few things (the width of the bus between the CPU and memory, the native size of a pointer, and/or the native width of the arithmetic units). I’ll give examples of each.

    On essentially every “64-bit” computer, the bus to memory is not 64 bits wide. For example, the Apple M4 ARM CPUs are 64-bit but have a 128-bit memory bus over which they communicate something like 43-bit physical addresses. ARM has always been this way; the original 32-bit ARM1 had 26-bit physical addresses.

    As to pointer size, the best example is probably the currently-in-development CHERI architecture, which uses 64-bit arithmetic but 129-bit pointers (a 128-bit capability plus a 1-bit validity tag).

    For an arithmetic-unit example, the floating-point unit on Intel CPUs was traditionally 80 bits wide. These days most floating-point math runs on the 128-bit-wide SSE units instead, but you still run into the 80-bit format in code from time to time.

  • Binette@lemmy.ml

    The reason it’s like that is that if people want to use the same machine but store information of different sizes (e.g. someone wants to store 5 bits while another wants to store 24), sticking to 2^n sizes is the best way to fetch that information quickly.

    For example, say you have a memory with as many sets as you want, but only one bit in each set, and I want to store information that is 2 bits in size. I can split each 2-bit packet in two and store its bits in sets that are one index apart. To read the information back, I just read addresses 0 and 1, 10 and 11, 100 and 101, and so on. This follows a rather simple pattern: everything from the leftmost bit down to the one before the rightmost is the index of the packet, and the rightmost bit signals whether it’s the first or the second bit of the packet.

    For example, if I have 11 01, the memory would look a bit like this:

    00: 1

    01: 1

    10: 0

    11: 1

    If I want to get the first packet, I just have to ask: which cells have their leftmost address bit set to 0? We can add as many packets as we want, and the pattern still holds.
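
    In code, that addressing rule could look something like this hypothetical Python sketch (the names are mine):

        # Memory of 1-bit cells; a cell's address is the packet index with one
        # extra bit appended on the right: (packet_index << 1) | bit_position.
        memory = {0b00: 1, 0b01: 1, 0b10: 0, 0b11: 1}   # the "11 01" example above

        def read_packet(index):
            first = memory[(index << 1) | 0]
            second = memory[(index << 1) | 1]
            return "{}{}".format(first, second)

        print(read_packet(0))  # 11
        print(read_packet(1))  # 01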

    If you were to store information that is 3 bits in size, or any size that isn’t a power of 2, you wouldn’t get such an easy addressing pattern. If I were to store, for example, 101 110, I would get something like this:

    000: 1

    001: 0

    010: 1

    011: 1

    100: 1

    101: 0

    There is no pattern I can pull out of the addressing to access the information. Whereas with 2-bit packets I can take all but the last bit of the address as the packet index, I can’t do that for information whose size isn’t a power of 2 (3, 5, 10, etc.).

    The solution, of course, would be to make the memory sets 3 bits wide, but we’d run into the same “problem” whenever the information isn’t a multiple of 3 bits. In fact, we’d run into another, similar problem that hides in the sets themselves rather than in the indexing.

    Let’s say we want to put information that is 1 bit in length into a memory with as many 3-bit sets as we want, and I put in 1 0 1 1:

    0: 101

    1: 001

    There isn’t an easy pattern here either. If I want to get the second item (index 1), I have to compute 1 / 3 to find which memory address it lives in, then 1 % 3 to find its position within that set. If I want the 4th item (index 3), for example, I get 3 / 3 = 1 and 3 % 3 = 0, so the second set, position 0. Granted, both results come out of a single division operation, but that’s still slower than just shifting bits.
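
    As a hypothetical Python sketch of that lookup (names and layout are mine, padding bits left out):

        # 1-bit items packed densely into 3-bit sets; finding item i needs a division.
        # Bit positions are counted from the right, as in the listing above.
        sets = [0b101, 0b001]   # the bits 1 0 1 1 from the example

        def read_bit(i):
            set_index, position = divmod(i, 3)   # which set, and which bit inside it
            return (sets[set_index] >> position) & 1

        print(read_bit(1))  # 0 -- second item: 1 // 3 == 0, 1 % 3 == 1
        print(read_bit(3))  # 1 -- fourth item: 3 // 3 == 1, 3 % 3 == 0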

    One could also just skip one bit when storing 1-bit items in 3-bit sets. The memory would then look like this:

    0: 001

    1: 101

    You could then access the nth item by using the leftmost bits of its index to pick the memory set, and the rightmost bit to decide whether to take the first or the third bit of that set. For example, say I want the 4th item (index 11): the left bit, 1, is the index of the set, and the right bit, 1, says we need to take the 3rd bit.
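
    A hypothetical Python sketch of that scheme (same convention as the listing above, with bit positions counted from the right):

        sets = [0b001, 0b101]   # the bits 1 0 1 1, with one unused bit per set

        def read_bit(i):
            set_index = i >> 1                 # the leftmost bit(s) of the index
            position = 2 if (i & 1) else 0     # rightmost index bit: third bit or first bit
            return (sets[set_index] >> position) & 1

        print(read_bit(0b11))  # 1 -- the 4th item: set 1, third bit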

    This is better, but then we’d need to work out how much space is wasted for each information size: 4-bit items would waste 2 bits, 5-bit items 1 bit, and the 1-bit scheme above wastes 1 bit per set; in general it comes out to roughly 3 - (n % 3) padding bits. That needs another modulo, so while access is less of a problem, determining how much space is needed requires another awkward computation.

    A final example: putting 1-bit items into 2-bit sets gives us this (with the same input as before):

    0: 01

    1: 11

    The third bit (index 10) can be accessed by taking set 1 (from the leftmost bit of the index) at the position given by the rightmost bit of the index, which does give us the bit 1, the third bit of 1011.
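
    The same lookup as a Python sketch (again with positions counted from the right); note that it needs only shifts and masks, no division:

        sets = [0b01, 0b11]   # the bits 1 0 1 1, packed two per 2-bit set

        def read_bit(i):
            return (sets[i >> 1] >> (i & 1)) & 1   # set = high bit, position = low bit

        print(read_bit(0b10))  # 1 -- the third bit of 1011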

    Now, if we want to store items that are 3 bits in size, we have to use the same technique as when fitting 2 bits into a 3-bit set: adding padding.

    Let’s say we want to store 011 101 in 2-bit sets:

    00: 01

    01: 01

    10: 10

    11: 01

    To determine the number of padding bits, we do 3 % 2, and % 2 is very easy for computers, since you just take the last bit (the same rule holds for % 2^n: you take the last n bits). Next, if I want to access the second item (index 1), I just multiply the index by 2 (also easy for computers, since it’s just a bit shift), then take that block and the block right after it: blocks 10 and 11, which give me 101.
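
    And a hypothetical Python sketch of this last layout (my names; here the padding bit sits just before the final data bit):

        # 3-bit items stored across 2-bit sets, with one padding bit per item (3 % 2 == 1).
        sets = {0b00: "01", 0b01: "01", 0b10: "10", 0b11: "01"}   # "011 101" from above

        def read_item(i):
            first = sets[i << 1]          # the item's high two bits
            second = sets[(i << 1) | 1]   # a padding bit, then the item's low bit
            return first + second[1:]     # drop the padding bit

        print(read_item(0))  # 011
        print(read_item(1))  # 101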

    Keep in mind, this only applies to machines that are meant to handle information of any bit size as efficiently as possible. If all your information is 5 bits wide, there is no point in sticking to a 2^n size, as you figured.

    I was also just pulling these counter-examples together on the fly, and I’m not in the best of states, so if some passages seem a bit odd or don’t explain things very well, please let me know.

    Edit: formatting