100

Base $10$ uses these digits: $\{0,1,2,3,4,5,6,7,8,9\};\;$ base $2$ uses: $\{0,1\};\;$ but what would base $1$ be?


Let's say we define Base $1$ to use: $\{0\}$. Because $10_2$ is equal to $010_2$, would all numbers be equal?

The way I have thought Base 1 might be represented is tally marks, $0_{10}$ would be represented by nothing. So, $5$ in Base 1 would be represented by $00000$? Or we could define Base 1 to use: $\{$|$\}$ and $5$ would be |||||?

Justin
  • 2,447
  • 4
  • 18
  • 38
  • 1
    @Name. Whoever you are, I just want to advice you a little bit, I think you shouldn't spend so much reputation on these old questions. – Rounak Sarkar Oct 10 '21 at 04:07
  • 1
    My real name is Jitendra Singh and I am spending much rep so that these question answers are active and new people can read them @RounakSarkar –  Oct 10 '21 at 04:10

6 Answers6

64

You're exactly right that such a system would be represented by the use of arbitrary tally marks. Such a system is known as a Unary Numeral System (Wikipedia Entry):

The unary numeral system is the bijective base-1 numeral system. It is the simplest numeral system to represent natural numbers: in order to represent a number N, an arbitrarily chosen symbol representing 1 is repeated N times. This system is used in tallying. For example, using the tally mark |, the number 6 is represented as ||||||.

...

There is no explicit symbol representing zero in unary as there is in other traditional bases, so unary is a bijective numeration system with a single digit. If there were a 'zero' symbol, unary would effectively be a binary system. [boldface mine] In a true unary system there is no way to explicitly represent none of something, though simply making no marks represents it implicitly. Even in advanced tallying systems like Roman numerals, there is no zero character; instead the Latin word for "nothing," nullae, is used.

amWhy
  • 204,278
  • 154
  • 264
  • 488
  • 1
    for representing 0 you can define a number to be represented by 1+itself symbols. so 0 would be `|` and 1 would be `||` etc. – ratchet freak Apr 25 '13 at 10:51
  • @ratchetfreak Why not just use `_` (nothing, math.stackexchange.com doesn't allow me to use ' ')? When we use tally marks, `|` is 1, `||` is 2, and `_` (nothing) is 0. – Justin Apr 25 '13 at 17:25
  • @gangqinlaohu see the second quote block in this answer ;) – ratchet freak Apr 25 '13 at 17:28
  • 1
    @ratchetfreak Exactly, 0 is represented by nothing – Justin Apr 25 '13 at 17:29
  • @gangqinlaohu unless you specify that 1 is represented by 2 tallies – ratchet freak Apr 25 '13 at 17:30
  • For what it's worth, you can choose arbitrary symbols in *any* numeral system; separating syntax and semantics is a general concept. Note that in formal language theory, the empty word aka "nothing" (with interpretation $0$ in this context) is usually denoted by $\varepsilon$ or $\lambda$. – Raphael Oct 14 '13 at 23:04
  • @amWhy, So this begs the question, why does "base 2" use `[0, 1]` instead of `[1, 2]` ? – Pacerier Mar 21 '17 at 15:43
  • Do you know of any bit strings that use only $1's$ and $2's$? Just like base 10 uses only the digits in $\{0, 1, 2, 3, 4, 5, 6, 7, 8, 9\}$. Take for example $1101_2 = 1\times 2^0 + 0\times 2^1 + 1 \times 2^2, 1\times 2^3)_10.$ – amWhy Mar 21 '17 at 15:50
  • 4
    The bolded statement is not really right. In a base-1 system, it is simply the case that $1 = 10 = 100 = \cdots$ because $1 = 1^2 = 1^3 = \cdots$, so that the $0$ is a redundant symbol. The use of it in the unary system does not *make it binary*. That statement, as well as the following one that there is no explicit symbol for $0$, is sensible only under the presumption that base-representations of natural numbers should be bijective. There may certainly be a $0$ in a unary system. –  Sep 30 '17 at 19:00
  • 1
    The unary / tally mark system is a bijective base, not a positional base. The examples given with base 10 and 2 in the question are positional bases. In a positional base 1, you only got one digit, with no value: 0. All positions will have zero value, and you can only represent one number: 0. – Bijective base 1 would be one way to make it funcitonal, but that isn't a positional base. – Liggliluff Feb 20 '19 at 22:36
  • 4
    The others answer by MJD is miles ahead of this one... yet this one, citing a pretty bad Wikipedia paragraph, because of being accepted answer is upvoted to the sky. – Santropedro May 03 '19 at 20:57
  • 1
    I defer to your thought here. In base 1, we surely use random tally mark of our choice but normally '1' is used. I know, the digit should be 1 less than base but using '1' as symbol perfectly works to represent any decimal number e.g., (5)base10 = (11111)base1 and zero is represented as empty string. Even the wiki page you attached talking same. Also, you can derived decimal number using number expression. – Ubi.B Aug 21 '21 at 06:54
34

I would like to expand on Trevor Wilson's answer. Base-$b$ representation of integers is rooted in the fact that, for any non-negative integer $n$, there is a unique representation of $n$ in the form $$n = \sum_{i=0}^\infty a_ib^i$$ where $0 \le a_i < b$. For example, when $b$ is 3, and $n$ is 47, the unique solution has $a_0 = 2, a_1 = 0, a_2 = 2, a_3 = 1, $ and $a_i = 0$ for all $i>3$. The $a_i$ are called the "base-$b$ digits of $n$"; in our example the base-3 digits of 47 are 1202. We say that the sequence of digits is a numeral, and that it represents the number $n$.

The uniqueness property means that each $n$ has exactly one base-$b$ representation. If one requires that the sequence of $a_i$ is eventually zero (that is, that $a_i = 0$ for all sufficiently large $i$) then the converse holds also: each sequence of digits corresponds to exactly one $n$. In fact there are four properties that hold:

  1. Each $n$ has at least one representation
  2. Each $n$ has no more than one representation
  3. Each representation corresponds to at least one $n$
  4. Each representation corresponds to no more than one $n$

It is quite possible to construct representations that lack some of these properties. For example, consider the base-3 representation, but drop the restriction that says that $0\le a_i < 3$. Then property 2 fails: The number 47 has many base-3 representations: 502, for example, or 362, or 1 12 2 (here $a_1 = 12$), or even one (harder to write) where $a_0 = 47$. Each sequence of digits still represents a single $n$, but a particular $n$ might have many representations as a sequence of digits. Sometimes such representations even have some use.

Some of these properties are more important than others. Property 4, for example, is crucial, because if it doesn't hold, then there is some sequence of digits that might represent two different numbers, and when you see it you don't know what number is being represented. Such a system can't really be called a system for representing numbers.

Similarly, a system which fails to have property 1 has limited usefulness. Such a system can represent some $n$, but not all.

Depending on where and how it fails, a representation might be more or less useful. Fraction notation, for example, is universally used to represent rational numbers. But it fails to have properties 2 and 3! (It fails 2 since each rational number has many representations, say as $\frac12, \frac24, $ or $\frac{120}{240}$. And it fails 3 since $\frac10$ and $\frac00$ do not represent any rational numbers.) But these failures don't prevent it from being useful as a representation of rational numbers. A more serious failure arises if you try to make fractions represent real numbers; then property 1 fails, since there is no fraction representation for the number $\pi$ or $\sqrt2$.

Now let's return to $$n = \sum_{i=0}^\infty a_ib^i.$$ I said that this representation of non-negative integers has all four properties, but I left out an important limitation: the four properties only hold for $b\ge 2$. If $b=1$, the restriction $0\le a_i<b$ degenerates to $a_i=0$, and we can no longer represent any number except 0. So only 0 has a base-1 represenation. As a numeral system, this is completely useless.

If we drop the $0\le a_i<b$ restriction, we get something that hardly resembles a system of representation at all: Each number $n$ now has many base-1 representations For example, one could write 5 as 14, or 32, or 1121.

So, although it is inconsistent, mathematicians, and especially computer scientists, adopt a different meaning for "base-$1$ representation". They abandon $\sum a_ib^i$ completely and agree to represent the number $n$ as a sequence of exactly $n$ ones. For example, $7$ is represented as 1111111. This restores properties 1–4, so it is a sensible representation.

MJD
  • 62,206
  • 36
  • 276
  • 489
  • Another way of looking at it is that they keep the sum definition, but replace `0 <= ai < b` with `0 < ai <= b` – Joe K Apr 25 '13 at 18:43
  • No, because then you can't represent *any* number: you need the sequence to be eventually zero. You can patch it up, but it's a bit more complicated than you said. – MJD Apr 30 '13 at 23:22
  • True. I guess that's just the way I've intuitively thought of it, but you're right, it is more complicated to actually make it work. – Joe K Apr 30 '13 at 23:59
  • The problem you run into is rooted in the fact that you identify symbols (digits) and their meaning (numbers). There is no real reason (as far as I can tell) for disallowing arbitrary $f(a_i)$ instead of $a_i$ in the "value sum", where $f : \Sigma \to \mathbb{N}$ maps digits to numbers -- as long as 1-4 are fulfilled, of course. – Raphael Oct 14 '13 at 23:13
  • @MJD, So if "base 1" is the wrong term, what's the correct term for numbers like `11111`? (to be more accurate, it should be `00000`) – Pacerier Mar 21 '17 at 15:51
19

Yes, the usual answer is that numbers are represented by "tally marks" in base $1$. However, the numeral $0$ might not be the best choice of a tally mark because if $00000$ were interpreted in base $1$ analogously to its interpretation in other bases, then it would be interpreted as $0 \cdot 1^5 + 0\cdot 1^4 + 0 \cdot 1^3 + 0 \cdot 1^2 + 0 \cdot 1^1$, which is $0$ rather than $5$.

Trevor Wilson
  • 16,402
  • 30
  • 66
  • 3
    So... $0=5$ you say? Great! We can all go home early! :-) – Asaf Karagila Apr 25 '13 at 16:59
  • 6
    The trick is not to use the 1-hour clock, but to use $\mathbb{F}_\text{un},$ whose vector spaces are any set. – Loki Clock Apr 26 '13 at 00:29
  • 11
    I've never been quite certain whether $\mathbb{F}_\text{un}$ is a real thing, or someone is just being $\mathbb{F}_\text{unny}$. – Trevor Wilson Apr 26 '13 at 01:20
  • @TrevorWilson, What about fractions for base 1? `11.111`? – Pacerier Mar 21 '17 at 15:56
  • @Pacerier In base 1, setting aside the question of whether digits greater than 0 (such as 1) should be allowed, I think a dot would not change the value of an expression. So for example, 11.111 would just be another way to write five. Because all the positive and negative integer powers of 1 are equal, the value of a digit in base 1 is independent of its position, whether before or after the dot. This makes the dot superfluous as an indicator of position. – Trevor Wilson Apr 03 '17 at 02:30
17

There is no base $1$, and no unary number system. Base $b$ requires at least two symbols from $0$ to $b - 1$. Base $b$ does not use the digit $b$. For instance base $2$ does not use the digit $2$. So any system that uses the digit $1$ cannot be base $1$.

Tally marks are typographic representation of integers, but are not a "base", let alone "base 1".

The printed representation of a number in a base has a length which is proportional to the logarithm of the number. In the tally mark system, the length is proportional to the number.

Bases can represent fractions. For instance 1.11 in binary is one and three quarters. This works thanks to the negative powers to the right of the point. If we multiply one and three quarters by two, we can simply move the binary point: 11.1. By golly, this is three and a half, exactly right.

The best that the tally system can do here is to be repaired with some scheme whereby tally marks after a dot represent an enumeration of the countable set of fractions. For instance .1 could mean half, .11 means a third, .111 two thirds, and so on. But this scheme is fundamentally incompatible with what is on the left of the point. Moving the point has no intuitive meaning.

The use of at least two symbols in bases is related to the field of numbers having two elements: an additive identity (zero) and multiplicative identity (one). The naive tally mark system ignores this concept of a field with two identity elements, which is why it runs aground when it comes to representing fractions. It doesn't even represent zero, except perhaps by means of leaving an ambiguous empty space devoid of tallies to represent nothingness.

Kaz
  • 6,671
  • 1
  • 17
  • 30
  • 5
    "The printed representation of a number in a base has a length which is proportional to the logarithm of the number. In the tally mark system, the length is proportional to the number." Exactly!, in base 1, the representation is proportional the logarithm in base $1 + \epsilon$ of the number, which is effectively linear. In this interpretation base 1 still counts as a base, although a special in that sense. – alfC Apr 25 '13 at 05:27
  • 1
    Your assertion that base 1 not having field identities is causing it to fail to represent fractions is wrong. It's easy to define a base 1 with two symbols, 0 and 1, in the same way other bases are defined, as an infinite string of digits with the number represented being the sum of the base raised to the power of its position multiplied by the digit. It fails to represent fractions, but that's because 1 raised to any integer power is 1, not because of a missing zero element. – Michael Shaw Apr 25 '13 at 14:12
  • 1
    @alfC Sure and we could argue that 1 is a prime number because it's only divisible by 1 and itself. – Kaz Apr 25 '13 at 14:40
  • 2
    @Kaz: We could argue exactly that. We could also argue that 2 is not a prime number because it's even and no other prime number is even. Many older definitions of prime numbers included 1. Mathematical definitions are not engraved on stone and handed to Moses by God, we make them up. We try to make them up to match reality in some way, but nothing says they can't be changed. – Michael Shaw Apr 25 '13 at 18:38
  • We could also argue that 3 is not a prime number, because it's divisible by 3, and no other prime number is divisible by 3, et cetera. That is just silly. The definition of a number base is not carved in stone, but there is no utility in calling something base 1 that doesn't whatsoever resemble bases 2, 3, 4, ... – Kaz Apr 25 '13 at 18:45
  • 1
    @Kaz: In a previous comment I explained how to define base 1 in a way that very strongly resembles the other bases. It's the obvious way to define base 1. Saying "base 1 is less useful than other bases" makes sense, saying "base 1 doesn't exist" is silly. – Michael Shaw Apr 26 '13 at 04:09
  • 1
    @MichaelShaw I read your previous comment, and right off the bat the problem is that your base 1 requires digits 0 and 1, whereas a base N only has digits from 0 to N-1. I am not saying that this sytem doesn't *exist*, just that it shouldn't be called *base*. Words are only useful to the extent that they discern. If we stretch words too thinly to cover incompatible concepts, we blunt their edge. – Kaz Apr 26 '13 at 06:12
  • @Kaz: Saying that it exists, but shouldn't be called a base is reasonable. – Michael Shaw Apr 26 '13 at 12:16
  • @alfC, What do you mean by "*proportional to the logarithm of the number*"? – Pacerier Mar 21 '17 at 16:00
  • @Pacerier, I mean that the length of the representation of a number in base $b$ is proportional to the logarithm in base $b$ of the number. For example 10 (in base 10) has 2 digits ($2 = \log_{10}{10} + 1$), 10000 (in base 10) has 5 digits ($5 = \log_{10}{10000} + 1$). This true in any base (check it for base 2). My point is that "base 1" is a special limit of these where one interprets $\log_{1+\epsilon}$ as a linear function. Seen that way there is a valid concept of "base 1" representation. – alfC Mar 21 '17 at 18:52
  • @Pacerier, also note that "proportional to the logarithm of the number" is a quote from the answer repeated in my comment. – alfC Mar 21 '17 at 20:21
  • All the bases starting at 2 are utilizing 0. Base 1 only uses the number "1" and the only way to express 0 in 1's is to not express (not-1). Base 2, instead of being 1 and 1,1 (or 2) is "not-1" (or 0) and 1. So from there on your base is going to be one more than the largest single digit used. Except for Base 1, because you can't have "one more" when you only have 1. ;) – Ron Kyle Sep 22 '17 at 03:20
  • There _is_ a base-1 system, it’s just that it’s a system of another class — so-called “bijective” (see Henry’s answer; a pity it was posted too late and is undervoted). As there are negapositional and balanced positional numerations, so there are this one too. – arseniiv May 07 '18 at 19:13
6

Though many consider a tally system to be a base 1 system, I think this confuses more than it clarifies. A tally system is not a radix system and does not represent numbers using the same mechanism. The tally system is simply a string whose length represents the number (integer) in question while a radix numeral is a shorthand for a power series.

Therefore to imply that each of these distinct systems have a comparable attribute known as their "base" is misleading. Much like claiming that a playing card deck and a computer plotter each has a comparable operation called "draw".

Ken Clement
  • 171
  • 1
  • 3
  • The tally mark system is a bijective base 1; and the simpliest explanation is that you don't use 0. In bijective base 10, you have digits 1 through A; after 9 comes A, then 11. All numbers in the positional and bijective base are writen the same as long as the positional number does not contain a zero. 829 = 829, but 809 becomes 7A9. – Liggliluff Feb 20 '19 at 22:40
4

$\displaystyle \sum_{i=0}^k a_i b^i$ with $1 \le a_i \le b$ can represent any positive integer uniquely for fixed $b$ and has the tally mark interpretation when $b=1$.

This is called bijective numeration. When $b$ is ten you lose the $0$ numeral but gain a numeral for ten ($A$ say). Many decimal numbers look the same when written in bijective base-ten, such as $432$, but others do not, such as $402$ which becomes $3A2$.

Henry
  • 142,043
  • 9
  • 114
  • 228
  • This can actually represent any non-negative integer, when the sequence of digits is empty. Suppose we sum from $0$ to $n-1$ instead of $k$, and call $n$ the number of digits Then we can adopt the convention that if $n$ is zero, the sum of the terms for $i = 0$ to $-1$ is the empty series. – Kaz Jan 08 '22 at 02:33