Questions tagged [floating-point]

Floating point numbers are approximations of real numbers that can represent larger ranges than integers but use the same amount of memory, at the cost of lower precision. If your question is about small arithmetic errors (e.g. why does 0.2 + 0.1 equal 0.300000001?) or decimal conversion errors, please read the "info" page linked below before posting.

Many questions asked here concern small inaccuracies in floating point arithmetic. To use the example from the excerpt, 0.2 + 0.1 might result in 0.300000001 instead of the expected 0.3. Errors like these are caused by the way floating point numbers are represented in a computer's memory.
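A minimal sketch of this kind of error, written in Python on the assumption that its float is an IEEE 754 double (true on typical platforms). The exact digits printed depend on the precision and on how the language formats numbers, so a double shows the discrepancy further out than the shortened figure quoted above.

```python
import math

a = 0.1 + 0.2

# The sum is not exactly the double nearest to 0.3.
print(a == 0.3)              # False
print(repr(a))               # 0.30000000000000004

# A tolerance-based comparison treats the two values as equal.
print(math.isclose(a, 0.3))  # True
```

Comparing with a tolerance, as math.isclose does, is one common way to cope with such differences; whether it is appropriate depends on the computation.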

Integers are stored as exact values of the numbers they represent. Floating point numbers are stored as two values: a significand and an exponent. Because both have a fixed, finite number of bits, only a finite set of significand-exponent pairs exists, and most real numbers do not match any of them exactly. As a result, some approximation and therefore inaccuracy is unavoidable.
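The significand-exponent representation can be inspected directly. The sketch below uses Python's standard math.frexp and fractions.Fraction, again assuming float is an IEEE 754 double. It shows that the value stored for 0.1 is a ratio with a power-of-two denominator and is only close to 1/10.

```python
import math
from fractions import Fraction

x = 0.1

# frexp splits x into a significand m (0.5 <= |m| < 1) and a
# power-of-two exponent e such that x == m * 2**e.
mantissa, exponent = math.frexp(x)
print(mantissa, exponent)                    # 0.8 -3
print(math.ldexp(mantissa, exponent) == x)   # True

# Fraction(x) gives the exact value actually stored for 0.1.
stored = Fraction(x)
print(stored)
print(stored == Fraction(1, 10))             # False: 1/10 is only approximated
```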

Two commonly cited introductory-level resources about floating point math are What Every Computer Scientist Should Know About Floating-Point Arithmetic and The Floating-Point Guide (floating-point-gui.de).

FAQs:

Why 0.1 does not exist in floating point

Floating Point Math at https://0.30000000000000004.com/

Related tags:

ieee-754 (most used standard for floating-point computation)
- half-precision-float (16b float)
- single-precision (32b float)
- double-precision (64b float)
- extended-precision (80b float, usually)
- quadruple-precision (128b float)
types in c and c++
- double
- long-double
aspects of floating point numbers and computations

Programming languages where all numbers are double-precision (64b) floats:

javascript (see Number.MAX_SAFE_INTEGER on MDN and What is JavaScript's highest integer value that a Number can go to without losing precision?)
awk (see Expressions in awk in POSIX)
lua (up to 5.2 only, 5.3 introduced integers; see Changes in the Language in Lua 5.3 manual)
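In such languages, integers are exact only up to the width of the double's significand. This is why JavaScript's Number.MAX_SAFE_INTEGER is 2^53 - 1. The following is a rough illustration in Python rather than JavaScript, assuming Python's float is an IEEE 754 double:

```python
limit = 2.0 ** 53

# Below the limit, consecutive integers are still distinct doubles.
print(limit - 1.0 == limit)              # False

# 2**53 + 1 has no exact double; it rounds back to 2**53.
print(limit + 1.0 == limit)              # True
print(float(2**53 + 1) == float(2**53))  # True
```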

13427 questions

579

votes

5 answers

Why does Math.round(0.49999999999999994) return 1?

In the following program you can see that each value slightly less than .5 is rounded down, except for 0.5. for (int i = 10; i >= 0; i--) { long l = Double.doubleToLongBits(i + 0.5); double x; do { x =…

java floating-point double rounding

asked Mar 28 '12 at 07:30

Peter Lawrey

498,481
72
700
1,075

577

votes

32 answers

What is the most effective way for float and double comparison?

What would be the most efficient way to compare two double or two float values? Simply doing this is not correct: bool CompareDoubles1 (double A, double B) { return A == B; } But something like: bool CompareDoubles2 (double A, double B) { …

c++ algorithm optimization floating-point

asked Aug 20 '08 at 02:09

Alex

5,795
3
15
3

542

votes

27 answers

How to nicely format floating numbers to string without unnecessary decimal 0's

A 64-bit double can represent integer +/- 2^53 exactly. Given this fact, I choose to use a double type as a single type for all my types, since my largest integer is an unsigned 32-bit number. But now I have to print these pseudo integers, but the…

java string floating-point format double

asked Mar 31 '09 at 22:54

Pyrolistical

26,088
21
78
104

523

votes

5 answers

Correct format specifier for double in printf

What is the correct format specifier for double in printf? Is it %f or is it %lf? I believe it's %f, but I am not sure. Code sample: #include <stdio.h> int main() { double d = 1.4; printf("%lf", d); // Is this wrong? }

c floating-point printf double format-specifiers

asked Nov 24 '10 at 06:45

Leopard

5,241
3
13
4

508

votes

5 answers

How to get a random number between a float range?

randrange(start, stop) only takes integer arguments. So how would I get a random number between two float values?

python random floating-point

asked May 22 '11 at 13:00

Mantis Toboggan

5,645
3
16
10

458

votes

17 answers

How to parse float with two decimal places in javascript?

I have the following code. I would like to have it such that if price_result equals an integer, let's say 10, then I would like to add two decimal places. So 10 would be 10.00. Or if it equals 10.6 would be 10.60. Not sure how to do…

javascript floating-point

asked Dec 14 '10 at 01:41

user357034

9,761
18
53
70

456

votes

13 answers

What is the difference between float and double?

I've read about the difference between double precision and single precision. However, in most cases, float and double seem to be interchangeable, i.e. using one or the other does not seem to affect the results. Is this really the case? When are…

c++ c floating-point precision

asked Mar 05 '10 at 12:48

VaioIsBorn

6,893
9
29
27

450

votes

14 answers

How to format a float in javascript?

In JavaScript, when converting from a float to a string, how can I get just 2 digits after the decimal point? For example, 0.34 instead of 0.3445434.

javascript floating-point

asked Mar 19 '09 at 09:38

F40

404

votes

11 answers

How dangerous is it to compare floating point values?

I know UIKit uses CGFloat because of the resolution independent coordinate system. But every time I want to check if for example frame.origin.x is 0 it makes me feel sick: if (theView.frame.origin.x == 0) { // do important operation } Isn't…

objective-c ios c floating-point floating-accuracy

asked Apr 26 '12 at 13:41

Proud Member

38,700
43
143
225

387

votes

16 answers

What is the best way to compare floats for almost-equality in Python?

It's well known that comparing floats for equality is a little fiddly due to rounding and precision issues. For example: https://randomascii.wordpress.com/2012/02/25/comparing-floating-point-numbers-2012-edition/ What is the recommended way to deal…

python floating-point

asked Apr 08 '11 at 13:02

Gordon Wrigley

9,129
8
41
59

374

votes

16 answers

How do I print a double value with full precision using cout?

In my earlier question I was printing a double using cout that got rounded when I wasn't expecting it. How can I make cout print a double using full precision?

c++ floating-point precision iostream cout

asked Feb 16 '09 at 18:15

Jason Punyon

37,168
13
93
118

371

votes

8 answers

How to convert float to int with Java

I used the following line to convert float to int, but it's not as accurate as I'd like: float a=8.61f; int b; b=(int)a; The result is: 8 (it should be 9). When a = -7.65f, the result is: -7 (it should be -8). What's the best way to do it?

java floating-point int

asked Aug 18 '09 at 17:41

Frank

28,342
54
158
227

370

votes

11 answers

JavaScript displaying a float to 2 decimal places

I wanted to display a number to 2 decimal places. I thought I could use toPrecision(2) in JavaScript. However, if the number is 0.05, I get 0.0500. I'd rather it stay the same. See it on JSbin. What is the best way to do this? I can think of coding…

javascript floating-point precision

asked Jul 02 '10 at 03:21

alex

438,662
188
837
957

343

votes

6 answers

Double vs. BigDecimal?

I have to calculate some floating point variables, and my colleague suggested that I use BigDecimal instead of double since it will be more precise. But I want to know what it is and how to make the most of BigDecimal.

java floating-point double bigdecimal

asked Aug 05 '10 at 09:39

Truong Ha

9,070
10
35
45

311

votes

2 answers

What does the constant 0.0039215689 represent?

I keep seeing this constant pop up in various graphics header files 0.0039215689 It seems to have something to do with color maybe? Here is the first hit on Google: void RDP_G_SETFOGCOLOR(void) { Gfx.FogColor.R = _SHIFTR(w1, 24, 8) *…

c floating-point constants magic-numbers

asked Mar 24 '14 at 21:45

crush

15,889
8
54
95
