An integral gives you the area between the horizontal axis
and the curve. Most of the time this is the x axis.

```
y
| |
--|-- ----|---- f(x)
/ | \ / |
/ | -------- |
| / | |
-----|------- | |
| | |
| | |
----------|--------------+--------------------|----- x
a b
```

And the area enclosed is:

Area = $\int^b_a f(x) dx$

But say you didn't want to use an integral to measure the
area between the x axis and the curve. Instead you just
caclulate the average value of the graph between a and b
and draw a striaght flat line y = avg(x) (the average
value of x in that range).

Now you have a graph like this:

```
y
| |
- | - - - | - - f(x)
| / | \ / |
-----|-----------------------------------|---- avg(x)
| / | |
- - -|- - - - | |
| | |
| | |
----------|--------------+--------------------|----- x
a b
```

And the area enclosed is a rectangle:

Area = avg(x) w where w is the width iof the section

The height is avg(x) and the width is w = b-a or in English,
"the width of a slice of the x axis going from a to b."

But say you need a more accurate area. You could break the
graph up into smaller sections and make rectangles out of them.
Say you make 4 equal sections:

```
y
| |
|----|---| |-------|---- f(x)
| | | | |
| | |--------| |
| | | | | |
-----|---------| | | | |
| | | | | |
| | | | | |
----------|---------|----+---|--------|-------|----- x
a b
```

And the area is:

Area = section 1 + section 2 + section 3 + section 4

= avg(x,1) w + avg(x,2) w + avg(x,3) w + avg(x,4) w

where w is the width of each section. The sections are all
the same size, so in this case w=(b-a)/4 or in English,
"the width of a thin slice of the x axis or 1/4 of the
width from a to b."

And if we write this with a sumation we get:

Area = $\sum^4_{n=1}avg(x,n) w$

But it's still not accurate enough. Let's use
an infinite number of sections. Now our area
becomes a summation of an infinite number of sections.
Since it's an infinite sum, we will use the
integral sign instead of the summation sign:

Area = $\int avg(x) w$

where avg(x) for an infinitely thin section will be
equal to f(x) in that section, and w will be "the width of
an infinitely thin section of the x axis."

So instead of avg(x) we can write f(x), because they are
the same if the average is taken over an infinitely small width.

And we can rename the w variable to anything we want.
The width of a section is the difference between the
right side and the left side. The difference between
two points is often called the delta of those values.
So the difference of two x values (like a and b) would
be called delta-x. But that is too long to use in an
equation, so when we have an infinitely small delta,
it is shortened to dx.

If we replace avg(x) and w with these equivalent things:

Area = $\int f(x) dx$

So what the equation says is:

Area equals the sum of an infinite number of rectangles
that are f(x) high and dx wide (where dx is an infinitely
small distance).

So you need the dx because otherwise you aren't summing
up rectangles and your answer wouldn't be total area.

dx literally means "an infinitely small width of x".

It even means this in derivatives. A derivative of
a function is the slope of the graph at that point.
Slope is usually measured as the y difference of two
points divided by the x difference of those points:

Slope = (y2 - y1) / (x2 - x1)

But the closer these points get the smaller these
differences get. Let's start calling them deltas,
because the difference between two points is often
called the delta of those values.

Slope = delta-y / delta-x

The deltas get smaller and smaller as these two x,y
points get closer and closer. When they are an
infinitely small distance apart, then the delta-y
and delta-x is shortened to dy and dx:

Slope = dy / dx

The slope is still Slope = (y2 - y1) / (x2 - x1)
but these points are infinitely close together, so
we use dy and dx to tell ourselves that they are
infinitely close or "differential distances."