Questions tagged [statistics]

Mathematical statistics is the study of statistics from a mathematical standpoint, using probability theory and other branches of mathematics such as linear algebra and analysis.

Statistics is the science of the collection, organization, and interpretation of data. It deals with many aspects of data, which includes the planning of data collection in terms of the design of surveys and experiments. (From Wikipedia)

More specifically, mathematical statistics is the study of statistics from a mathematical standpoint, using probability theory as well as other branches of mathematics such as linear algebra and mathematical analysis. (From Wikipedia)

For questions which are more generally about collecting and treating data, it is advised that you post your question on Cross Validated and DSSE.

33922 questions
439
votes
4 answers

What is the intuitive relationship between SVD and PCA?

Singular value decomposition (SVD) and principal component analysis (PCA) are two eigenvalue methods used to reduce a high-dimensional data set into fewer dimensions while retaining important information. Online articles say that these methods are…
187
votes
7 answers

How to intuitively understand eigenvalue and eigenvector?

I'm learning multivariate analysis and I have learnt linear algebra for two semester when I was a freshman. Eigenvalue and eigenvector is easy to calculate and the concept is not difficult to understand.I found that there are many application of…
141
votes
3 answers

Why is the eigenvector of a covariance matrix equal to a principal component?

If I have a covariance matrix for a data set and I multiply it times one of it's eigenvectors. Let's say the eigenvector with the highest eigenvalue. The result is the eigenvector or a scaled version of the eigenvector. What does this really…
Ryan
  • 5,069
  • 6
  • 18
  • 10
117
votes
8 answers

derivative of cost function for Logistic Regression

I am going over the lectures on Machine Learning at Coursera. I am struggling with the following. How can the partial derivative of $$J(\theta)=-\frac{1}{m}\sum_{i=1}^{m}y^{i}\log(h_\theta(x^{i}))+(1-y^{i})\log(1-h_\theta(x^{i}))$$ where…
116
votes
4 answers

What is the difference and relationship between the binomial and Bernoulli distributions?

How should I understand the difference or relationship between binomial and Bernoulli distribution?
user122358
  • 2,552
  • 5
  • 20
  • 32
113
votes
6 answers

How often does it happen that the oldest person alive dies?

Today, we are brought the sad news that Europe's oldest woman died. A little over a week ago the oldest person in the U.S. unfortunately died. Yesterday, the Netherlands' oldest man died peacefully. The Gerontology Research Group keeps records:…
Řídící
  • 3,048
  • 6
  • 19
  • 39
107
votes
9 answers

If I flip a coin 1000 times in a row and it lands on heads all 1000 times, what is the probability that it's an unfair coin?

Consider a two-sided coin. If I flip it $1000$ times and it lands heads up for each flip, what is the probability that the coin is unfair, and how do we quantify that if it is unfair? Furthermore, would it still be considered unfair for $50$…
Adam Freymiller
  • 1,051
  • 2
  • 8
  • 7
100
votes
8 answers

Is Bayes' Theorem really that interesting?

I have trouble understanding the massive importance that is afforded to Bayes' theorem in undergraduate courses in probability and popular science. From the purely mathematical point of view, I think it would be uncontroversial to say that Bayes'…
user78270
  • 3,718
  • 2
  • 18
  • 27
97
votes
6 answers

Variance of sample variance?

What is the variance of the sample variance? In other words I am looking for $\mathrm{Var}(S^2)$. I have started by expanding out $\mathrm{Var}(S^2)$ into $E(S^4) - [E(S^2)]^2$ I know that $[E(S^2)]^2$ is $\sigma$ to the power of 4. And that is as…
MathMan
  • 1,151
  • 1
  • 8
  • 8
91
votes
10 answers

What's so special about standard deviation?

Equivalently, about variance? I realize it measures the spread of a distribution, but many other metrics could do the same (e.g., the average absolute deviation). What is its deeper significance? Does it have a particular geometric interpretation…
blue_note
  • 1,235
  • 7
  • 12
79
votes
4 answers

Intuition behind using complementary CDF to compute expectation for nonnegative random variables

I've read the proof for why $\int_0^\infty P(X >x)dx=E[X]$ for nonnegative random variables (located here) and understand its mechanics, but I'm having trouble understanding the intuition behind this formula or why it should be the case at all. Does…
bouma
  • 1,015
  • 1
  • 9
  • 8
78
votes
5 answers

Density of sum of two independent uniform random variables on $[0,1]$

I am trying to understand an example from my textbook. Let's say $Z = X + Y$, where $X$ and $Y$ are independent uniform random variables with range $[0,1]$. Then the PDF is $$f(z) = \begin{cases} z & \text{for $0 < z < 1$} \\ 2-z & \text{for $1 \le…
76
votes
6 answers

How was the normal distribution derived?

Abraham de Moivre, when he came up with this formula, had to assure that the points of inflection were exactly one standard deviation away from the center, and so that it was bell-shaped, as well as make sure that the area under the curve was…
73
votes
8 answers

incremental computation of standard deviation

How can I compute the standard deviation in an incremental way (using the new value and the last computed mean and/or std deviation) ? for the non incremental way, I just do something…
72
votes
3 answers

Intuitive explanation of a definition of the Fisher information

I'm studying statistics. When I read the textbook about Fisher Information, I couldn't understand why the Fisher Information is defined like this: $$I(\theta)=E_\theta\left[-\frac{\partial^2 }{\partial \theta^2}\ln P(\theta;X)\right].$$ Could anyone…
maple
  • 2,453
  • 2
  • 23
  • 33
1
2 3
99 100