It's such a common problem but the answers are hard to find. I want to measure the performance of [ web server 95th percentile response time | API calls | algorithm performance | disk I/O | whatever ]. But, you know, that's a lot of data and I don't want to store it because this is used in production. Also, I don't want to spend a lot of CPU time calculating how slow my software is.
If you search for answers, you'll see many references to ancient algorithms that store a ton of data in bins or keep a large reservoir of random sample data. Common results include P-square and binmedian , and notice it's hard to find any decent implementations because although they're commonly suggested they're also garbage and nobody with a clue uses them.
You'll also find clever-sounding answers you can't implement because half the explanation is missing. Maybe if you were a stats major you'd understand this.
So what can I use to get cheap performance statistics? Algorithm and source code, please.