MySQL pagination without double-querying?

Question

I was wondering if there was a way to get the number of results from a MySQL query, and at the same time limit the results.

The way pagination works (as I understand it), first I do something like

query = SELECT COUNT(*) FROM `table` WHERE `some_condition`

After I get the num_rows(query), I have the number of results. But then to actually limit my results, I have to do a second query like:

query2 = SELECT COUNT(*) FROM `table` WHERE `some_condition` LIMIT 0, 10

My question: Is there any way to both retrieve the total number of results that would be given, AND limit the results returned, in a single query? Or is there a more efficient way of doing this? Thanks!

Although you wouldn't have COUNT(*) in query2 – dlofrodloh May 10 '16 at 21:18

Derrick · Answer 1 · 2013-08-08T17:39:55.217

70

I almost never do two queries.

Simply return one more row than is needed, only display 10 on the page, and if there are more than are displayed, display a "Next" button.

SELECT x, y, z FROM `table` WHERE `some_condition` LIMIT 0, 11

// iterate through and display 10 rows.

// if there were 11 rows, display a "Next" button.
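A minimal sketch of the fetch-one-extra-row idea, in Python. It uses the standard-library sqlite3 module as a stand-in for MySQL so that it runs on its own (the LIMIT and OFFSET syntax used here is accepted by both). The `items` table, its columns, and the `fetch_page` helper are made up for illustration and are not part of the original answer.

```python
import sqlite3

PAGE_SIZE = 10

def fetch_page(conn, page, page_size=PAGE_SIZE):
    """Return (rows_to_display, has_next) for a 1-based page number."""
    offset = (page - 1) * page_size
    # Ask for one row more than will be displayed.
    cur = conn.execute(
        "SELECT id, name FROM items ORDER BY id LIMIT ? OFFSET ?",
        (page_size + 1, offset),
    )
    rows = cur.fetchall()
    # The extra row is only a signal that another page exists.
    has_next = len(rows) > page_size
    return rows[:page_size], has_next

# Hypothetical demo data: 25 rows in an in-memory database.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE items (id INTEGER PRIMARY KEY, name TEXT)")
conn.executemany(
    "INSERT INTO items (name) VALUES (?)",
    [(f"item {i}",) for i in range(1, 26)],
)

for page in (1, 2, 3):
    rows, has_next = fetch_page(conn, page)
    print(page, len(rows), "show Next" if has_next else "last page")
```

With 25 rows and a page size of 10, pages 1 and 2 report a next page and page 3 does not, all without a COUNT(*) query.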

Your query should return results ordered by relevance, most relevant first. Chances are, most people aren't going to care about going to page 236 out of 412.

When you do a Google search, and your results aren't on the first page, you likely go to page two, not nine.

edited Aug 08 '13 at 17:39

answered Jul 24 '10 at 11:14

Derrick


43

Actually, if I don't find it on the first page of a Google query, usually I do skip to page nine. – Phil May 18 '11 at 05:42
3

@Phil I heard this before but why do that? – May 13 '12 at 04:56
5

A little late, but here is my reasoning. Some searches are dominated by search engine optimized link farms. So the first few pages are the different farms fighting it out for position number 1, the useful result is likely still associated with the query, just not on the top. – Phil Aug 14 '12 at 23:14
4

`COUNT` is an aggregate function. How do you return the count **and** all the results in one query? The above query will only return 1 row, no matter what the `LIMIT` is set at. If you add `GROUP BY`, it'll return all results but the `COUNT` will be inaccurate – pixelfreak Nov 29 '12 at 09:57
1

The result of count is useful to calculate how many pages will be needed to see the total number of rows. – rvazquezglez Mar 22 '13 at 18:21
@pixelfreak count(*) was a mistake, obviously... I updated the query to include x,y,z columns. – Derrick Aug 08 '13 at 17:41
1

I'm way late to the question and answer. This will give you a "next button", but how do you even know if there's going to BE a page 9 unless you query first? If all you're looking for is a next and back, I like this approach! – Matthew Johnson Apr 18 '14 at 21:58
Basically a very good idea, however if you want your pages to tell you how many pages there are, for example Page: 3 / 12, then this will not help. Also, displaying a last-page button is not possible this way. – Steini Jul 18 '14 at 08:53
2

This is one of the approaches recommended by Percona: http://www.percona.com/blog/2008/09/24/four-ways-to-optimize-paginated-displays/ – techdude Feb 03 '15 at 16:18

score 68 · Accepted Answer · answered May 04 '09 at 02:37

68

No, that's how many applications that want to paginate have to do it. It's reliable and bullet-proof, albeit it makes the query twice. But you can cache the count for a few seconds and that will help a lot.
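Caching the count for a few seconds could look roughly like the sketch below. The `CachedCount` class and its parameters are hypothetical names chosen for illustration; `compute` stands for whatever function runs the COUNT(*) query in your application.

```python
import time

class CachedCount:
    """Remember the result of an expensive count for `ttl` seconds."""

    def __init__(self, compute, ttl=5.0, clock=time.monotonic):
        self._compute = compute  # callable that runs the COUNT(*) query
        self._ttl = ttl
        self._clock = clock      # injectable so the cache can be tested
        self._value = None
        self._expires = 0.0

    def get(self):
        now = self._clock()
        # Recompute on first use or once the cached value has expired.
        if self._value is None or now >= self._expires:
            self._value = self._compute()
            self._expires = now + self._ttl
        return self._value

# Usage with a stand-in for the real query:
total = CachedCount(lambda: 412, ttl=5.0)
print(total.get())  # runs the stand-in "query"
print(total.get())  # served from the cache within the 5 second window
```

The trade-off is that the displayed total can be up to `ttl` seconds stale, which is usually acceptable for page counts.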

The other way is to use the SQL_CALC_FOUND_ROWS clause and then call SELECT FOUND_ROWS(). Apart from the fact that you have to put the FOUND_ROWS() call afterwards, there is a problem with this: it triggers a bug in MySQL that affects ORDER BY queries, making them much slower on large tables than the naive approach of two queries.

answered May 04 '09 at 02:37

staticsan


2

It's not quite race-condition proof, however, unless you do the two queries within a transaction. This generally isn't a problem, though. – NickZoic May 04 '09 at 03:23
By "reliable" I meant the SQL itself is always going to return the result you want, and by "bullet-proof" I meant that there are no MySQL bugs hampering what SQL you can use. Unlike using SQL_CALC_FOUND_ROWS with ORDER BY and LIMIT, according to the bug I mentioned. – staticsan May 04 '09 at 04:30
5

On complex queries, using SQL_CALC_FOUND_ROWS to fetch the count in the same query will almost always be slower than doing two separate queries. This is because it means all rows will need to be retrieved in full, regardless of the limit, then only those specified in the LIMIT clause are returned. See also my response which has links. – thomasrutter Sep 08 '11 at 05:21
Depending on the reason you need this, you may also want to think of just not retrieving the total results. It's becoming a more common practice to implement auto-paging methods. Sites like Facebook, Twitter, Bing, and Google have been using this method for ages. – Thomas B Nov 30 '12 at 06:24

thomasrutter · Answer 3 · 2011-09-08T05:27:19.127

27

Another approach to avoiding double-querying is to fetch all the rows for the current page using a LIMIT clause first, then only do a second COUNT(*) query if the maximum number of rows were retrieved.

In many applications, the most likely outcome will be that all of the results fit on one page, and having to do pagination is the exception rather than the norm. In these cases, the first query will not retrieve the maximum number of results.

For example, answers on a Stack Overflow question rarely spill onto a second page. Comments on an answer rarely spill over the limit of 5 or so required to show them all.

So in these applications you can simply do a query with a LIMIT first, and then as long as that limit is not reached, you know exactly how many rows there are without the need to do a second COUNT(*) query - which should cover the majority of situations.
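The decision of whether the second COUNT(*) query is needed can be sketched as a small Python function. The name `total_without_count` and its parameters are hypothetical; it returns the total when it can be deduced from the page just fetched, and None when a COUNT(*) query is still required.

```python
from typing import Optional

def total_without_count(offset: int, page_size: int, rows_returned: int) -> Optional[int]:
    """Deduce the total row count from one LIMIT query, if possible."""
    if rows_returned >= page_size:
        # The page is full, so more rows may exist: COUNT(*) is needed.
        return None
    if rows_returned == 0 and offset > 0:
        # Requested a page past the end: only an upper bound is known.
        return None
    # A partly filled page means the result set ends here.
    return offset + rows_returned

print(total_without_count(offset=0, page_size=25, rows_returned=3))
print(total_without_count(offset=0, page_size=25, rows_returned=25))
print(total_without_count(offset=200, page_size=25, rows_returned=7))
```

The empty-page case is an extra guard added in this sketch: an empty result at a non-zero offset only says the total is at most `offset`, so it falls back to counting.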

edited Sep 08 '11 at 05:27

answered Sep 08 '11 at 05:19

thomasrutter


1

@thomasrutter I had the same approach, however discovered a flaw with it today. The final page of results will not then have the pagination data. i.e., let's say each page should have 25 results, the last page will likely not have that many, let's say it has 7... that means the count(*) will never be run, and so no pagination will be displayed to the user. – duellsy Aug 21 '12 at 06:34
2

No - if you are say, 200 results in, you query the next 25 and you only get 7 back, that tells you that the total number of results is 207 and therefore you don't need to do another query with COUNT(*) because you already know what it's going to say. You have all the information you need to show pagination. If you are having a problem with pagination not showing to the user then you have a bug somewhere else. – thomasrutter Aug 22 '12 at 02:38

thomasrutter · Answer 4 · 2011-09-08T05:24:06.320

In most situations it is much faster and less resource intensive to do it in two separate queries than to do it in one, even though that seems counter-intuitive.

If you use SQL_CALC_FOUND_ROWS, then for large tables it makes your query much slower, significantly slower even than executing two queries, the first with a COUNT(*) and the second with a LIMIT. The reason for this is that SQL_CALC_FOUND_ROWS causes the LIMIT clause to be applied after fetching the rows instead of before, so it fetches the entire row for all possible results before applying the limits. This can't be satisfied by an index because it actually fetches the data.

If you take the two-query approach, with the first query fetching only COUNT(*) and no actual data, it can be satisfied much more quickly because it can usually use indexes and doesn't have to fetch the actual row data for every row it looks at. Then, the second query only needs to look at the first $offset+$limit rows and then return.
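The shape of the two-query approach can be sketched as follows. This is an illustration only: it uses Python's standard-library sqlite3 module as a stand-in for MySQL, so it says nothing about MySQL's index usage or timings, and the `products` table, the `price` filter, and the `count_then_page` helper are invented for the example.

```python
import sqlite3

def count_then_page(conn, min_price, page, page_size=10):
    """Run COUNT(*) first, then a LIMIT query with the same WHERE clause."""
    where = "price >= ?"
    # Query 1: only the count, no row data.
    (total,) = conn.execute(
        f"SELECT COUNT(*) FROM products WHERE {where}", (min_price,)
    ).fetchone()
    # Query 2: only the rows for the requested page.
    rows = conn.execute(
        f"SELECT id, price FROM products WHERE {where} "
        "ORDER BY id LIMIT ? OFFSET ?",
        (min_price, page_size, (page - 1) * page_size),
    ).fetchall()
    return total, rows

# Hypothetical demo data: 40 products priced 1..40.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE products (id INTEGER PRIMARY KEY, price INTEGER)")
conn.executemany(
    "INSERT INTO products (price) VALUES (?)", [(p,) for p in range(1, 41)]
)

total, rows = count_then_page(conn, min_price=11, page=3)
print(total, len(rows))
```

Keeping the WHERE clause in one variable is a design choice to make sure both queries filter on exactly the same condition.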

This post from the MySQL performance blog explains this further:

http://www.mysqlperformanceblog.com/2007/08/28/to-sql_calc_found_rows-or-not-to-sql_calc_found_rows/

For more information on optimising pagination, check this post and this post.

score 5 · Answer 5 · answered Jun 11 '20 at 15:54

For anyone looking for an answer in 2020. As per MySQL documentation:

"The SQL_CALC_FOUND_ROWS query modifier and accompanying FOUND_ROWS() function are deprecated as of MySQL 8.0.17 and will be removed in a future MySQL version. As a replacement, considering executing your query with LIMIT, and then a second query with COUNT(*) and without LIMIT to determine whether there are additional rows."

I guess that settles that.

https://dev.mysql.com/doc/refman/8.0/en/information-functions.html#function_found-rows

Kama · Answer 6 · 2012-04-09T06:00:39.033

2

My answer may be late, but you can skip the second query (with the limit) and just filter the info through your back end script. In PHP for instance, you could do something like:

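// assumes $queryResult is an array holding every matching row
if (!empty($queryResult)) {
    $counter = 0;
    foreach ($queryResult as $result) {
        // only handle rows inside the current page window;
        // $numOfRows is the page size, so the window ends at $startAt + $numOfRows
        if ($counter >= $startAt && $counter < $startAt + $numOfRows) {
            // do what you want here
        }
        $counter++;
    }
}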

But of course, when you have thousands of records to consider, it becomes inefficient very fast. A pre-calculated count may be a good idea to look into.

Here's a good read on the subject: http://www.percona.com/ppc2009/PPC2009_mysql_pagination.pdf

edited Apr 09 '12 at 06:00

answered Apr 09 '12 at 05:42

Kama


Link's dead, I guess this is the correct one: http://www.percona.com/files/presentations/ppc2009/PPC2009_mysql_pagination.pdf. Won't edit because not sure if it is. – hectorg87 Jul 10 '14 at 13:53

score 2 · Answer 7 · edited May 27 '21 at 16:27

2

query = SELECT col, col2, (SELECT COUNT(*) FROM `table`)/10 AS total FROM `table` WHERE `some_condition` LIMIT 0, 10

Here 10 is the page size and 0 is the row offset, which is (pageNumber-1) multiplied by the page size (so 0 for the first page).
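In `LIMIT offset, row_count` the first number is a row offset rather than a page number, and a count divided by the page size has to be rounded up to give a page count. A small Python sketch of that arithmetic, with hypothetical helper names:

```python
def limit_offset(page_number: int, page_size: int) -> int:
    """Row offset for a 1-based page number, for use in LIMIT offset, size."""
    if page_number < 1:
        raise ValueError("page_number starts at 1")
    return (page_number - 1) * page_size

def page_count(total_rows: int, page_size: int) -> int:
    """Number of pages needed, rounding a partial last page up."""
    return (total_rows + page_size - 1) // page_size

print(limit_offset(1, 10))   # offset for the first page
print(limit_offset(3, 10))   # offset for the third page
print(page_count(207, 25))   # 8 full pages plus one partial page
```

Integer arithmetic is used for the rounding so that no floating-point division is involved.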

edited May 27 '21 at 16:27

Adil Malik


answered May 04 '09 at 02:21

Cris McLaughlin


16

This query just returns the total number of record in the table; not the number of records that match the condition. – Lawrence Barsanti May 03 '10 at 00:51
1

The total number of records is what is needed for pagination (@Lawrence). – imme Nov 20 '14 at 14:50
Oh, well, just add the `where` clause to the inner query and you get the right "total" along with the paged results (the page is selected with the `limit` clause). – Erenor Paz Mar 07 '19 at 09:41
the sub-query count(*) would require the same where clause or else it won't return the correct number of results – AKrush95 Dec 06 '19 at 14:38

score 0 · Answer 8 · answered Jul 16 '16 at 11:26

You can reuse most of the query in a subquery and set it to an identifier. For example a movie query that finds movies containing the letter 's' ordering by runtime would look like this on my site.

SELECT Movie.*, (
    SELECT Count(1) FROM Movie
        INNER JOIN MovieGenre 
        ON MovieGenre.MovieId = Movie.Id AND MovieGenre.GenreId = 11
    WHERE Title LIKE '%s%'
) AS Count FROM Movie 
    INNER JOIN MovieGenre 
    ON MovieGenre.MovieId = Movie.Id AND MovieGenre.GenreId = 11
WHERE Title LIKE '%s%' LIMIT 8;

Do note that I'm not a database expert, and am hoping someone will be able to optimize that a bit better. As it stands running it straight from the SQL command line interface they both take ~0.02 seconds on my laptop.

score -15 · Answer 9 · edited Oct 29 '12 at 11:46

-15

SELECT * 
FROM table 
WHERE some_condition 
ORDER BY RAND()
LIMIT 0, 10

edited Oct 29 '12 at 11:46

Taryn


answered Oct 29 '12 at 03:28

John

1

3

This doesn't answer the question, and an order by rand is a really bad idea. – Dan Walmsley Nov 08 '16 at 22:09
