Chapter 8 Optimization

8.2.1.19 LIMIT Query Optimization查询优化

~~If you need only a specified number of rows from a result set, use a LIMIT clause in the query, rather than fetching the whole result set and throwing away the extra data.~~如果只需要结果集中指定数量的行，请在查询中使用LIMIT子句，而不是获取整个结果集并丢弃额外的数据。

~~MySQL sometimes optimizes a query that has a LIMIT row_count clause and no HAVING clause:~~MySQL有时会优化具有LIMIT row_count子句并且无HAVING子句的查询：

~~If you select only a few rows with LIMIT, MySQL uses indexes in some cases when normally it would prefer to do a full table scan.~~如果您利用LIMIT只选择了几行，MySQL在某些情况下会使用索引，而通常它更愿意执行完整表扫描。
~~If you combine LIMIT row_count with ORDER BY, MySQL stops sorting as soon as it has found the first row_count rows of the sorted result, rather than sorting the entire result.~~ 如果将LIMIT row_count与ORDER BY结合使用，MySQL会在找到排序结果的最前面的row_count个行后立即停止排序，而不是对整个结果进行排序。~~If ordering is done by using an index, this is very fast.~~ 如果使用索引进行排序，则速度非常快。~~If a filesort must be done, all rows that match the query without the LIMIT clause are selected, and most or all of them are sorted, before the first row_count are found.~~ 如果必须进行文件排序，则在找到最前面的row_count个行之前，将选择与查询匹配但没有LIMIT子句的所有行，并对其中的大部分或全部进行排序。~~After the initial rows have been found, MySQL does not sort any remainder of the result set.~~在找到初始行之后，MySQL不会对结果集的任何剩余部分进行排序。

~~One manifestation of this behavior is that an ORDER BY query with and without LIMIT may return rows in different order, as described later in this section.~~这种行为的一种表现形式是，有LIMIT和无LIMIT的ORDER BY查询可能以不同的顺序返回行，如本节后面所述。
~~If you combine LIMIT row_count with DISTINCT, MySQL stops as soon as it finds row_count unique rows.~~如果将LIMIT row_count与DISTINCT结合起来，MySQL会在找到row_count个唯一行时立即停止。
~~In some cases, a GROUP BY can be resolved by reading the index in order (or doing a sort on the index), then calculating summaries until the index value changes.~~ 在某些情况下，可以通过按顺序读取索引（或对索引进行排序），然后计算摘要，直到索引值发生变化来解析GROUP BY。~~In this case, LIMIT row_count does not calculate any unnecessary GROUP BY values.~~在这种情况下，LIMIT row_count不会计算任何不必要的GROUP BY值。
~~As soon as MySQL has sent the required number of rows to the client, it aborts the query unless you are using SQL_CALC_FOUND_ROWS.~~ 一旦MySQL向客户端发送了所需数量的行，它就会中止查询，除非您使用的是SQL_CALC_FOUND_ROWS。~~In that case, the number of rows can be retrieved with SELECT FOUND_ROWS().~~ 在这种情况下，可以使用SELECT FOUND_ROWS()检索行数。~~See Section 12.16, “Information Functions”.~~请参阅第12.16节，“信息功能”。
~~LIMIT 0 quickly returns an empty set.~~ LIMIT 0快速返回一个空集。~~This can be useful for checking the validity of a query.~~ 这对于检查查询的有效性非常有用。~~It can also be employed to obtain the types of the result columns within applications that use a MySQL API that makes result set metadata available.~~ 它还可以用于获取应用程序中的结果列的类型，这些应用程序使用MySQL API使结果集元数据可用。~~With the mysql client program, you can use the --column-type-info option to display result column types.~~使用mysql客户端程序，可以使用--column-type-info选项显示结果列类型。
~~If the server uses temporary tables to resolve a query, it uses the LIMIT row_count clause to calculate how much space is required.~~如果服务器使用临时表来解析查询，它将使用LIMIT row_count子句来计算需要多少空间。
~~If an index is not used for ORDER BY but a LIMIT clause is also present, the optimizer may be able to avoid using a merge file and sort the rows in memory using an in-memory filesort operation.~~如果索引未用于ORDER BY，但也存在LIMIT子句，则优化器可以避免使用合并文件，并使用内存中的filesort操作对内存中的行进行排序。

~~If multiple rows have identical values in the ORDER BY columns, the server is free to return those rows in any order, and may do so differently depending on the overall execution plan.~~ 如果多行在ORDER BY列中具有相同的值，则服务器可以自由地以任何顺序返回这些行，并且根据总体执行计划的不同，返回的顺序也可能不同。~~In other words, the sort order of those rows is nondeterministic with respect to the nonordered columns.~~换句话说，这些行的排序顺序相对于非排序列是不确定的。

~~One factor that affects the execution plan is LIMIT, so an ORDER BY query with and without LIMIT may return rows in different orders.~~ 影响执行计划的一个因素是LIMIT，因此有LIMIT和无LIMIT的ORDER BY查询可能会以不同的顺序返回行。~~Consider this query, which is sorted by the category column but nondeterministic with respect to the id and rating columns:~~考虑此查询，该查询由category列排序，但相对于id和rating列不确定性：

mysql> SELECT * FROM ratings ORDER BY category;
+----+----------+--------+
| id | category | rating |
+----+----------+--------+
|  1 |        1 |    4.5 |
|  5 |        1 |    3.2 |
|  3 |        2 |    3.7 |
|  4 |        2 |    3.5 |
|  6 |        2 |    3.5 |
|  2 |        3 |    5.0 |
|  7 |        3 |    2.7 |
+----+----------+--------+

~~Including LIMIT may affect order of rows within each category value.~~ 包含LIMIT可能会影响每个category值中的行顺序。~~For example, this is a valid query result:~~例如，这是一个有效的查询结果：

mysql> SELECT * FROM ratings ORDER BY category LIMIT 5;
+----+----------+--------+
| id | category | rating |
+----+----------+--------+
|  1 |        1 |    4.5 |
|  5 |        1 |    3.2 |
|  4 |        2 |    3.5 |
|  3 |        2 |    3.7 |
|  6 |        2 |    3.5 |
+----+----------+--------+

~~In each case, the rows are sorted by the ORDER BY column, which is all that is required by the SQL standard.~~在每种情况下，行都是按照ORDER BY列排序的，这是SQL标准所要求的全部内容。

~~If it is important to ensure the same row order with and without LIMIT, include additional columns in the ORDER BY clause to make the order deterministic.~~ 如果重要的是确保具有LIMIT和不具有LIMIT的行顺序相同，请在ORDER BY子句中包含其他列，以使顺序具有确定性。~~For example, if id values are unique, you can make rows for a given category value appear in id order by sorting like this:~~例如，如果id值是唯一的，则可以通过如下排序使给定category值的行按id顺序显示：

mysql> SELECT * FROM ratings ORDER BY category, id;
+----+----------+--------+
| id | category | rating |
+----+----------+--------+
|  1 |        1 |    4.5 |
|  5 |        1 |    3.2 |
|  3 |        2 |    3.7 |
|  4 |        2 |    3.5 |
|  6 |        2 |    3.5 |
|  2 |        3 |    5.0 |
|  7 |        3 |    2.7 |
+----+----------+--------+

mysql> SELECT * FROM ratings ORDER BY category, id LIMIT 5;
+----+----------+--------+
| id | category | rating |
+----+----------+--------+
|  1 |        1 |    4.5 |
|  5 |        1 |    3.2 |
|  3 |        2 |    3.7 |
|  4 |        2 |    3.5 |
|  6 |        2 |    3.5 |
+----+----------+--------+

~~For a query with an ORDER BY or GROUP BY and a LIMIT clause, the optimizer tries to choose an ordered index by default when it appears doing so would speed up query execution.~~ 对于带有ORDER BY或GROUP BY和LIMIT子句的查询，优化器会在默认情况下尝试选择一个有序索引，因为这样做会加快查询的执行速度。~~Prior to MySQL 8.0.21, there was no way to override this behavior, even in cases where using some other optimization might be faster.~~ 在MySQL 8.0.21之前，即使在使用其他优化可能更快的情况下，也无法覆盖此行为。~~Beginning with MySQL 8.0.21, it is possible to turn off this optimization by setting the optimizer_switch system variable's prefer_ordering_index flag to off.~~从MySQL 8.0.21开始，可以通过将optimizer_switch系统变量的prefer_ordering_index标志设置为off来关闭此优化。

Example: First we create and populate a table t as shown here:示例：首先，我们创建并填充一个表t，如下所示：

# Create and populate a table t:

mysql> CREATE TABLE t (
    ->     id1 BIGINT NOT NULL,
    ->     id2 BIGINT NOT NULL,
    ->     c1 VARCHAR(50) NOT NULL,
    ->     c2 VARCHAR(50) NOT NULL,
    ->  PRIMARY KEY (id1),
    ->  INDEX i (id2, c1)
    -> );

# [Insert some rows into table t - not shown]

~~Verify that the prefer_ordering_index flag is enabled:~~验证是否已启用prefer_ordering_index标志：

mysql> SELECT @@optimizer_switch LIKE '%prefer_ordering_index=on%';
+------------------------------------------------------+
| @@optimizer_switch LIKE '%prefer_ordering_index=on%' |
+------------------------------------------------------+
|                                                    1 |
+------------------------------------------------------+

~~Since the following query has a LIMIT clause, we expect it to use an ordered index, if possible.~~ 因为下面的查询有一个LIMIT子句，如果可能的话，我们希望它使用一个有序索引。~~In this case, as we can see from the EXPLAIN output, it uses the table's primary key.~~在本例中，正如我们从EXPLAIN输出中看到的，它使用表的主键。

mysql> EXPLAIN SELECT c2 FROM t
    ->     WHERE id2 > 3
    ->     ORDER BY id1 ASC LIMIT 2\G
*************************** 1. row ***************************
           id: 1
  select_type: SIMPLE
        table: t
   partitions: NULL
         type: index
possible_keys: i
          key: PRIMARY
      key_len: 8
          ref: NULL
         rows: 2
     filtered: 70.00
        Extra: Using where

~~Now we disable the prefer_ordering_index flag, and re-run the same query; this time it uses the index i (which includes the id2 column used in the WHERE clause), and a filesort:~~现在我们禁用preference_ordering_index标志，并重新运行相同的查询；这次它使用索引i（包括WHERE子句中使用的id2列）和filesort：

mysql> SET optimizer_switch = "prefer_ordering_index=off";

mysql> EXPLAIN SELECT c2 FROM t
    ->     WHERE id2 > 3
    ->     ORDER BY id1 ASC LIMIT 2\G
*************************** 1. row ***************************
           id: 1
  select_type: SIMPLE
        table: t
   partitions: NULL
         type: range
possible_keys: i
          key: i
      key_len: 8
          ref: NULL
         rows: 14
     filtered: 100.00
        Extra: Using index condition; Using filesort