The diskpagebuffer
table provides statistics about disk page buffer usage by NDB Cluster Disk Data tables.
The diskpagebuffer
table contains the following columns:
node_id
The data node ID
block_instance
Block instance
pages_written
Number of pages written to disk.
pages_written_lcp
Number of pages written by local checkpoints.
pages_read
Number of pages read from disk
log_waits
Number of page writes waiting for log to be written to disk
page_requests_direct_return
Number of requests for pages that were available in buffer
page_requests_wait_queue
Number of requests that had to wait for pages to become available in buffer
page_requests_wait_io
Number of requests that had to be read from pages on disk (pages were unavailable in buffer)
You can use this table with NDB Cluster Disk Data tables to determine whether DiskPageBufferMemory
is sufficiently large to allow data to be read from the buffer rather from disk; minimizing disk seeks can help improve performance of such tables.
You can determine the proportion of reads from DiskPageBufferMemory
to the total number of reads using a query such as this one, which obtains this ratio as a percentage:
SELECT node_id, 100 * page_requests_direct_return / (page_requests_direct_return + page_requests_wait_io) AS hit_ratio FROM ndbinfo.diskpagebuffer;
The result from this query should be similar to what is shown here, with one row for each data node in the cluster (in this example, the cluster has 4 data nodes):
+---------+-----------+ | node_id | hit_ratio | +---------+-----------+ | 5 | 97.6744 | | 6 | 97.6879 | | 7 | 98.1776 | | 8 | 98.1343 | +---------+-----------+ 4 rows in set (0.00 sec)
hit_ratio
values approaching 100% indicate that only a very small number of reads are being made from disk rather than from the buffer, which means that Disk Data read performance is approaching an optimum level. If any of these values are less than 95%, this is a strong indicator that the setting for DiskPageBufferMemory
needs to be increased in the config.ini
file.
A change in DiskPageBufferMemory
requires a rolling restart of all of the cluster's data nodes before it takes effect.
block_instance
refers to an instance of a kernel block. Together with the block name, this number can be used to look up a given instance in the threadblocks
table. Using this information, you can obtain information about disk page buffer metrics relating to individual threads; an example query using LIMIT 1
to limit the output to a single thread is shown here:
mysql>SELECT
>node_id, thr_no, block_name, thread_name, pages_written,
>pages_written_lcp, pages_read, log_waits,
>page_requests_direct_return, page_requests_wait_queue,
>page_requests_wait_io
>FROM ndbinfo.diskpagebuffer
>INNER JOIN ndbinfo.threadblocks USING (node_id, block_instance)
>INNER JOIN ndbinfo.threads USING (node_id, thr_no)
>WHERE block_name = 'PGMAN' LIMIT 1\G
*************************** 1. row *************************** node_id: 1 thr_no: 1 block_name: PGMAN thread_name: rep pages_written: 0 pages_written_lcp: 0 pages_read: 1 log_waits: 0 page_requests_direct_return: 4 page_requests_wait_queue: 0 page_requests_wait_io: 1 1 row in set (0.01 sec)