fio磁盘测试工具使用笔记

发布于:2025-04-03 ⋅ 阅读:(13) ⋅ 点赞:(0)

本文介绍磁盘性能测试工具fio在某国产操作系统(内核4.19,gcc为7.3.0)上的编译和使用。

背景

某项目使用物理机安装某数据库,相关人员提到磁盘性能方面的要求,用fio测试32k的随机读写,性能要达到1万 IOPS。本文即进行此工具测试。

fio是一款强大的磁盘 I/O 性能测试工具,用于评估存储设备(如硬盘、SSD、NVMe)的 吞吐量(BW)、IOPS(每秒 I/O 操作数)和延迟(Latency)。它支持多种 I/O 引擎和工作模式,可模拟真实业务场景的压力测试。

主要特点如下:

  • 多引擎支持:libaio(Linux 异步 I/O)、io_uring(高性能)、psync(同步)、mmap(内存映射)等。
  • 灵活配置:可自定义 块大小(block size)、队列深度(iodepth)、读写比例(rwmixread) 等参数。
  • 多线程/多任务:通过 numjobs 模拟并发负载。
  • 跨平台:支持 Linux、Windows、macOS。

安装

下载:https://github.com/axboe/fio/releases 。文件:fio-3.39.tar.gz

在线安装依赖库:

sudo yum install libaio libaio-devel -y

编译:

./configure
make
sudo make install

验证 libaio 支持:

# ./fio --enghelp | grep libaio
        libaio

测试

32k随机读写测试命令,使用libaio库,命令如下:

fio --name=randrw_test --ioengine=libaio --iodepth=32 --rw=randrw --rwmixread=50 --bs=32k --direct=1 --size=4G --numjobs=4 --runtime=60 --time_based --group_reporting

关键参数说明

  • –iodepth=32:设置I/O队列深度为32,这对SSD/NVMe设备很重要
  • –numjobs=4:使用4个并行线程/进程来产生负载
  • –direct=1:绕过系统缓存,直接I/O
  • –size=4G:每个线程测试4GB数据
  • –runtime=60:测试运行60秒

测试结果的主要指标如下:

IOPS=423, BW=13.2MiB/s

从结果看,测试服务器硬盘性能一般般。详细结果如下:

# fio --name=randrw_test --ioengine=libaio --iodepth=32 --rw=randrw --rwmixread=50 --bs=32k --direct=1 --size=4G --numjobs=4 --runtime=60 --time_based --group_reporting
randrw_test: (g=0): rw=randrw, bs=(R) 32.0KiB-32.0KiB, (W) 32.0KiB-32.0KiB, (T) 32.0KiB-32.0KiB, ioengine=libaio, iodepth=32
...
fio-3.39
Starting 4 processes
randrw_test: Laying out IO file (1 file / 4096MiB)
randrw_test: Laying out IO file (1 file / 4096MiB)
randrw_test: Laying out IO file (1 file / 4096MiB)
randrw_test: Laying out IO file (1 file / 4096MiB)
Jobs: 4 (f=4): [m(4)][5.0%][r=18.0MiB/s,w=18.8MiB/s][r=576,w=603 IOPS][eta 00m:5Jobs: 4 (f=4): [m(4)][6.7%][r=17.8MiB/s,w=19.2MiB/s][r=571,w=614 IOPS][eta 00m:5Jobs: 4 (f=4): [m(4)][8.3%][r=17.1MiB/s,w=16.9MiB/s][r=547,w=542 IOPS][eta 00m:5Jobs: 4 
。。。
(f=4): [m(4)][100.0%][r=6048KiB/s,w=6688KiB/s][r=189,w=209 IOPS][eta 00mJobs: 4 (f=4): [m(4)][9.6%][r=7776KiB/s,w=5760KiB/s][r=243,w=180 IOPS][eta 09m:3Jobs: 2 (f=2): [m(2),_(2)][34.4%][r=1069KiB/s,w=855KiB/s][r=33,w=26 IOPS][eta 01m:58s]
randrw_test: (groupid=0, jobs=4): err= 0: pid=1048845: Thu Mar 27 16:53:20 2025
  read: IOPS=423, BW=13.2MiB/s (13.9MB/s)(821MiB/62024msec)
    slat (nsec): min=4456, max=96903, avg=14923.19, stdev=5994.18
    clat (usec): min=30, max=3032.4k, avg=230270.66, stdev=238818.59
     lat (usec): min=37, max=3032.4k, avg=230285.58, stdev=238818.97
    clat percentiles (msec):
     |  1.00th=[    3],  5.00th=[   12], 10.00th=[   26], 20.00th=[   58],
     | 30.00th=[   96], 40.00th=[  134], 50.00th=[  169], 60.00th=[  209],
     | 70.00th=[  266], 80.00th=[  347], 90.00th=[  498], 95.00th=[  659],
     | 99.00th=[ 1150], 99.50th=[ 1469], 99.90th=[ 2106], 99.95th=[ 2165],
     | 99.99th=[ 2601]
   bw (  KiB/s): min= 1856, max=46208, per=100.00%, avg=13948.19, stdev=2178.00, samples=481
   iops        : min=   58, max= 1444, avg=435.88, stdev=68.06, samples=481
  write: IOPS=429, BW=13.4MiB/s (14.1MB/s)(832MiB/62024msec); 0 zone resets
    slat (usec): min=4, max=110, avg=15.08, stdev= 7.04
    clat (usec): min=30, max=2336.7k, avg=66911.12, stdev=126808.10
     lat (usec): min=37, max=2336.8k, avg=66926.21, stdev=126809.08
    clat percentiles (usec):
     |  1.00th=[     35],  5.00th=[     38], 10.00th=[     40],
     | 20.00th=[     42], 30.00th=[     45], 40.00th=[     48],
     | 50.00th=[     63], 60.00th=[  41157], 70.00th=[  79168],
     | 80.00th=[ 121111], 90.00th=[ 183501], 95.00th=[ 270533],
     | 99.00th=[ 522191], 99.50th=[ 633340], 99.90th=[1736442],
     | 99.95th=[1920992], 99.99th=[2231370]
   bw (  KiB/s): min= 1344, max=48128, per=100.00%, avg=14177.60, stdev=2354.39, samples=480
   iops        : min=   42, max= 1504, avg=443.05, stdev=73.57, samples=480
  lat (usec)   : 50=22.30%, 100=4.20%, 250=0.19%, 500=0.13%, 750=0.11%
  lat (usec)   : 1000=0.07%
  lat (msec)   : 2=0.18%, 4=0.49%, 10=1.89%, 20=2.60%, 50=7.73%
  lat (msec)   : 100=13.39%, 250=27.73%, 500=13.43%, 750=3.72%, 1000=1.02%
  lat (msec)   : 2000=0.72%, >=2000=0.11%
  cpu          : usr=0.10%, sys=0.35%, ctx=50323, majf=0, minf=54
  IO depths    : 1=0.1%, 2=0.1%, 4=0.1%, 8=0.1%, 16=0.1%, 32=99.8%, >=64=0.0%
     submit    : 0=0.0%, 4=100.0%, 8=0.0%, 16=0.0%, 32=0.0%, 64=0.0%, >=64=0.0%
     complete  : 0=0.0%, 4=100.0%, 8=0.0%, 16=0.0%, 32=0.1%, 64=0.0%, >=64=0.0%
     issued rwts: total=26277,26637,0,0 short=0,0,0,0 dropped=0,0,0,0
     latency   : target=0, window=0, percentile=100.00%, depth=32

Run status group 0 (all jobs):
   READ: bw=13.2MiB/s (13.9MB/s), 13.2MiB/s-13.2MiB/s (13.9MB/s-13.9MB/s), io=821MiB (861MB), run=62024-62024msec
  WRITE: bw=13.4MiB/s (14.1MB/s), 13.4MiB/s-13.4MiB/s (14.1MB/s-14.1MB/s), io=832MiB (873MB), run=62024-62024msec

Disk stats (read/write):
    dm-0: ios=26277/27963, sectors=1681728/1722759, merge=0/0, ticks=5988971/1844511, in_queue=0, util=98.92%, aggrios=26277/27885, aggsectors=1681728/1722759, aggrmerge=0/78, aggrticks=5991165/1774213, aggrin_queue=7745112, aggrutil=99.33%
  sda: ios=26277/27885, sectors=1681728/1722759, merge=0/78, ticks=5991165/1774213, in_queue=7745112, util=99.33%

其它说明

可以进行以下几种类型的测试:

  • 随机读写:–rw=randrw
  • 随机读:–rw=randread
  • 随机写:–rw=randwrite

其它事项:

  • 对于NVMe SSD,可以使用--ioengine=io_uring以获得更好性能。
  • 如果系统核数较多,则可以增加线程数numjobs,发挥最大性能。

小结

目前还没有服务器测试,后续找机会再试试。

在centos7无法编译信息

fio新版本对GCC有要求,如版本过低,则无法编译,如下:

# ./configure 
Operating system              Linux
CPU                           x86_64
Big endian                    no
Compiler                      gcc
Cross compile                 no

Static build                  no

Your compiler doesn't support C11 atomics. gcc 4.9/clang 3.6 are the
minimum versions with it - perhaps your compiler is too old?
C11 atomics support not found
Configure failed, check config.log and/or the above output

网站公告

今日签到

点亮在社区的每一天
去签到