OSD latency problems (VMware stuck)
trexman
60 Posts
October 23, 2019, 11:02 am
"after reboot you see normal OSD latency, but increase with time?"
Yes. This is what we have seen since the last 3 reboots.
"% utilization also increases with time?"
I would say no.
Here are some graphs with IOPS, OSD latency, and disk utilization for the last:
6h: https://imgur.com/Go5jTZ9
24h: https://imgur.com/x0glmJ5 (the reboot was yesterday at around 21:00)
week: https://imgur.com/FjXJQWx
therm
121 Posts
October 23, 2019, 11:14 am
Hi trexman,
you are using consumer SSDs. We did too (Samsung 850/860 Pro). Those SSDs collapse once a certain amount of write load is reached (because their internal cache fills up). We ended up with a setup using Intel 2TB DC P4600 drives as journal devices with filestore and 4TB HGST disks. We use an all-flash pool (on those P4600s) only for the data that really needs it, which keeps it affordable.
Regards,
Dennis
admin
2,930 Posts
October 23, 2019, 11:22 am
You could be experiencing this:
https://tracker.ceph.com/issues/38738
http://lists.ceph.com/pipermail/ceph-users-ceph.com/2019-March/033476.html
The default allocator in Luminous and Mimic (the "stupid" allocator) can show this behavior in some cases. The newer bitmap allocator from Nautilus was backported to Luminous 12.2.12, which is what PetaSAN v2.3.0 and v2.3.1 (before the Nautilus upgrade) ship. If you are using 12.2.12, switch to this allocator in the conf file, in the [osd] section:
[osd]
bluestore_allocator = bitmap
bluefs_allocator = bitmap
Then restart all OSDs on the node:
systemctl restart ceph-osd.target
I suggest you do this on one node first and observe it over time. You can check whether it was applied via:
ceph daemon osd.X config show --cluster XX | grep allocator
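As a minimal sketch (assuming the OSD admin sockets on the node live under /var/run/ceph/ and match *osd*.asok; adjust the path to your cluster), you could check every OSD on the node in one go:
# loop over the OSD admin sockets on this node and print the active allocators
for sock in /var/run/ceph/*osd*.asok; do
    echo "== $sock =="
    ceph --admin-daemon "$sock" config show | grep allocator
done
After the change, both bluestore_allocator and bluefs_allocator should report bitmap.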
trexman
60 Posts
October 25, 2019, 6:03 pm
@therm:
Thanks for sharing your experience. We will have to think about this.
@admin:
Thanks so much for the research.
If I understand this correctly, an upgrade of our cluster from PetaSAN 2.3.0 to 2.3.1 (Ceph Nautilus) would also change the allocator to bitmap, right?
And I can check it with the last command.
Thank you
admin
2,930 Posts
October 25, 2019, 10:07 pm
Yes, many consumer-grade SSDs will cause issues.
If you upgrade to 2.3.1 and run the Nautilus upgrade steps, yes, it will be built in; you do not need the --cluster XX in the commands with 2.3.1. If you want to test the new allocator with 2.3.0, you can use the steps posted earlier.
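For illustration (osd.0 is just an example id), the check after the Nautilus upgrade would simply be:
ceph daemon osd.0 config show | grep allocator
and both bluestore_allocator and bluefs_allocator should come back as bitmap.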
trexman
60 Posts
November 19, 2019, 1:29 pm
Just as a short summary:
After the PetaSAN upgrade to v2.3.1 with the bitmap allocator, our problem has not occurred again, as you can see here:
Thanks for your great help!
admin
2,930 Posts
November 20, 2019, 9:39 pm
Excellent, thanks for posting.
mfrater
1 Post
November 25, 2019, 2:07 am
We had the same issue recently. HDD commit latency was fine; SSD commit latency was awful. Latency increased over a number of hours to > 100-200 ms.
We have (only) 3 SSDs split across 3 hosts. We changed bluestore_allocator to "bitmap" on each host, one at a time, as per the process above. Worked a treat. Here are the results.