dbus-broker caused the system OOM issue.
Sui Chen
suichen6 at gmail.com
Thu Feb 4 04:24:46 AEDT 2021
On Mon, Feb 1, 2021 at 5:40 PM Fran Hsu (徐誌謙) <Fran.Hsu at quantatw.com> wrote:
>
> Hello,
> Sometimes my BMC will run out of memory so that I try to find out which process caused the OOM issue.
> From the htop information, it looks like the dbus-broker uses huge memory space.
> What I'm doing is to send the ipmi sdr list command from Host OS thru the kcs interface every 2 seconds.
> In the begging, I could see the dbus-broker uses up to 400MB and then reduced to less than 100MB.
> But after some hours I could see the memory usage keep increasing and the dbus-broker caused system OOM eventually.
>
> 1 [|||||||||||||||||||||||||||||||| 67.1%] Tasks: 104, 20 thr; 2 running
> 2 [|||||||||||||||||||||||||||||| 60.4%] Load average: 3.12 3.34 3.54
> Mem[|||||||||||||||||||||||||||||||||||||969M/1003M] Uptime: 12:59:12
> Swp[ 0K/0K]
>
> PID USER PRI NI VIRT RES SHR S CPU% MEM% TIME+ Command
> 281 messagebu 20 0 870M 868M 376 S 0.6 86.6 40:54.32 dbus-broker --log 4 --controller 9 --machine-id 932c
> 265 messagebu 20 0 259M 284 4 S 0.0 0.0 0:00.34 /usr/bin/dbus-broker-launch --scope system --audit
>
> Version:
> dbus-broker :25-r0
>
> I'd appreciate any comments or feedback.
>
> Thanks,
> Fran Hsu
> E-Mail : Fran.Hsu at QuantaTW.com
> Tel: +886-3-327-2345 Ext: 16935
>
>
Hello Fran,
I would do a DBus message capture and use it to examine: 1) what
method calls, signals and errors happened on the system, and 2)
whether there are method calls that take too long to complete, or
never return due to some errors.
The capture may be done using a command like "busctl capture >
dbus_capture.pcap"; the command monitors and dumps the DBus messages
to the output file similarly to Wireshark's capture mode. After
transferring the output file from the BMC to the host, dbus-pcap
(https://github.com/openbmc/openbmc-tools/tree/master/dbus-pcap) or a
work-in-progress visual tool
(https://gerrit.openbmc-project.xyz/c/openbmc/openbmc-tools/+/34263)
may be used to examine it to see if there are method calls that take a
very long time to complete or not return at all.
Hope this helps and thanks,
Sui
More information about the openbmc
mailing list