Out-of-band NIC management

Ratan Gupta ratagupt at linux.vnet.ibm.com
Wed Jul 17 14:27:09 AEST 2019


Hi Joel,

We had ported the similiar ncsi-netlink utility to openbmc under 
phosphor-networkd.

https://github.com/openbmc/phosphor-networkd/blob/master/ncsi_util.hpp

Ben,

we can extend the same.

Regards
Ratan Gupta



On 17/07/19 8:40 AM, Joel Stanley wrote:
> On Tue, 16 Jul 2019 at 21:46, Ben Wei <benwei at fb.com> wrote:
>> Hi all,
>>
>> Would anyone be interested in collaborating on out-of-band NIC management and monitoring?
>>
>> DMTF has as a NCSI spec (https://www.dmtf.org/sites/default/files/standards/documents/DSP0222_1.1.0.pdf), that defines a standard interface for BMCs to manage NICs.
>> And in kernel 5.x , NC-SI driver supports Netlink interface for communicating with userspace processes.
>>
>> I'm thinking adding the following tools to OpenBMC as a starting point and build form there:
>>
>>        1. A command line utility (e.g. ncsi-util) to send raw NC-SI commands, useful for debugging and initial NIC bring up,
>>        For example:
>>            ncsi-util -eth0 -ch 0 <raw NC-SI command>
> The NCSI kernel maintainer, Sam, has written a tool that fits this descirption:
>
>   https://github.com/sammj/ncsi-netlink
>
>>        We can further extend this command line tool to support other management interfaces, e.g sending MCTP or PLDM commands to NIC.
> I have added Jeremy to cc, who has been doing some MCTP related work recently.
>
>>        2. A daemon running on OpenBMC (e.g ncsid) monitoring NIC status,  for example:
>>            a. Query and log NIC capability and current parameter setting
>>            b. Periodically check NIC link status, re-initialize NC-SI link if NIC is unreachable, log the status
>>            c. Enable and monitor NIC Asynchronous Event Notifications (AENs)
>>                  i. such as  Link Status Change, Configuration required, Host driver status change
>>                  ii. there are OEM-specific AENs that BMC may also enable and monitor
>>                  iii. either log these events, and/or performs recovery and remediation as needed
>>            d. Additional monitoring such as
>>                  i.  temperature (not in standard NC-SI command yet),
>>                  ii. firmware version, update event, network traffic statistics
>>
>> Both the CLI tool and the monitoring daemon can either communicate to kernel driver directly via Netlink independently, or we can have the ncsi daemon acting as command serializer to kernel and other user space processes.
>> These are just some of my initial thoughts and I'd love to hear some feedback if these would be useful to OpenBMC.
>>
>> If anyone in interested in collaborate on these we can discuss more on features and design details.
>>
>> Regards,
>> -Ben



More information about the openbmc mailing list