Online Instance with OFED Driver Kernel Replacement Solution

      • Reinstall OS
      • Share a custom image
      • Windows Image Creation
    • Instance
      • Batch Sequential Naming
      • Change Availability Zone
      • Change Charge Type
        • Postpay to Subscription
        • Subscription to Postpay
      • Change Instance Configuration
      • Create instance
        • Create Instance Navigation
        • Create Instance via Custom Image
        • Create Instance via Purchase Page
        • Create Instances with Launch Templates
        • Purchase Instance with Same Configuration
      • Cross-AZ migration
      • Functional restrictions
      • Instance Overview
      • Modify instance name
      • Postpay Instance No Charge for Stopped Instances
      • Reboot an instance
      • Reclaim instance
      • Release an instance
      • Renew an instance
      • Reset password
      • Set IAM Role
      • Sign in to instance
        • Log in to Linux Instance
          • Connect to Baidu Cloud Compute Using SmartTerm
          • Connect to Linux instance using user name-password certification
          • Log in to Linux Instance Using SSH Key on Linux or Mac
          • Log in to Linux Instance Using SSH Key via SecureCRT
          • Remotely sign in to Baidu Cloud Computer via VNC
          • Sign in to Linux instances via SSH key in Putty
          • Sign in to Linux instances via SSH key in Xshell
        • Log in to Windows Instance
          • Remotely sign in to Baidu Cloud Computer via VNC
          • Sign in to Windows instance via remote desktop
        • Login Method Overview
      • Spot instances
        • Create spot instances
        • What Is Spot Instance
      • Start an instance
      • Stop an instance
        • Elastic Baremetal Instance and Elastic High-performance Computing Cluster Instance Halt Capability Configuration Instructions
        • Stop an instance
      • View instance
    • Launch Template
    • Network
      • Bind and Unbind EIP
      • Change Internal IP
      • Change subnet
      • Change VPC
      • Configure Dual Network Interface Card External Network Access for Windows Baidu Cloud Compute
      • Configure IPv6 Address
      • Elastic network interface
        • Configure elastic network interface
        • Create elastic network interface
        • Elastic Network Interface Overview
        • Mount elastic network interface
      • Elastic RDMA Network Interface
        • Elastic RDMA Network Interface Overview
        • Instructions for use
    • Operations and Monitor
      • Cloud Assistant
        • Cloud Assistant Overview
        • Cloud Assistant Usage Notes
        • Create command
        • FAQs
        • Historical records
        • Identity and access management
        • Instance list
        • My commands
        • Public commands
        • Upload files
      • Evacuate Faulty Instance
      • Instance Health Check
      • Maintenance Platform
        • Maintenance Platform Access Instructions
        • Maintenance Platform Overview
        • Maintenance Records Query
        • Maintenance Task
        • Pre-authorization Rule Settings
      • Monitor and Alarm
      • Self-Service Diagnostic Tool
        • Application Performance Diagnostic Tool Btune
        • Self-Service Diagnostic Tool Overview
      • Use Rescue Mode
    • Reserved Instance
      • FAQ
      • Introduction to reserved instance
      • Renew reserved instances
      • Reserved Instance AZ Change
      • Reserved Instance Cross-Account Transfer
      • Reserved Instance EHC Cluster Change
      • Reserved Instance Purchase
    • Resource Assessment
    • Security
      • Identity and access management
        • Identity and access management
        • Price Permission Control
      • Key pair
        • Bind SSH Key Pair
        • Create SSH Key Pair
        • Delete SSH Key Pair
        • SSH Key Pair Overview
        • Unbind SSH Key Pair
      • Security group
        • Associate with security group
        • Copy security group
        • Create a security group
        • Delete a security group
        • Disassociate security group
        • Edit security group
        • Security group overview
    • Server Migration
      • Migration Operation Steps
      • Server Migration Operation Process
      • Server Migration Overview
    • Snapshot
      • Automatic snapshot
      • Create disk snapshot
      • Delete a snapshot
      • Rollback snapshot
      • Snapshot Overview
    • Storage
      • Overview
      • System disk expansion
    • Tag Management
      • Configure Tag
      • Tag Management Overview
      • View Resource Bill Based on Tag
  • Operations Reference
    • Linux Operations Reference
      • Baidu Cloud Compute Delete or Modify File Prompting File System Read-Only
      • CentOS 7 Install Docker
      • Configure Linux Analysis Tools atop and kdump
      • Enter Single User Mode
        • CentOS 6 Enter Single User
        • CentOS 7 Enter Single User
        • CentOS 8 Enter Single User
        • FreeBSD 11 Single User Modify Password SOP
        • opensuse42
          • 3 Enter Single User
        • Ubuntu Enter Single User
        • OpenSUSE 42_3 Enter Single User
      • How to Enable or Disable Intel CPU Vulnerability Patch in Linux Image
      • How to Handle System Boot Exception Caused by fstab Configuration
      • How to Resolve Unable to Log in to Linux BCC Instance via SSH
      • Linux BCC Instance Insufficient Space Issue Resolution
      • Linux BCC Instance Lagging Issue
      • Linux Format and Mount Instance Mount Local Disk
      • Linux Modify Default Remote Connection Port
      • Linux Server CPU and Memory Full Issue Troubleshooting
      • Linux System Install Graphical Interface and Remote Connection
        • CentOS
          • CentOS 8 Install Graphical Interface and Connect Using VNC Viewer
          • CentOS6
            • 5 Install KDE Desktop Graphical Interface and Connect Using VNC Viewer
          • CentOS7
            • 2 Install GNOME Graphical Interface and Connect Using VNC Viewer
          • Install GNOME GUI on CentOS 7_2 and Connect via VNC Viewer
          • Install GUI on CentOS 8 and Connect via VNC Viewer
          • Install KDE Desktop GUI on CentOS 6_5 and Connect via VNC Viewer
        • opensuse
          • opensuse42
            • 3 Install Graphical Interface and Connect Using VNC Viewer
          • Install GUI on OpenSUSE 42_3 and Connect via VNC Viewer
        • Ubuntu
          • Ubuntu14
            • 04 Install Graphical Interface and Connect Using VNC Viewer
          • Ubuntu18
            • 04 Install Graphical Interface and Connect Using VNC Viewer
          • Install GUI on Ubuntu 14_04 and Connect via VNC Viewer
          • Install GUI on Ubuntu 18_04 and Connect via VNC Viewer
      • Performance test
        • Linux CPU Performance Test
        • Linux Disk IO Performance Test
        • Linux Network Performance Test
      • Ubunut16_04 Subscription Extended Security Maintenance (ESM)
    • Server configuration
      • Debian 9 Software Source Configuration Update
      • Linux PIP Configuration with Baidu Source
      • Online Instance with OFED Driver Kernel Replacement Solution
      • Windows PIP Configuration with Baidu Source
    • Windows Operations Reference
      • Blue Screen Crash General Handling Method
      • Configure Multi-User Access to Windows Server
      • Connecting to Windows Server Using Xshell
      • Data Disk Not Displayed After Server Reboot
      • Desktop and Application Color Turning Black Handling Method
      • Local Playback of Sound from Windows Server
      • Performance Resources
      • Remote Connection to Windows Unable to Use Clipboard
      • System Activation
      • System Black Screen Handling Method
      • Tutorial on Disabling Windows Firewall
      • Windows Allow Single User to Connect Multiple Sessions Method
      • Windows automatic update-related configuration
      • Windows BCC Instance Achieve SID Uniqueness After Joining Domain
      • Windows BCC Instance Lagging Issue
      • Windows CPU and Memory Full Troubleshooting Steps
      • Windows Delete Hardware Reserved Memory
      • Windows Instance Disk Space Insufficient Cleanup Tutorial
      • Windows Modify Default Remote Connection Port
      • Windows Quick Login
      • Windows Server Disable IE Enhanced Security Configuration Tutorial
      • Windows Set Only Allow Local & Fixed IP Access Remote Port
      • Windows Virtual Machine Auto Login Failure Handling Method
  • Product Description
    • Application scenarios
    • Correspondence Between Original and New Model Names
    • Discontinued Instance Specifications
    • Models and Their Regions
    • Product advantages
    • Usage restrictions
    • What Is Baidu Cloud Compute BCC
    • Instance specification
  • Product pricing
    • Billing overview
    • Charge type
      • Postpay Pricing
      • Reserved instances
      • Subscription (Prepay) Pricing
      • Reserved Instance
    • Refund policy
  • Python-SDK
    • BccClient
    • Disk
    • Image
    • Initialization
    • Install the SDK Package
    • Instance
    • Overview
    • Region
    • Reserved Instance
    • Security group
    • Snapshot
    • Version Change Records
  • Python3-SDK
    • Automatic snapshot policy
    • BccClient
    • Dedicated CDS Cluster
    • Deployment group
    • Disk
    • Image
    • Initialization
    • Install the SDK Package
    • Instance
    • Key pair
    • Overview
    • Region
    • Reserved Instance
    • Security group
    • Snapshot
  • Quick Start
    • Baidu Cloud Compute Security Description
    • Quick Configuration of Linux Baidu Cloud Compute
    • Quick Configuration of Windows Baidu Cloud Compute
  • Resources
    • Baidu AI Cloud Image Service Agreement
    • Image Quick Guide
  • Service Level Agreement (SLA)
    • Baidu Intelligent Cloud Baidu Cloud Compute Trusted Cloud
    • BCC Service Level Agreement SLA (V3_0)
  • Tag Service API Reference
    • Appendix 1
    • Domain name
    • General Description
    • Interface List
      • Create tag
      • Delete tag
      • Tag List
      • View resources bound to a tag
    • Introduction
  • Transfer
    • Site Offline Migration Solution
    • Site Smooth Migration Solution
  • Typical Practices
    • Connecting to Windows Server Using Xshell
    • Environment Setup
      • Manual Deployment of SQL Server 2008 R2 Express Edition
      • One-Click Setup for Linux Palworld Server
      • One-Click Setup for Windows Palworld Server
      • Set Up FTP Service on Linux
      • Setting Up Baota on Linux
      • Setting Up Baota on Windows
      • Tutorial on Setting Up IIS and FTP on Windows
      • Typical Practice for Setting Up Access VPN
      • Installing MySQL 8_0 Using Yum on CentOS 7 yum
    • Set Up FTP Service on Linux
    • Tag Management Practice
    • Tutorial on Disabling IE Enhanced Security Configuration on Windows Server 2008
    • Tutorial on Disabling Windows Firewall
    • Website Building Tutorial
      • Building a Website Using IIS Service on Windows 2008 and Windows 2012
      • Deploy SSL Certificate on Baidu Cloud Compute - Apache
      • Deploy SSL Certificate on Baidu Cloud Compute - IIS
      • Deploy SSL Certificate on Baidu Cloud Compute - Nginx
      • Deploy SSL Certificate on Baidu Cloud Compute - Tomcat
      • Installing and Configuring Apache Environment on Windows
      • Installing and Configuring Nginx Environment on Windows
      • CentOS-7_2 LNMP Environment Deployment
      • Deploy LAMP Environment on CentOS-7_2
  • Video Zone
    • Fault handling
    • Operation guide
      • Basic Functions
      • Environment Setup
      • Network Related
      • Operations Reference
      • Paasword Function
    • Operation guide
    • Product introduction
  • Document center
  • arrow
  • BCCBCC
  • arrow
  • Operations Reference
  • arrow
  • Server configuration
  • arrow
  • Online Instance with OFED Driver Kernel Replacement Solution
Online Instance with OFED Driver Kernel Replacement Solution

Updated at: 2025-10-20

Overview

Some online BCC/EBC images come with pre-installed OFED drivers. Users can immediately utilize the RDMA feature after creating an instance with the RDMA network function package. However, since OFED is compiled for the current OS kernel version, switching to a different kernel version may cause the RDMA network interface card to fail. In such cases, recompiling and reinstalling the OFED driver will be necessary.

This document offers a solution for reinstalling OFED after modifying the kernel version on instances with various operating systems.
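
The mismatch described above can be detected before changing anything. The following is a minimal sketch, assuming the OFED kernel-module package follows the `mlnx-ofa_kernel-modules-...kver.<kernel>` naming shown in the verification output later in this document. The function name `ofed_built_for` is hypothetical.

```shell
#!/usr/bin/env bash
# ofed_built_for KVER: read `rpm -qa` output on stdin and succeed if an
# OFED kernel-module package built for KVER is listed.
# RPM names replace '-' in the kernel version with '_', for example
# 3.10.0-1160.90.1.el7.x86_64 -> kver.3.10.0_1160.90.1.el7.x86_64
ofed_built_for() {
    local kver_tag="kver.${1//-/_}"
    awk -v tag="${kver_tag}" '
        index($0, "mlnx-ofa_kernel-modules") == 1 && index($0, tag) > 0 { found = 1 }
        END { exit !found }
    '
}

# Usage on an instance:
#   rpm -qa | ofed_built_for "$(uname -r)" || echo "OFED must be rebuilt for this kernel"
```

The check reads the package list from standard input, so it can be tried against saved `rpm -qa` output without touching the instance.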

Operation steps

CentOS 7

For instance, consider upgrading the kernel to kernel-3.10.0-1160.90.1.el7.x86_64.

1. Install the kernel and kernel development package. Proceed to install the kernel-3.10.0-1160.90.1.el7.x86_64 version along with the related kernel development package on the instance:

Shell
yum install -y kernel-3.10.0-1160.90.1.el7.x86_64 \
                kernel-devel-3.10.0-1160.90.1.el7.x86_64 \
                kernel-headers-3.10.0-1160.90.1.el7.x86_64 \
                kernel-tools-3.10.0-1160.90.1.el7.x86_64
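
Because the same kernel version string appears in four package names here and again in the build script, it can help to derive the package list from a single value. A minimal sketch, where `kernel_pkgs` is a hypothetical helper name:

```shell
#!/usr/bin/env bash
# kernel_pkgs KVER: print the four package names used in the step above
# for kernel version KVER, one per line.
kernel_pkgs() {
    local kver="$1" prefix
    for prefix in kernel kernel-devel kernel-headers kernel-tools; do
        printf '%s-%s\n' "${prefix}" "${kver}"
    done
}

# Usage (run as root):
#   yum install -y $(kernel_pkgs 3.10.0-1160.90.1.el7.x86_64)
```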

2. Check the RDMA lossless configuration package. Verify whether the RDMA lossless configuration package, rdma-userspace-config, is installed on the instance. If so, uninstall the package and reinstall it after OFED installation. Execute the following commands:

Shell
# Check if rdma-userspace-config is installed
rpm -qa | grep -i rdma-userspace-config

If the lossless configuration package is installed, run the following command to remove it:

Shell
rpm -e rdma-userspace-config
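
The check and the removal can be combined so that the removal only runs when the package is present, and so that a note is left as a reminder to reinstall it after OFED is built. A minimal sketch; the helper name `rdma_config_pkgs` and the marker file path are hypothetical:

```shell
#!/usr/bin/env bash
# rdma_config_pkgs: read `rpm -qa` output on stdin and print every installed
# rdma-userspace-config package (case-insensitive, like the check above).
# Prints nothing and still succeeds when no such package is listed.
rdma_config_pkgs() {
    grep -i 'rdma-userspace-config' || true
}

# Usage on an instance (run as root):
#   pkgs=$(rpm -qa | rdma_config_pkgs)
#   if [ -n "$pkgs" ]; then
#       echo "$pkgs" > /root/rdma-config-removed.txt   # reminder to reinstall later
#       rpm -e $pkgs
#   fi
```

Passing the full names printed by `rpm -qa` to `rpm -e` avoids having to guess the exact package name.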

3. Download the OFED installation package. You can download it from the Baidu software repository or transfer it to the instance after downloading it from the official OFED website:

  1. Download from the Baidu software repository. Taking OFED version 5.8-2.0.3 as an example, run the following command on the instance:
Shell
wget http://mirrors.baidubce.com/mlnx-ofed/5.8-2.0.3.0/MLNX_OFED_LINUX-5.8-2.0.3.0-rhel7.9-x86_64.tgz
  2. Download from the official website. Obtain the matching version of the OFED installation package from the official OFED site.
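
When a different OFED version or distro tag is needed, the file name and mirror URL can be assembled from their parts. A minimal sketch, assuming the mirror keeps the `<base>/<version>/<tarball>` layout seen in the `wget` example above; other versions may not be hosted there. The function names are hypothetical.

```shell
#!/usr/bin/env bash
# ofed_tarball VERSION DISTRO: file name of the OFED installation package.
ofed_tarball() {
    printf 'MLNX_OFED_LINUX-%s-%s-x86_64.tgz\n' "$1" "$2"
}

# ofed_mirror_url VERSION DISTRO: download URL on the Baidu mirror.
# Assumption: the directory layout matches the example in this document.
ofed_mirror_url() {
    printf 'http://mirrors.baidubce.com/mlnx-ofed/%s/%s\n' "$1" "$(ofed_tarball "$1" "$2")"
}

# Usage:
#   wget "$(ofed_mirror_url 5.8-2.0.3.0 rhel7.9)"
```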

4. Compile and install OFED

  • Run the script below to compile and install the OFED driver.
  • The entire process of compilation and installation takes about 30 minutes.

Note: Ensure the script and the OFED installation package are placed in the same directory, and update the OFED version and kernel version in the script to match the actual versions being used.

Shell
#!/usr/bin/env bash
# Update mlnx ofed drivers
# -- Prepare mlnx ofed drivers
# -- Extract and install
mlnx_ofed_version=5.8-2.0.3.0
mlnx_ofed_drv=MLNX_OFED_LINUX-${mlnx_ofed_version}-rhel7.9-x86_64.tgz
kern_ver=3.10.0-1160.90.1.el7.x86_64
if [ ! -d /lib/modules/${kern_ver}/build ]; then
    echo "There is no kernel build directory. Please check if kernel-devel is installed ..."
    exit 1
fi
if ! which gcc >& /dev/null; then
    yum install -y gcc
fi
if ! which make >& /dev/null; then
    yum install -y make
fi
# Install build requirements
yum install -y elfutils-libelf-devel createrepo python-devel redhat-rpm-config rpm-build libtool
# Install runtime requirements for all OFED components
yum install -y tcl gcc-gfortran fuse-libs tk libnl3-devel
# Install updated mlnx_ofed packages including kernel modules and userspace packages
mkdir update_drivers
tar xf ${mlnx_ofed_drv} --strip-components 2 -C update_drivers/
cd update_drivers
./mlnxofedinstall --without-fw-update --add-kernel-support -k ${kern_ver} --skip-distro-check \
    --package-install-options "--force" --distro rhel7.9 -q
if [ $? -ne 0 ]; then
    echo "MLNX OFED driver install ... Failed."
    exit 1
fi
cd ..
# 82-net-setup-link.rules changes the NIC name, so disable this rule.
if [ -f /usr/lib/udev/rules.d/82-net-setup-link.rules ]; then
    mv /usr/lib/udev/rules.d/82-net-setup-link.rules /usr/lib/udev/rules.d/82-net-setup-link.rules.orig
fi
if [ -f /usr/lib/udev/rules.d/83-mlnx-sf-name.rules ]; then
    mv /usr/lib/udev/rules.d/83-mlnx-sf-name.rules /usr/lib/udev/rules.d/83-mlnx-sf-name.rules.orig
fi
# Disable rshim (tmfifo_net0)
systemctl disable rshim.service
rm -rf ${mlnx_ofed_drv} update_drivers
rm -rf /tmp/MLNX* /tmp/*.conf
# Update initramfs
dracut -f /boot/initramfs-${kern_ver}.img ${kern_ver}
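
Since the build takes about 30 minutes, it can be worth confirming the two preconditions from the note above before starting the script. A minimal sketch; `preflight` is a hypothetical helper, and its optional third argument exists only so the check can be tried against a scratch directory:

```shell
#!/usr/bin/env bash
# preflight TARBALL KVER [MODULES_ROOT]: succeed only if TARBALL is in the
# current directory and the kernel build directory for KVER exists.
# MODULES_ROOT defaults to /lib/modules.
preflight() {
    local tarball="$1" kver="$2" root="${3:-/lib/modules}"
    if [ ! -f "./${tarball}" ]; then
        echo "missing ${tarball} in $(pwd)" >&2
        return 1
    fi
    if [ ! -d "${root}/${kver}/build" ]; then
        echo "missing ${root}/${kver}/build (is kernel-devel installed?)" >&2
        return 1
    fi
}

# Usage, from the directory that holds the script and the package:
#   preflight MLNX_OFED_LINUX-5.8-2.0.3.0-rhel7.9-x86_64.tgz 3.10.0-1160.90.1.el7.x86_64
```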
5. If rdma-userspace-config was uninstalled in step 2, reinstall it:
Shell
wget -q http://mirrors.baidubce.com/baidu/rdma_specs/rdma-userspace-config-bbc-v1.0.3-1.el7.centos.x86_64.rpm
rpm -ivh rdma-userspace-config-bbc-v1.0.3-1.el7.centos.x86_64.rpm
service rdma start
6. Reboot the instance to boot into the new kernel.
7. Verify the installation:
Shell
# Check that OFED 5.8-2.0.3 is installed and was compiled for the current kernel
[root@localhost ~]# rpm -qa | grep mlnx | grep 3.10.0_1160
mlnx-ofa_kernel-modules-5.8-OFED.5.8.2.0.3.1.kver.3.10.0_1160.90.1.el7.x86_64.x86_64
mlnx-ofa_kernel-devel-5.8-OFED.5.8.2.0.3.1.kver.3.10.0_1160.90.1.el7.x86_64.x86_64
knem-modules-1.1.4.90mlnx1-OFED.5.8.0.4.7.1.kver.3.10.0_1160.90.1.el7.x86_64.x86_64
[root@localhost ~]# modinfo mlx5_core
filename:       /lib/modules/3.10.0-1160.90.1.el7.x86_64/extra/mlnx-ofa_kernel/drivers/net/ethernet/mellanox/mlx5/core/mlx5_core.ko
alias:          auxiliary:mlx5_core.eth-rep
alias:          auxiliary:mlx5_core.eth
basedon:        Korg 5.17-rc4
version:        5.8-2.0.3
license:        Dual BSD/GPL
description:    Mellanox 5th generation network adapters (ConnectX series) core driver
author:         Eli Cohen <eli@mellanox.com>
retpoline:      Y
rhelversion:    7.9
srcversion:     6A14E2ECBAE645B024A60B6
...
# Check the network interface card
[root@localhost ~]# ifconfig
# Check the network interface card driver
[root@localhost ~]# ethtool -i ethX
# Check package dependencies. If a dependency is missing or a package is duplicated, resolve it manually, for example by installing the missing dependency or removing the duplicate package.
[root@localhost ~]# yum check dependencies
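
The `modinfo` check above comes down to one question: is the module file loaded from the OFED tree of the running kernel? A minimal sketch that answers it from `modinfo` output, assuming the `/lib/modules/<kernel>/extra/mlnx-ofa_kernel/` location shown in the sample output; `module_built_for` is a hypothetical helper name:

```shell
#!/usr/bin/env bash
# module_built_for KVER: read `modinfo mlx5_core` output on stdin and succeed
# if the module file lives under /lib/modules/<KVER>/extra/mlnx-ofa_kernel/.
module_built_for() {
    local want="/lib/modules/$1/extra/mlnx-ofa_kernel/"
    awk -v want="${want}" '
        $1 == "filename:" && index($2, want) == 1 { ok = 1 }
        END { exit !ok }
    '
}

# Usage after the reboot:
#   modinfo mlx5_core | module_built_for "$(uname -r)" && echo "OFED module in use"
```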

CentOS 8

For example, consider upgrading the kernel to kernel-4.18.0-348.7.1.el8_5.x86_64.

1. Install the kernel and kernel development package. Install the kernel-4.18.0-348.7.1.el8_5.x86_64 version and its associated development packages in the instance:

Shell
yum install -y kernel-4.18.0-348.7.1.el8_5.x86_64 \
                kernel-devel-4.18.0-348.7.1.el8_5.x86_64 \
                kernel-headers-4.18.0-348.7.1.el8_5.x86_64 \
                kernel-tools-4.18.0-348.7.1.el8_5.x86_64

2. Verify the RDMA lossless configuration package. Some instances have the RDMA lossless configuration package (rdma-userspace-config) installed. This package needs to be removed and reinstalled after installing OFED:

Shell
# Check if rdma-userspace-config is installed
rpm -qa | grep -i rdma-userspace-config

If the lossless configuration package is installed, run the following command to remove it:

Shell
# If it exists, uninstall the package
rpm -e rdma-userspace-config

3. Download the OFED installation package. You can download it from the Baidu software repository or transfer it to the instance after downloading it from the official OFED website:

  1. Download from the Baidu software repository. Taking OFED version 5.8-2.0.3 as an example, run the following command on the instance:
Shell
wget http://mirrors.baidubce.com/mlnx-ofed/5.8-2.0.3.0/MLNX_OFED_LINUX-5.8-2.0.3.0-rhel8.4-x86_64.tgz
  2. Download from the official website. Obtain the matching version of the OFED installation package from the official OFED site.

4. Compile and install OFED

Run the following script to compile and install the OFED driver. The entire process of compilation and installation takes about 30 minutes.

Note: Ensure the script and the OFED installation package are placed in the same directory, and update the OFED version and kernel version in the script to match the actual versions being used.

Shell
#!/usr/bin/env bash
# Update mlnx ofed drivers
# -- Prepare mlnx ofed drivers
# -- Extract and install
mlnx_ofed_version=5.8-2.0.3.0
mlnx_ofed_drv=MLNX_OFED_LINUX-${mlnx_ofed_version}-rhel8.4-x86_64.tgz
kern_ver=4.18.0-348.7.1.el8_5.x86_64
if [ ! -d /lib/modules/${kern_ver}/build ]; then
    echo "There is no kernel build directory. Please check if kernel-devel is installed ..."
    exit 1
fi
if ! which gcc >& /dev/null; then
    yum install -y gcc
fi
if ! which make >& /dev/null; then
    yum install -y make
fi
# Install build requirements
yum install -y createrepo python36-devel libtool python36 kernel-rpm-macros gdb-headless rpm-build elfutils-libelf-devel
# Install runtime requirements for all OFED components
yum install -y tk gcc-gfortran tcsh tcl libnl3-devel perl-Math-Complex cmake-filesystem
# Install updated mlnx_ofed packages including kernel modules and userspace packages
mkdir update_drivers
tar xf ${mlnx_ofed_drv} --strip-components 2 -C update_drivers/
cd update_drivers
./mlnxofedinstall --without-fw-update --add-kernel-support -k ${kern_ver} --skip-distro-check \
    --package-install-options "--force" --distro rhel8.4 -q
if [ $? -ne 0 ]; then
    echo "MLNX OFED driver install ... Failed."
    exit 1
fi
cd ..
# 82-net-setup-link.rules changes the NIC name, so disable this rule.
if [ -f /usr/lib/udev/rules.d/82-net-setup-link.rules ]; then
    mv /usr/lib/udev/rules.d/82-net-setup-link.rules /usr/lib/udev/rules.d/82-net-setup-link.rules.orig
fi
if [ -f /usr/lib/udev/rules.d/83-mlnx-sf-name.rules ]; then
    mv /usr/lib/udev/rules.d/83-mlnx-sf-name.rules /usr/lib/udev/rules.d/83-mlnx-sf-name.rules.orig
fi
# Disable rshim (tmfifo_net0)
systemctl disable rshim.service
rm -rf ${mlnx_ofed_drv} update_drivers
rm -rf /tmp/MLNX* /tmp/*.conf
# Update initramfs
dracut -f /boot/initramfs-${kern_ver}.img ${kern_ver}
5. If rdma-userspace-config was uninstalled in step 2, reinstall it:
Shell
wget -q http://mirrors.baidubce.com/baidu/rdma_specs/rdma-userspace-config-bbc-v1.0.3-1.el7.centos.x86_64.rpm
rpm -ivh --nodeps --force rdma-userspace-config-bbc-v1.0.3-1.el7.centos.x86_64.rpm
service rdma start
6. Reboot the instance to boot into the new kernel.
7. Verify the installation:
Shell
# Check that OFED 5.8-2.0.3 is installed and was compiled for the current kernel
[root@localhost ~]# rpm -qa | grep mlnx | grep 4.18.0_348
mlnx-ofa_kernel-devel-5.8-OFED.5.8.2.0.3.1.kver.4.18.0_348.7.1.el8_5.x86_64.x86_64
knem-modules-1.1.4.90mlnx1-OFED.5.8.0.4.7.1.kver.4.18.0_348.7.1.el8_5.x86_64.x86_64
mlnx-ofa_kernel-modules-5.8-OFED.5.8.2.0.3.1.kver.4.18.0_348.7.1.el8_5.x86_64.x86_64
[root@localhost ~]# modinfo mlx5_core
filename:       /lib/modules/4.18.0-348.7.1.el8_5.x86_64/extra/mlnx-ofa_kernel/drivers/net/ethernet/mellanox/mlx5/core/mlx5_core.ko
alias:          auxiliary:mlx5_core.eth-rep
alias:          auxiliary:mlx5_core.eth
basedon:        Korg 5.17-rc4
version:        5.8-2.0.3
license:        Dual BSD/GPL
description:    Mellanox 5th generation network adapters (ConnectX series) core driver
author:         Eli Cohen <eli@mellanox.com>
rhelversion:    8.5
srcversion:     AED21A09CA345D254692F69
...
# Check the network interface card
[root@localhost ~]# ifconfig
# Check the network interface card driver
[root@localhost ~]# ethtool -i ethX
# Check package dependencies. If a dependency is missing or a package is duplicated, resolve it manually, for example by installing the missing dependency or removing the duplicate package.
[root@localhost ~]# yum check
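
The reboot only switches kernels if the bootloader default points at the newly installed kernel. If the instance comes back on the old kernel, the default entry is worth checking. A minimal sketch, assuming `grubby` is available on the instance; `default_kernel_is` is a hypothetical helper name:

```shell
#!/usr/bin/env bash
# default_kernel_is KVER PATH: succeed if PATH (as printed by
# `grubby --default-kernel`) is the vmlinuz image for KVER.
default_kernel_is() {
    [ "$(basename "$2")" = "vmlinuz-$1" ]
}

# Usage (run as root):
#   kver=4.18.0-348.7.1.el8_5.x86_64
#   default_kernel_is "$kver" "$(grubby --default-kernel)" \
#       || grubby --set-default "/boot/vmlinuz-${kver}"
```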

Rocky Linux 8

For example, consider upgrading the kernel to kernel-4.18.0-477.13.1.el8_8.x86_64.

1. Install the kernel and kernel development package. Install the kernel-4.18.0-477.13.1.el8_8.x86_64 version and its associated development packages in the instance:

Shell
yum install -y kernel-4.18.0-477.13.1.el8_8.x86_64 \
                kernel-devel-4.18.0-477.13.1.el8_8.x86_64 \
                kernel-headers-4.18.0-477.13.1.el8_8.x86_64 \
                kernel-tools-4.18.0-477.13.1.el8_8.x86_64

2. Verify the RDMA lossless configuration package. Some instances have the RDMA lossless configuration package (rdma-userspace-config) installed. This package needs to be removed and reinstalled after installing OFED:

Shell
# Check if rdma-userspace-config is installed
rpm -qa | grep -i rdma-userspace-config

If the lossless configuration package is installed, run the following command to remove it:

Shell
# If it exists, uninstall the package
rpm -e rdma-userspace-config

3. Download the OFED installation package. You can download it from the Baidu software repository or transfer it to the instance after downloading it from the official OFED website:

  1. Download from the Baidu software repository. Taking OFED version 5.8-2.0.3 as an example, run the following command on the instance:
Shell
wget http://mirrors.baidubce.com/mlnx-ofed/5.8-2.0.3.0/MLNX_OFED_LINUX-5.8-2.0.3.0-rhel8.7-x86_64.tgz
  2. Download from the official website. Obtain the matching version of the OFED installation package from the official OFED site.

4. Compile and install OFED

Run the script below to compile and install the OFED driver.

Note: Ensure the script and the OFED installation package are placed in the same directory, and update the OFED version and kernel version in the script to match the actual versions being used.

The entire process of compilation and installation takes about 30 minutes.

Shell
#!/usr/bin/env bash
# Update mlnx ofed drivers
# -- Prepare mlnx ofed drivers
# -- Extract and install
mlnx_ofed_version=5.8-2.0.3.0
mlnx_ofed_drv=MLNX_OFED_LINUX-${mlnx_ofed_version}-rhel8.7-x86_64.tgz
kern_ver=4.18.0-477.13.1.el8_8.x86_64
if [ ! -d /lib/modules/${kern_ver}/build ]; then
    echo "There is no kernel build directory. Please check if kernel-devel is installed ..."
    exit 1
fi
if ! which gcc >& /dev/null; then
    yum install -y gcc
fi
if ! which make >& /dev/null; then
    yum install -y make
fi
# Install build requirements
yum install -y createrepo python36-devel libtool python36 kernel-rpm-macros gdb-headless rpm-build elfutils-libelf-devel
# Install runtime requirements for all OFED components
yum install -y tk gcc-gfortran tcsh tcl libnl3-devel perl-Math-Complex cmake-filesystem
# Install updated mlnx_ofed packages including kernel modules and userspace packages
mkdir update_drivers
tar xf ${mlnx_ofed_drv} --strip-components 2 -C update_drivers/
cd update_drivers
./mlnxofedinstall --without-fw-update --add-kernel-support -k ${kern_ver} --skip-distro-check \
    --package-install-options "--force" --distro rhel8.7 -q
if [ $? -ne 0 ]; then
    echo "MLNX OFED driver install ... Failed."
    exit 1
fi
cd ..
# 82-net-setup-link.rules changes the NIC name, so disable this rule.
if [ -f /usr/lib/udev/rules.d/82-net-setup-link.rules ]; then
    mv /usr/lib/udev/rules.d/82-net-setup-link.rules /usr/lib/udev/rules.d/82-net-setup-link.rules.orig
fi
if [ -f /usr/lib/udev/rules.d/83-mlnx-sf-name.rules ]; then
    mv /usr/lib/udev/rules.d/83-mlnx-sf-name.rules /usr/lib/udev/rules.d/83-mlnx-sf-name.rules.orig
fi
# Disable rshim (tmfifo_net0)
systemctl disable rshim.service
rm -rf ${mlnx_ofed_drv} update_drivers
rm -rf /tmp/MLNX* /tmp/*.conf
# Update initramfs
dracut -f /boot/initramfs-${kern_ver}.img ${kern_ver}
5. If rdma-userspace-config was uninstalled in step 2, reinstall it:
Shell
wget -q http://mirrors.baidubce.com/baidu/rdma_specs/rdma-userspace-config-bbc-v1.0.3-1.el7.centos.x86_64.rpm
rpm -ivh --nodeps --force rdma-userspace-config-bbc-v1.0.3-1.el7.centos.x86_64.rpm
service rdma start
6. Reboot the instance to boot into the new kernel.
7. Verify the installation:
Shell
# Check that OFED 5.8-2.0.3 is installed and was compiled for the current kernel
[root@localhost ~]# rpm -qa | grep mlnx | grep 4.18.0_477
mlnx-ofa_kernel-modules-5.8-OFED.5.8.2.0.3.1.kver.4.18.0_477.13.1.el8_8.x86_64.x86_64
mlnx-ofa_kernel-devel-5.8-OFED.5.8.2.0.3.1.kver.4.18.0_477.13.1.el8_8.x86_64.x86_64
knem-modules-1.1.4.90mlnx1-OFED.5.8.0.4.7.1.kver.4.18.0_477.13.1.el8_8.x86_64.x86_64
[root@localhost ~]# modinfo mlx5_core
filename:       /lib/modules/4.18.0-477.13.1.el8_8.x86_64/extra/mlnx-ofa_kernel/drivers/net/ethernet/mellanox/mlx5/core/mlx5_core.ko
alias:          auxiliary:mlx5_core.eth-rep
alias:          auxiliary:mlx5_core.eth
basedon:        Korg 5.17-rc4
version:        5.8-2.0.3
license:        Dual BSD/GPL
description:    Mellanox 5th generation network adapters (ConnectX series) core driver
author:         Eli Cohen <eli@mellanox.com>
rhelversion:    8.8
srcversion:     AED21A09CA345D254692F69
...
# Check the network interface card
[root@localhost ~]# ifconfig
# Check the network interface card driver
[root@localhost ~]# ethtool -i ethX
# Check package dependencies. If a dependency is missing or a package is duplicated, resolve it manually, for example by installing the missing dependency or removing the duplicate package.
[root@localhost ~]# yum check
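
The `ethtool -i` check above can be scripted by extracting single fields from its `key: value` output. A minimal sketch; `ethtool_field` is a hypothetical helper, and `eth0` in the usage comment stands in for the actual interface name:

```shell
#!/usr/bin/env bash
# ethtool_field FIELD: read `ethtool -i <iface>` output on stdin and print
# the value of FIELD (for example "driver" or "version").
# Only a line that starts with "FIELD:" matches, so "version" does not
# pick up the "firmware-version" line.
ethtool_field() {
    awk -v field="$1" '
        index($0, field ":") == 1 { sub(/^[^:]*:[ \t]*/, ""); print; exit }
    '
}

# Usage:
#   ethtool -i eth0 | ethtool_field driver
#   ethtool -i eth0 | ethtool_field version
```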

Rocky Linux 9

For example, consider upgrading the kernel to kernel-5.14.0-284.11.1.el9_2.x86_64.

1. Install the kernel and kernel development package. Install the kernel-5.14.0-284.11.1.el9_2.x86_64 version and its associated development packages in the instance:

Shell
yum install -y kernel-5.14.0-284.11.1.el9_2.x86_64 \
               kernel-devel-5.14.0-284.11.1.el9_2.x86_64 \
               kernel-headers-5.14.0-284.11.1.el9_2.x86_64 \
               kernel-tools-5.14.0-284.11.1.el9_2.x86_64

2. Check for the RDMA lossless configuration package. Some instances have the RDMA lossless configuration package (rdma-userspace-config) installed. It must be removed before installing OFED and reinstalled afterward:

Shell
# Check if rdma-userspace-config is installed
rpm -qa | grep -i rdma-userspace-config

If the lossless configuration package is installed, uninstall it:

Shell
# If it exists, uninstall the package
rpm -e rdma-userspace-config
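
Because the package is reinstalled later only if it was removed here, it can help to record that decision. The following is a minimal sketch under stated assumptions: the marker file path and the function name `mark_if_lossless_pkg` are hypothetical, and the package name is matched by prefix only.

```shell
# Hypothetical marker file used to remember that the package was removed.
marker=${MARKER:-/var/tmp/rdma-userspace-config.removed}

# Reads a package list (the output of `rpm -qa`) on stdin. Creates the marker
# and returns 0 when the lossless configuration package is listed.
mark_if_lossless_pkg() {
    if grep -qi '^rdma-userspace-config'; then
        : > "$marker"
        return 0
    fi
    return 1
}
```

For example, `rpm -qa | mark_if_lossless_pkg && rpm -e rdma-userspace-config` removes the package only when it is present, and a later `[ -f "$marker" ]` test tells you whether the reinstall step applies.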

3. Download the OFED installation package. You can download it from the Baidu software repository or transfer it to the instance after downloading it from the official OFED website:

  1. Download from the Baidu software repository. For example, to download OFED version 5.8-2.0.3, run the following command in the instance:
Shell
wget http://mirrors.baidubce.com/mlnx-ofed/5.8-2.0.3.0/MLNX_OFED_LINUX-5.8-2.0.3.0-rhel9.1-x86_64.tgz
  2. Download from the official website: obtain the matching version of the OFED installation package from the official OFED site, then transfer it to the instance.
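
The mirror URLs used on this page share one pattern: the OFED version appears as a directory and again in the file name, followed by a distribution tag. As a sketch, the URL can be built from those two values; the function name `ofed_mirror_url` is hypothetical, and the pattern is inferred only from the example URLs here, so other version and distribution combinations may not exist on the mirror.

```shell
# Hypothetical helper: build the mirror URL for an OFED tarball.
# $1: OFED version, e.g. 5.8-2.0.3.0
# $2: distribution tag, e.g. rhel9.1 or ubuntu22.04
ofed_mirror_url() {
    printf 'http://mirrors.baidubce.com/mlnx-ofed/%s/MLNX_OFED_LINUX-%s-%s-x86_64.tgz\n' \
        "$1" "$1" "$2"
}
```

Usage: `wget "$(ofed_mirror_url 5.8-2.0.3.0 rhel9.1)"`. Keeping the version and tag as parameters makes it easier to keep the download in step with the `mlnx_ofed_version` value used by the install script.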

4. Compile and install OFED

Run the script below to compile and install the OFED driver.

Note: Ensure the script and the OFED installation package are placed in the same directory, and update the OFED version and kernel version in the script to match the actual versions being used.

The entire process of compilation and installation takes about 30 minutes.

Shell
#!/usr/bin/env bash
# Update mlnx ofed drivers
# -- Prepare mlnx ofed drivers
# -- Extract and install
mlnx_ofed_version=5.8-2.0.3.0
mlnx_ofed_drv=MLNX_OFED_LINUX-${mlnx_ofed_version}-rhel9.1-x86_64.tgz
kern_ver=5.14.0-284.11.1.el9_2.x86_64
if [ ! -d /lib/modules/${kern_ver}/build ]; then
    echo "There is no kernel build directory. Please check if kernel-devel is installed ..."
    exit 1
fi
if ! which gcc >& /dev/null; then
    yum install -y gcc
fi
if ! which make >& /dev/null; then
    yum install -y make
fi
# Install build requirements
yum install -y perl createrepo python3-devel libtool kernel-rpm-macros gdb-headless rpm-build elfutils-libelf-devel
# Install runtime requirements for all OFED components
yum install -y tk gcc-gfortran tcsh tcl libnl3-devel cmake-filesystem
# Install updated mlnx_ofed packages including kernel modules and userspace packages
mkdir -p update_drivers
tar xf ${mlnx_ofed_drv} --strip-components 2 -C update_drivers/
# Stop if the extraction directory is not usable
cd update_drivers || exit 1
./mlnxofedinstall --without-fw-update --add-kernel-support -k ${kern_ver} --skip-distro-check \
    --package-install-options "--force" --distro rhel9.1 -q
if [ $? -ne 0 ]; then
    echo "MLNX OFED driver install ... Failed."
    exit 1
fi
cd ..
# 82-net-setup-link.rules changes the NIC names, so disable this rule.
if [ -f /usr/lib/udev/rules.d/82-net-setup-link.rules ]; then
    mv /usr/lib/udev/rules.d/82-net-setup-link.rules /usr/lib/udev/rules.d/82-net-setup-link.rules.orig
fi
if [ -f /usr/lib/udev/rules.d/83-mlnx-sf-name.rules ]; then
    mv /usr/lib/udev/rules.d/83-mlnx-sf-name.rules /usr/lib/udev/rules.d/83-mlnx-sf-name.rules.orig
fi
# Disable rshim (tmfifo_net0)
systemctl disable rshim.service
# Clean up the tarball, the extracted files, and temporary build files
rm -rf ${mlnx_ofed_drv} update_drivers
rm -rf /tmp/MLNX* /tmp/*.conf
# Update the initramfs for the new kernel
dracut -f /boot/initramfs-${kern_ver}.img ${kern_ver}
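
The script assumes two things before it starts: the tarball sits in the same directory as the script, and the build directory for the target kernel exists. A minimal pre-flight sketch that checks both at once is shown below; the function name `preflight_ofed` and its parameters are illustrative assumptions, and the modules root is a parameter only so the check can be tried outside a real instance.

```shell
# Hypothetical pre-flight check for the inputs the install script relies on.
# $1: directory holding the script and the tarball
# $2: tarball file name (the value of mlnx_ofed_drv)
# $3: modules root, /lib/modules on a real system
# $4: target kernel version (the value of kern_ver)
preflight_ofed() {
    status=0
    if [ ! -f "$1/$2" ]; then
        echo "missing tarball: $1/$2"
        status=1
    fi
    if [ ! -d "$3/$4/build" ]; then
        echo "missing kernel build directory: $3/$4/build"
        status=1
    fi
    return "$status"
}
```

For example, `preflight_ofed . "$mlnx_ofed_drv" /lib/modules "$kern_ver" || exit 1` reports every missing input before the roughly 30-minute build begins, rather than stopping at the first one.
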
  • If rdma-userspace-config was uninstalled in step 2, reinstall it:
Shell
wget -q http://mirrors.baidubce.com/baidu/rdma_specs/rdma-userspace-config-bbc-v1.0.3-1.el7.centos.x86_64.rpm
rpm -ivh --nodeps --force rdma-userspace-config-bbc-v1.0.3-1.el7.centos.x86_64.rpm
service rdma start

5. Reboot the instance to boot into the new kernel.

6. Verify the installation:

Shell
# Check that OFED 5.8-2.0.3 is installed and was compiled for the current kernel
[root@localhost ~]# rpm -qa | grep mlnx | grep 5.14.0_284
mlnx-ofa_kernel-modules-5.8-OFED.5.8.2.0.3.1.kver.5.14.0_284.11.1.el9_2.x86_64.x86_64
mlnx-ofa_kernel-devel-5.8-OFED.5.8.2.0.3.1.kver.5.14.0_284.11.1.el9_2.x86_64.x86_64
knem-modules-1.1.4.90mlnx1-OFED.5.8.0.4.7.1.kver.5.14.0_284.11.1.el9_2.x86_64.x86_64
[root@localhost ~]# modinfo mlx5_core
filename:       /lib/modules/5.14.0-284.11.1.el9_2.x86_64/extra/mlnx-ofa_kernel/drivers/net/ethernet/mellanox/mlx5/core/mlx5_core.ko
alias:          auxiliary:mlx5_core.eth-rep
alias:          auxiliary:mlx5_core.eth
basedon:        Korg 5.17-rc4
version:        5.8-2.0.3
license:        Dual BSD/GPL
description:    Mellanox 5th generation network adapters (ConnectX series) core driver
author:         Eli Cohen <eli@mellanox.com>
rhelversion:    9.2
srcversion:     E4CF39F4680CAB323741675
...
# Check the network interfaces
[root@localhost ~]# ifconfig
# Check the network interface driver (replace ethX with the interface name)
[root@localhost ~]# ethtool -i ethX
# Check package dependencies. If a dependency is missing, a package is duplicated, or other issues are reported, resolve them manually, for example by installing the missing dependency or removing the duplicate package.
[root@localhost ~]# yum check
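
In the RPM package names above, the kernel version after `kver.` is written with underscores where `uname -r` prints dashes (`5.14.0_284...` rather than `5.14.0-284...`), which is why the `grep` pattern uses an underscore. RPM version and release fields cannot contain a dash, which presumably explains the substitution. A small sketch of the conversion, with the hypothetical function name `rpm_kver_tag`:

```shell
# Hypothetical helper: convert a kernel version as printed by `uname -r`
# into the form used after "kver." in the OFED RPM package names.
rpm_kver_tag() {
    printf '%s\n' "$1" | sed 's/-/_/g'
}
```

With it, the package check can follow the running kernel instead of a hard-coded string: `rpm -qa | grep mlnx | grep -F "kver.$(rpm_kver_tag "$(uname -r)")"`.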

Ubuntu 20.04

For example, consider upgrading the kernel to 5.4.0-152-generic.

1. Install the kernel and kernel development package. Install the 5.4.0-152-generic version and other related components in the instance:

Shell
apt-get install -y linux-image-5.4.0-152-generic \
                   linux-headers-5.4.0-152 \
                   linux-headers-5.4.0-152-generic \
                   linux-modules-5.4.0-152-generic \
                   linux-modules-extra-5.4.0-152-generic

2. Check for the RDMA lossless configuration package. Some instances have the RDMA lossless configuration package (rdma-userspace-config) installed. It must be removed before installing OFED and reinstalled afterward:

Shell
# Check if rdma-userspace-config is installed
dpkg -l | grep -i rdma-userspace-config

If the lossless configuration package is installed, uninstall it:

Shell
# If it exists, uninstall the package
dpkg -r rdma-userspace-config

3. Download the OFED installation package. You can download it from the Baidu software repository or transfer it to the instance after downloading it from the official OFED website:

  1. Download from the Baidu software repository. For example, to download OFED version 5.8-2.0.3, run the following command in the instance:
Shell
wget http://mirrors.baidubce.com/mlnx-ofed/5.8-2.0.3.0/MLNX_OFED_LINUX-5.8-2.0.3.0-ubuntu20.04-x86_64.tgz
  2. Download from the official website: obtain the matching version of the OFED installation package from the official OFED site, then transfer it to the instance.

4. Compile and install OFED

Run the following script to compile and install the OFED driver. The entire process of compilation and installation takes about 30 minutes.

Note: Ensure the script and the OFED installation package are placed in the same directory, and update the OFED version and kernel version in the script to match the actual versions being used.

Shell
#!/usr/bin/env bash
# Update mlnx ofed drivers
# -- Prepare mlnx ofed drivers
# -- Extract and install
mlnx_ofed_version=5.8-2.0.3.0
mlnx_ofed_drv=MLNX_OFED_LINUX-${mlnx_ofed_version}-ubuntu20.04-x86_64.tgz
kern_ver=5.4.0-152-generic
if [ ! -d /lib/modules/${kern_ver}/build ]; then
    echo "There is no kernel build directory. Please check if linux-headers-${kern_ver} is installed ..."
    exit 1
fi
apt-get update
if ! which gcc >& /dev/null; then
    apt-get -y install gcc
fi
if ! which make >& /dev/null; then
    apt-get -y install make
fi
# Install updated mlnx_ofed packages including kernel modules and userspace packages
mkdir -p update_drivers
tar xf ${mlnx_ofed_drv} --strip-components 2 -C update_drivers/
# Stop if the extraction directory is not usable
pushd update_drivers || exit 1
./mlnxofedinstall --without-fw-update --add-kernel-support -k ${kern_ver} --skip-distro-check -q
if [ $? -ne 0 ]; then
    # The original called an undefined ECHO_ERROR helper; print to stderr instead
    echo "MLNX OFED driver install ... Failed." >&2
    exit 1
fi
popd
# 82-net-setup-link.rules changes the NIC names, so disable this rule.
if [ -f /lib/udev/rules.d/82-net-setup-link.rules ]; then
    mv /lib/udev/rules.d/82-net-setup-link.rules /lib/udev/rules.d/82-net-setup-link.rules.orig
fi
if [ -f /usr/lib/udev/rules.d/83-mlnx-sf-name.rules ]; then
    mv /usr/lib/udev/rules.d/83-mlnx-sf-name.rules /usr/lib/udev/rules.d/83-mlnx-sf-name.rules.orig
fi
# Disable rshim (tmfifo_net0)
systemctl disable rshim.service
# Clean up the tarball, the extracted files, and temporary build files
rm -rf ${mlnx_ofed_drv} update_drivers
rm -rf /tmp/MLNX* /tmp/*.conf
# Update the initramfs for the new kernel
dracut -f /boot/initrd.img-${kern_ver} ${kern_ver}
  • If rdma-userspace-config was uninstalled in step 2, reinstall it:
Shell
wget -q http://mirrors.baidubce.com/baidu/rdma_specs/rdma-userspace-config-bbc_1.0.4_x86.deb
dpkg -i rdma-userspace-config-bbc_1.0.4_x86.deb
service rdma start

5. Reboot the instance to boot into the new kernel.

6. Verify the installation:

Shell
# Check that OFED 5.8-2.0.3 is installed and was compiled for the current kernel
[root@localhost ~]# dpkg -l | grep mlnx | grep 5.4.0-152
ii  knem-modules                          1.1.4.90mlnx1-OFED.5.8.0.4.7.1.kver.5.4.0-152-generic amd64        kernel module for high-performance intra-node MPI communication for large messages
ii  mlnx-ofed-kernel-modules              5.8-OFED.5.8.2.0.3.1.kver.5.4.0-152-generic           amd64        mlnx-ofed kernel modules
ii  mlnx-ofed-kernel-utils                5.8-OFED.5.8.2.0.3.1.kver.5.4.0-152-generic           amd64        Userspace tools to restart and tune mlnx-ofed kernel modules
[root@localhost ~]# modinfo mlx5_core
filename:       /lib/modules/5.4.0-152-generic/updates/dkms/mlx5_core.ko
alias:          auxiliary:mlx5_core.eth-rep
alias:          auxiliary:mlx5_core.eth
basedon:        Korg 5.17-rc4
version:        5.8-2.0.3
license:        Dual BSD/GPL
description:    Mellanox 5th generation network adapters (ConnectX series) core driver
author:         Eli Cohen <eli@mellanox.com>
srcversion:     E9B015CBD3F471BDD31CB24
...
# Check the network interfaces
[root@localhost ~]# ifconfig
# Check the network interface driver (replace ethX with the interface name)
[root@localhost ~]# ethtool -i ethX
# Check package dependencies. If a dependency is missing, a package is duplicated, or other issues are reported, resolve them manually, for example by installing the missing dependency or removing the duplicate package.
[root@localhost ~]# apt-get check
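
The checks above are only meaningful once the instance is running the target kernel, because `modinfo` without a `-k` option looks up modules for the running kernel. A minimal sketch for confirming this after the reboot follows; the function name `check_running_kernel` is hypothetical, and its second argument exists only so the comparison can be tried without rebooting.

```shell
# Hypothetical helper: confirm the instance booted into the target kernel.
# $1: expected kernel version (the kern_ver value used by the install script)
# $2: running kernel version; defaults to the output of `uname -r`
check_running_kernel() {
    running=${2:-$(uname -r)}
    if [ "$running" = "$1" ]; then
        echo "running target kernel $1"
    else
        echo "running $running, expected $1; check the default boot entry"
        return 1
    fi
}
```

For example, `check_running_kernel 5.4.0-152-generic` run after the reboot returns non-zero if the instance came back up on the previous kernel.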

Ubuntu 22.04

For example, consider upgrading the kernel to 5.15.0-75-generic.

1. Install the kernel and kernel development package. Install the 5.15.0-75-generic version and other related components in the instance:

Shell
apt-get install -y linux-image-5.15.0-75-generic \
                   linux-headers-5.15.0-75 \
                   linux-headers-5.15.0-75-generic \
                   linux-modules-5.15.0-75-generic \
                   linux-modules-extra-5.15.0-75-generic

2. Check for the RDMA lossless configuration package. Some instances have the RDMA lossless configuration package (rdma-userspace-config) installed. It must be removed before installing OFED and reinstalled afterward:

Shell
# Check if rdma-userspace-config is installed
dpkg -l | grep -i rdma-userspace-config
# If it exists, uninstall the package
dpkg -r rdma-userspace-config
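
One caveat with `dpkg -l | grep`: it also matches packages in the `rc` state (removed, with configuration files left behind), so a match does not always mean the package is still installed. A minimal sketch that accepts only the `ii` (installed) state is shown below; the function name `lossless_pkg_installed` is hypothetical, and the package name is matched by prefix, which is an assumption about how the package is named.

```shell
# Hypothetical helper: return 0 only if `dpkg -l` output on stdin lists a
# package whose name starts with rdma-userspace-config in the "ii" state.
lossless_pkg_installed() {
    awk '$1 == "ii" && $2 ~ /^rdma-userspace-config/ { found = 1 }
         END { exit !found }'
}
```

Usage: `dpkg -l | lossless_pkg_installed && dpkg -r rdma-userspace-config`.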

3. Download the OFED installation package. You can download it from the Baidu software repository or transfer it to the instance after downloading it from the official OFED website:

  1. Download from the Baidu software repository. For example, to download OFED version 5.8-2.0.3, run the following command in the instance:
Shell
wget http://mirrors.baidubce.com/mlnx-ofed/5.8-2.0.3.0/MLNX_OFED_LINUX-5.8-2.0.3.0-ubuntu22.04-x86_64.tgz
  2. Download from the official website: obtain the matching version of the OFED installation package from the official OFED site, then transfer it to the instance.

4. Compile and install OFED

Run the following script to compile and install the OFED driver. The entire process of compilation and installation takes about 30 minutes.

Note: Ensure the script and the OFED installation package are placed in the same directory, and update the OFED version and kernel version in the script to match the actual versions being used.

Shell
#!/usr/bin/env bash
# Update mlnx ofed drivers
# -- Prepare mlnx ofed drivers
# -- Extract and install
mlnx_ofed_version=5.8-2.0.3.0
mlnx_ofed_drv=MLNX_OFED_LINUX-${mlnx_ofed_version}-ubuntu22.04-x86_64.tgz
kern_ver=5.15.0-75-generic
if [ ! -d /lib/modules/${kern_ver}/build ]; then
    echo "There is no kernel build directory. Please check if linux-headers-${kern_ver} is installed ..."
    exit 1
fi
apt-get update
if ! which gcc >& /dev/null; then
    apt-get -y install gcc
fi
if ! which make >& /dev/null; then
    apt-get -y install make
fi
# Install required packages for installing MLNX_OFED_LINUX
apt-get install -y libnl-route-3-200 libnl-route-3-dev libnl-3-dev bison libfuse2 flex gfortran tk libnuma-dev libgfortran5
# Install updated mlnx_ofed packages including kernel modules and userspace packages
mkdir -p update_drivers
tar xf ${mlnx_ofed_drv} --strip-components 2 -C update_drivers/
# Stop if the extraction directory is not usable
pushd update_drivers || exit 1
./mlnxofedinstall --without-fw-update --add-kernel-support -k ${kern_ver} --skip-distro-check -q
if [ $? -ne 0 ]; then
    # The original called an undefined ECHO_ERROR helper; print to stderr instead
    echo "MLNX OFED driver install ... Failed." >&2
    exit 1
fi
popd
# 82-net-setup-link.rules changes the NIC names, so disable this rule.
if [ -f /lib/udev/rules.d/82-net-setup-link.rules ]; then
    mv /lib/udev/rules.d/82-net-setup-link.rules /lib/udev/rules.d/82-net-setup-link.rules.orig
fi
if [ -f /usr/lib/udev/rules.d/83-mlnx-sf-name.rules ]; then
    mv /usr/lib/udev/rules.d/83-mlnx-sf-name.rules /usr/lib/udev/rules.d/83-mlnx-sf-name.rules.orig
fi
# Disable rshim (tmfifo_net0)
systemctl disable rshim.service
# Clean up the tarball, the extracted files, and temporary build files
rm -rf ${mlnx_ofed_drv} update_drivers
rm -rf /tmp/MLNX* /tmp/*.conf
# Update the initramfs for the new kernel
dracut -f /boot/initrd.img-${kern_ver} ${kern_ver}
  • If rdma-userspace-config was uninstalled in step 2, reinstall it:
Shell
wget -q http://mirrors.baidubce.com/baidu/rdma_specs/rdma-userspace-config-bbc_1.0.4_x86.deb
dpkg -i rdma-userspace-config-bbc_1.0.4_x86.deb
service rdma start

5. Reboot the instance to boot into the new kernel.

6. Verify the installation:

Shell
# Check that OFED 5.8-2.0.3 is installed and was compiled for the current kernel
[root@localhost ~]# dpkg -l | grep mlnx | grep 5.15.0-75
ii  knem-modules                          1.1.4.90mlnx1-OFED.5.8.0.4.7.1.kver.5.15.0-75-generic amd64        kernel module for high-performance intra-node MPI communication for large messages
ii  mlnx-ofed-kernel-modules              5.8-OFED.5.8.2.0.3.1.kver.5.15.0-75-generic           amd64        mlnx-ofed kernel modules
ii  mlnx-ofed-kernel-utils                5.8-OFED.5.8.2.0.3.1.kver.5.15.0-75-generic           amd64        Userspace tools to restart and tune mlnx-ofed kernel modules
[root@localhost ~]# modinfo mlx5_core
filename:       /lib/modules/5.15.0-75-generic/updates/dkms/mlx5_core.ko
alias:          auxiliary:mlx5_core.eth-rep
alias:          auxiliary:mlx5_core.eth
basedon:        Korg 5.17-rc4
version:        5.8-2.0.3
license:        Dual BSD/GPL
description:    Mellanox 5th generation network adapters (ConnectX series) core driver
author:         Eli Cohen <eli@mellanox.com>
srcversion:     E4CF39F4680CAB323741675
...
# Check the network interfaces
[root@localhost ~]# ifconfig
# Check the network interface driver (replace ethX with the interface name)
[root@localhost ~]# ethtool -i ethX
# Check package dependencies. If a dependency is missing, a package is duplicated, or other issues are reported, resolve them manually, for example by installing the missing dependency or removing the duplicate package.
[root@localhost ~]# apt-get check
