Comments (16)
I installed a .100 release tonight for Ubuntu. Seems to have resolved the issue on my end.
from longhorn.
Besides of the environment_check script, I'm considering adding a condition in node.status.conditions to check the environment, like the kernel version.
WDYT? @innobead @james-munson
from longhorn.
Is there currently a workaround?
I think it might just be either downgrading to the .92 kernel release or wait for the .100 release
@derekbit Is that accurate?
Yes, totally right!
from longhorn.
Here we same, .100 kernel solved it. Also the actual /new her-kernel (6.5) solves it.
from longhorn.
Sounds good to me @derekbit . Let's update the requirements of this request.
Updated. Thank you.
from longhorn.
@james-munson remember to update the zenhub status.
from longhorn.
Looking at the probable culprit change and its fix it appears that this is an NFS v4.1 issue, but that is our default.
from longhorn.
Sounds good to me @derekbit . Let's update the requirements of this request.
from longhorn.
Is there currently a workaround?
from longhorn.
Is there currently a workaround?
I think it might just be either downgrading to the .92 kernel release or wait for the .100 release
@derekbit Is that accurate?
from longhorn.
Thank you, downgrading indeed solved the problem. 👍
from longhorn.
Pre Ready-For-Testing Checklist
-
Where is the reproduce steps/test steps documented?
The reproduce steps/test steps are
Find a cluster or install a problematic kernel, such as5.15.94
or6.5.6
. Create an RWX volume and mount it. A failing mount should log the host kernel release and OS distro in the CSI plugin logging. -
Is there a workaround for the issue? If so, where is it documented?
The workaround is at: In the defect. Essentially, install a different kernel. (This does not fix it, but it makes it easier to detect on the fly.) -
Does the PR include the explanation for the fix or the feature?
-
Does the PR include deployment change (YAML/Chart)? If so, where are the PRs for both YAML file and Chart?
The PR for the YAML change is at:
The PR for the chart change is at: -
Have the backend code been merged (Manager, Engine, Instance Manager, BackupStore etc) (including
backport-needed/*
)?
The PR is at -
Which areas/issues this PR might have potential impacts on?
Area
Issues -
If labeled: require/doc Has the necessary document PR submitted or merged (including
backport-needed/*
)?
The documentation issue/PR is at longhorn/website#873
from longhorn.
May need to update the fixed
and broken
versions. I see in #6857 (comment) that 6.5.8 is broken, but the fix came in or before 6.5.11.
from longhorn.
Note that this issue happens to me on 5.15.0-1047-raspi
from longhorn.
And also (reported in Slack) Ubuntu kernel 6.5.0-21-generic #21~22.04.1-Ubuntu
from longhorn.
And also (reported in Slack) Ubuntu kernel
6.5.0-21-generic #21~22.04.1-Ubuntu
I am tracking this same bug with NFS4 on vsphere-csi but can confirm that it exists in the current HWE kernels for 20.04, 22.04 and the current release kernels of 23.10 along with haphazardly most of the current versions of the variant kernels that I tested. Finding an affected kernel in any current Ubuntu release is like shooting fish in a barrel. The list extends to pretty much every variant they currently ship -- publishing a list of affected package versions without actually testing all the packaged kernels is going to lead to a lot more reports over the coming weeks as an Ubuntu system is more or less going to unavoidably upgrade to a broken kernel.
I do not see any fixes on launchpad for the HWE kernel 6.5.0 similar to those already in the pipeline for 5.15.0-100 so it's tough to understand if this is actually getting the proper attention.
from longhorn.
Related Issues (20)
- [BUG] Occasionally recurring backup job failed to type 1 acquisition error HOT 5
- [FEATURE] Add a variable related with fromBackup to helm chart of longhorn. HOT 1
- [IMPROVEMENT] Building longhorn-manager takes long time HOT 1
- [BUG] Rebuilding Replica fails on larger volumes
- [IMPROVEMENT] Make backup wait until there is no backup being delete and Add the progress time
- [Question] How can I get diff between two snapshots of the same volume?
- [UI][IMPROVEMENT] Make backup wait until there is no backup being delete and Add the progress time
- [BUG] Assigned Replicas on VM Nodes keep flickering around HOT 3
- [BUG] unable to drain nodes with strict-local volumes HOT 3
- [IMPROVEMENT] All logs are displayed in UTC timezone and cannot be changed
- [IMPROVEMENT] Consider disk space when creating BackingImage. Plus monitoring the disk usage of the BackingImage HOT 1
- [IMPROVEMENT] Concurrent BackingImage creation limit Per Node
- [IMPROVEMENT] Collect and display disk space usage for the backing images
- [IMPROVEMENT] Cross-disk and cross-node backing image encryption and decryption
- [UI][IMPROVEMENT] Collect and display disk space usage for the backing images
- [IMPROVEMENT] Check the disk usage before encrypting or decryption a backing image
- [FEATURE] Re-encryption of an encrypted backing image
- [TEST][FEATURE] Re-encryption of an encrypted backing image
- [BACKPORT][v1.6.3][BUG] unable to drain nodes with strict-local volumes
- [BACKPORT][v1.5.6][BUG] unable to drain nodes with strict-local volumes
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from longhorn.