Re: regression: 4.13 cannot follow symlinks on some ext3 fs
From: Andreas Dilger
Date: Fri Nov 24 2017 - 01:12:59 EST
On Nov 23, 2017, at 7:04 PM, Andi Kleen <andi@xxxxxxxxxxxxxx> wrote:
>
>> As a workaround, you could delete and recreate the symlink with the new
>
> I revert the patch for now. Everything seems to work.
>
>> kernel to create a proper fast symlink. It would be useful to scan
>> the image to see if there are other similar symlinks present:
>>
>> find /myth/tmp -type l -size -60 -ls | awk '$2 != 0 { print }'
>
> Doesn't find anything. Your recipe must be wrong.
I see that I should have used "-60c" to properly limit the listing to
short symlinks, but this doesn't appear to be the core problem. It
looks like there is a bug in find (at least version 4.4.2 that I'm
testing with) that it doesn't print the blocks count properly.
According to find(1) the "-ls" argument should list the file the same
as "ls -dils" format (blocks is $2), but as shown below "find -ls"
prints "0" for blocks when it should be "4" (for a long symlink using
"+60c" in my example, I couldn't find any short+external symlinks on a
couple of 8 year old root filesystems):
$ find /etc/alternatives/rmid -type l -size +60c -ls
327877 0 lrwxrwxrwx 1 root root 73 Jan 4 2017 /etc/alternatives/rmid -> /usr/lib/jvm/java-1.8.0-openjdk-1.8.0.111-0.b15.el6_8.x86_64/jre/bin/rmid
$ ls -dils /etc/alternatives/rmid
327877 4 lrwxrwxrwx 1 root root 73 Jan 4 2017 /etc/alternatives/rmid -> /usr/lib/jvm/java-1.8.0-openjdk-1.8.0.111-0.b15.el6_8.x86_64/jre/bin/rmid*
Try the following command instead:
find / -type l -size -60c -print0 | xargs -0r ls -dils | awk '$2 != 0 { print }'
>> This is probably something that e2fsck should check for and fix.
>
> Nah the kernel should just support it like it always did.
The reason we changed this code in the first place was because the
old check would repeatedly break when some new reason for storing
blocks on a symlink appeared. It broke when xattrs were allowed
on symlinks for SELinux. It broke when bigalloc blocks were added.
It broke when inline_data was added, and it would have broken (and
been really hard to fix efficiently) when large xattrs were added.
We checked old kernels, and old e2fsprogs, and didn't see any cases
where fast (<= 60 chars) symlinks were created using external blocks.
It seems that _something_ did create them, and it would be good to
figure that out so we can determine if it is a widespread problem.
I think e2fsck can fix this quite easily, and there really isn't
an easy way to revert to the old method if the large xattr feature
is enabled. If you are willing to run a new kernel, you should also
be willing to run a new e2fsck.
We could probably add a fallback to the old mechanism (and print
a one-time warning to upgrade to a newer e2fsck) if an external fast
symlink is found and the large xattr feature is not enabled, which
would give more time to fix this (hopefully rare in the wild) case.
Cheers, Andreas
Attachment:
signature.asc
Description: Message signed with OpenPGP