Re: [PATCH v2] binder: use cred instead of task for selinux checks
From: Jann Horn
Date: Mon Oct 04 2021 - 20:27:16 EST
On Tue, Oct 5, 2021 at 1:38 AM Casey Schaufler <casey@xxxxxxxxxxxxxxxx> wrote:
> On 10/4/2021 3:28 PM, Jann Horn wrote:
> > On Mon, Oct 4, 2021 at 6:19 PM Casey Schaufler <casey@xxxxxxxxxxxxxxxx> wrote:
> >> On 10/1/2021 3:58 PM, Jann Horn wrote:
> >>> On Fri, Oct 1, 2021 at 10:10 PM Casey Schaufler <casey@xxxxxxxxxxxxxxxx> wrote:
> >>>> On 10/1/2021 12:50 PM, Jann Horn wrote:
> >>>>> On Fri, Oct 1, 2021 at 9:36 PM Jann Horn <jannh@xxxxxxxxxx> wrote:
> >>>>>> On Fri, Oct 1, 2021 at 8:46 PM Casey Schaufler <casey@xxxxxxxxxxxxxxxx> wrote:
> >>>>>>> On 10/1/2021 10:55 AM, Todd Kjos wrote:
> >>>>>>>> Save the struct cred associated with a binder process
> >>>>>>>> at initial open to avoid potential race conditions
> >>>>>>>> when converting to a security ID.
> >>>>>>>>
> >>>>>>>> Since binder was integrated with selinux, it has passed
> >>>>>>>> 'struct task_struct' associated with the binder_proc
> >>>>>>>> to represent the source and target of transactions.
> >>>>>>>> The conversion of task to SID was then done in the hook
> >>>>>>>> implementations. It turns out that there are race conditions
> >>>>>>>> which can result in an incorrect security context being used.
> >>>>>>> In the LSM stacking patch set I've been posting for a while
> >>>>>>> (on version 29 now) I use information from the task structure
> >>>>>>> to ensure that the security information passed via the binder
> >>>>>>> interface is agreeable to both sides. Passing the cred will
> >>>>>>> make it impossible to do this check. The task information
> >>>>>>> required is not appropriate to have in the cred.
> >>>>>> Why not? Why can't you put the security identity of the task into the creds?
> >>>>> Ah, I get it now, you're concerned about different processes wanting
> >>>>> to see security contexts formatted differently (e.g. printing the
> >>>>> SELinux label vs printing the AppArmor label), right?
> >>>> That is correct.
> >>>>
> >>>>> But still, I don't think you can pull that information from the
> >>>>> receiving task. Maybe the easiest solution would be to also store that
> >>>>> in the creds? Or you'd have to manually grab that information when
> >>>>> /dev/binder is opened.
> >>>> I'm storing the information in the task security blob because that's
> >>>> the appropriate scope. Today the LSM hook is given both task_struct's.
> >>> Which is wrong, because you have no idea who the semantic "recipient
> >>> task" is - any task that has a mapping of the binder fd can
> >>> effectively receive transactions from it.
> >>>
> >>> (And the current "sender task" is also wrong, because binder looks at
> >>> the task that opened the binder device, not the task currently
> >>> performing the action.)
> >> I'm confused. Are you saying that the existing binder code is
> >> completely broken? Are you saying that neither "task" is correct?
> > Yeah, basically
>
> Well, hot biscuits and gravy!
>
> > - but luckily the actual impact this has is limited by
> > the transitions that SELinux permits. If domain1 has no way to
> > transition to domain2, then it can't abuse this bug to pretend to be
> > domain2. I do have a reproducer that lets Android's "shell" domain
> > send a binder transaction that appears to come from "runas", but
> > luckily "runas" has no interesting privileges with regards to binder,
> > so that's not exploitable.
>
> You're counting on the peculiarities of the SELinux policy you're
> assuming is used to mask the fact that the hook isn't really doing
> what it is supposed to? Ouch.
I'm not saying I like the current situation - I do think that this
needs to change. I'm just saying it probably isn't *exploitable*, and
exploitability often hinges on these little circumstantial details.
> >> How does passing the creds from the wrong tasks "fix" the problem?
> > This patch is not passing the creds from the "wrong" tasks at all. It
> > relies on the basic idea that when a security context opens a
> > resource, and then hands that resource to another context for
> > read/write operations, then you can effectively treat this as a
> > delegation of privileges from the original opener, and perform access
> > checks against the credentials using which the resource was opened.
>
> OK. I can understand that without endorsing it.
>
> > In particular, we already have those semantics in the core kernel for
> > ->read() and ->write() VFS operations - they are *not allowed* to look
> > at the credentials of the caller, and if they want to make security
> > checks, they have to instead check against file->f_cred, which are the
> > credentials using which the file was originally opened. (Yes, some
> > places still get that wrong.) Passing a file descriptor to another
> > task is a delegation of access, and the other task can then call
> > syscalls like read() / write() / mmap() on the file descriptor without
> > needing to have any access to the underlying file.
>
> A mechanism sufficiently entrenched.
It's not just "entrenched", it is a fundamental requirement for being
able to use file descriptor passing with syscalls like write(). If
task A gives a file descriptor to task B, then task B must be able to
write() to that FD without having to worry that the FD actually refers
to some sort of special file that interprets the written data as some
type of command, or something like that, and that this leads to task B
unknowingly passing through access checks.
> > You can't really attribute binder transactions to specific tasks that
> > are actually involved in the specific transaction, neither on the
> > sending side nor on the receiving side, because binder is built around
> > passing data through memory mappings. Memory mappings can be accessed
> > by multiple tasks, and even a task that does not currently have it
> > mapped could e.g. map it at a later time. And on top of that you have
> > the problem that the receiving task might also go through privileged
> > execve() transitions.
>
> OK. I'm curious now as to why the task_struct was being passed to the
> hook in the first place.
Probably because that's what most other LSM hooks looked like and the
authors/reviewers of the patch didn't realize that this model doesn't
really work for binder? FWIW, these hooks were added in commit
79af73079d75 ("Add security hooks to binder and implement the hooks
for SELinux."). The commit message also just talks about "processes".
> And about where you are getting the cred from
> if not a task.
This patch still ultimately gets the creds from a task. But it's not
looking at the *current* credentials of any task, but instead looks at
the credentials that the task that opened /dev/binder had at that
point.
> >>>> It's easy to compare to make sure the tasks are compatible.
> >>> It would be, if you actually had a pair of tasks that accurately
> >>> represent the sender and the recipient.
> >>>
> >>>> Adding the
> >>>> information to the cred would be yet another case where the scope of
> >>>> security information is wrong.
> >>> Can you elaborate on why you think that?
> >> The information identifies how the task is going to display
> >> the security "context". It isn't used in access checks.
> > But it is data that AFAICS needs to be preserved in the same places
> > where the creds need to be preserved, and it is also related to
> > security labels, so isn't "struct cred" a logical place to stuff it
> > anyway?
>
> I am probably the only person on the planet who dislikes shared creds.
> One of the things that made me happiest when I switched from UNIX
> development to Linux was that it didn't have shared creds and all the
> associated management. Oh well. Yes, it could go in the cred.
>
> But that raises another question. Where are the creds coming from?
They are the creds that the task that opened /dev/binder had.
> Is it even rational to make access decisions based on them?
Yes, if you assume that handing file descriptors or memory mappings to
other security contexts constitutes delegation of privilege.
> You've
> explained how SELinux ends up with an Uncle Bob, but that's doesn't
> leave me confident that another security module would be able to
> come up with something sensible.
Just like SELinux, they can stuff information about a task's identity
into the cred security blob. See e.g. AppArmor, which AFAICS also does
that.
> At this point I'm really looking for something that I can put in
> the change log explaining why creds work and task_structs don't.
creds work because they can snapshot the privileges a task had at some
point in time for future security checks. task_structs don't work
because you can't use them to figure out which privileges a task had
in the past (or will have in the future).