Re: [malware-list] scanner interface proposal was: [TALPA] Intro toa linux interface for on access scanning

From: david
Date: Mon Aug 18 2008 - 13:29:50 EST


On Mon, 18 Aug 2008, Eric Paris wrote:

But lets talk about a real design and what people want to see.

Userspace program needs to 'register' with a priority. HSMs would want
a low priority on the blocking calls AV Scanners would want a higher
priority and indexers would want a very high priority.

why do you need to introduce a priority mechanism for notifying scanners? just notify them all and let them take action at their own rate for async notifications. if you are waiting for their result you need to invoke them directly, but whether you do it in order or in parallel will depend on the config. this is an optmization problem that the kernel should not be trying to figure out becouse the right answer is policy dependant (if they all need to bless the file for it to be accepted then fire them off in parallel (system resources allowing), if one accepting it is good enough you may want to just run that one and see if you get lucky rather then eating up resources of all the others)

On async notification we fire a message to everything that registered
'simultaneously.' On blocking we fire a message to everything in
priority order and block until we get a response. That response should
be of the form ALLOW/DENY and should include "mark result"/"don't mark
result."

why in the world would you block for an _async_ notification mechanism?

If everything responds with ALLOW/"mark result" we will flip a bit IN
CORE so operations on that inode are free from then on. If any program
responds with DENY/"mark result" we will flip the negative bit IN CORE
so deny operations on the inode are free from then on.

this requires syncronous scanning by all scanners.

Userspace 'scanners' if intelligent should have set a timespace in a
particular xattr of their choosing to do their own userspace results
caching to speed up things if the inode is evicted from core. This
means that the 'normal' flow of operations for an inode will look like:

open -> async to userspace -> userspace scans and writes timestamp

read -> blocking to userspace -> userspace checks xattr timestamp and
mtime and responds with ALLOW/"mark result"

you can't trust timestamps, they go forwareds and backwords. they need to have some sort of 'generation id' but don't try to define meaning for it, leave that to the scanner. have everything else treat it as a simple "it matches" or "it doesn't match"

read -> we have the ALLOW/mark result bit in core set so just allow.

mtime update -> clear ALLOW/"mark result" bit in core, send async
notification to userspace

you keep planning to do this with a single allow mark. it may not be that simple.

close -> send async notification to userspace

as several others have noted, alerting on close is not good enough, we need to alert on the scanned->dirty transition (by the way, this contridicts the part of your message I snipped where you were advocating notification on every write)

If some general xattr namespace is agreed upon for such a thing someday
a patch may be acceptable to clear that namespace on mtime update, but I
don't plan to do that at this time since comparing the timestamp in the
xattr vs mtime should be good enough.

if you are already accessing xattrs, why not just use the value rather then trying to make it into a time?


******************************

Great, how to build this interface. THIS IS WHAT I ACTUALLY CARE ABOUT

The communication with userspace has a very specific need. The scanning
process needs to get 'something' that will give it access to the
original file/inode/data being worked on. My previous patch set does
this with a special securityfs file. Scanners would block on 'read.'
This block was waiting for something to be scanned and when available a
dentry_open() was called in the context of the scanner for the inode in
question. This means that the fd in the scanner had to be the same data
as the fd in the original process.

having scanners access a file blocking on read won't work for multiple scanners (unless you are going to create multiple files for them to read)

If people want me to use something like netlink to send async
notifications to the scanner how do I also get the file/inode/data to
the scanning process? Can anyone think of a better/cleaner method to
get a file descriptor into the context of the scanner other than having
the scanner block/poll on a special file inside the securityfs?

this is easy, the userspace library (libmalware or glibc) intercepts the open and is invoking the scanners if the checks tell it to. they can send the file descripter over a unix socket on the machine to a scanner daemon, or they can invoke the scanner in the existing user context.

David Lang
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/