On 12/15/2015 11:06 AM, Casey Schaufler wrote:
On 12/15/2015 7:00 AM, Stephen Smalley wrote:
On 12/14/2015 05:57 PM, Roberts, William C wrote:
<snip>

If I understand correctly, the goal here is to avoid the lookup from
pid to context. If we somehow had the context, or a token for a context,
during the IPC transaction to userspace, we could just use that in
computing the access decision. If that is correct, then since we have the
PID, why not just extend the SELinux compute-av-decision interface to
accept a PID and let it do the lookup in the kernel?

That's no less racy than getpidcon().


I got a bounce when I sent this from Gmail, so I'm resending.

True, but in this case the binder transaction would be dead...

Why not just pass the context? It's less than ideal, but it might be good enough
for now, until contexts grow unwieldy in size.

grep -rn '^type ' * | grep domain | cut -d' ' -f 2-2 | sed s/','//g | awk ' {
thislen=length($0); printf("%-5s %d\n", NR, thislen); totlen+=thislen}
END { printf("average: %d\n", totlen/NR); } '

The average type length for domain types in external/sepolicy is 7. Add the
full context around it:

u:r:xxxxxxx:s0(cat)

1. We're looking at maybe 18 or so bytes; how do we know this won't be "fast
enough"?
2. What are the current performance numbers, and what target has been agreed
on as "fast enough"?
3. I'm assuming the use case is in servicemanager, but would a userspace cache
of access-vector decisions (AVDs) help? Then you can (possibly) avoid both
kernel trips, and you can invalidate the cache on policy reload and on PID
death. Or, in the case of servicemanager, would it always miss based on the
usage pattern? I'm assuming callers say "resolve this once" and are done.
Alternatively, you could do only the context lookup in the kernel and compute
the AVD against the policy in userspace, avoiding one of the two trips.
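The userspace cache idea above could be sketched roughly as follows. This is a
minimal illustration, not the real libselinux AVC: the compute_av callback, the
context strings, and the invalidation hooks are all hypothetical stand-ins for
an actual kernel query.

```python
# Minimal sketch of a userspace access-vector-decision (AVD) cache.
# compute_av is a hypothetical stand-in for a real kernel query; here it
# just counts how often the "kernel" is consulted.

class AvdCache:
    def __init__(self, compute_av):
        self._compute_av = compute_av  # callback that asks the kernel
        self._cache = {}               # (scon, tcon, tclass) -> allowed perms

    def has_perm(self, scon, tcon, tclass, perm):
        key = (scon, tcon, tclass)
        if key not in self._cache:
            self._cache[key] = self._compute_av(scon, tcon, tclass)
        return perm in self._cache[key]

    def policy_reload(self):
        # On policy reload (or PID death, for a per-PID cache) every
        # cached decision may be stale, so drop everything.
        self._cache.clear()

kernel_trips = 0
def compute_av(scon, tcon, tclass):
    global kernel_trips
    kernel_trips += 1
    # Pretend the policy allows "find" for this one source domain.
    return {"find"} if scon == "u:r:appdomain:s0" else set()

cache = AvdCache(compute_av)
assert cache.has_perm("u:r:appdomain:s0", "u:object_r:svc:s0",
                      "service_manager", "find")
cache.has_perm("u:r:appdomain:s0", "u:object_r:svc:s0",
               "service_manager", "find")
assert kernel_trips == 1  # second lookup served from the cache
cache.policy_reload()
cache.has_perm("u:r:appdomain:s0", "u:object_r:svc:s0",
               "service_manager", "find")
assert kernel_trips == 2  # reload forced a fresh computation
```

Whether this helps depends on the hit rate; if servicemanager sees each
(source, target) pairing only once, the cache never pays for itself.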

1. I don't think it is the size of the context that is the concern but rather 
the fact that it is a variable-length string, whereas current binder commands 
use fixed-size arguments and encode the size in the command value (common for 
ioctls).  Supporting passing a variable-length string would be a change to the 
protocol and would complicate the code.  On the performance side, it means 
generating a context string from the secid and then copying it to userspace on 
each IPC (versus just getting the secid and putting that in the existing 
binder_transaction_data that is already copied to userspace).

I have long wondered why SELinux generates the context string for a
secid more than once. Audit performance alone would justify keeping
it around. The variable-length issue isn't as difficult as you make
it out to be. As William pointed out earlier, most SELinux contexts
are short. Two protocols, one with a fixed length of 16 chars
(typical is 7) and one with a fixed length of 256 (for abnormal
contexts), would solve the problem while hardly complicating the
code at all.

If it's such a problem, why do we have SO_PEERSEC return a
variable length string? That's been around forever and seems
to work just fine.

Generating an audit record means you are already on the slow path, so adding the cost of generating a context at that point shouldn't be significant (but if one has performance data to the contrary, that would always be of interest). Keeping complete context strings in kernel memory at all times wasn't viewed as desirable, particularly when you consider the next point.

There is no internal limit on SELinux context size and one can in fact generate a SELinux context string that exceeds 256 bytes (just create a category set with many different, non-contiguous categories, e.g. a category set with every other category), so you can't just pick a couple of sizes (16 and 256 in your example) and be done with it. It may be unusual to exceed 256 but it isn't impossible, and users of MLS systems have in fact run up against even the page size limitations of the /proc/self/attr interfaces as a pain point.
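The "every other category" construction is easy to demonstrate; the domain name
below is made up, but the arithmetic is real:

```python
# Build a context with every other category (c0, c2, c4, ...). Non-contiguous
# categories cannot be compressed into the cN.cM range notation, so the
# string grows linearly with the number of categories.
cats = ",".join("c%d" % i for i in range(0, 1024, 2))  # 512 categories
ctx = "u:r:somedomain:s0:" + cats
assert len(ctx) > 256   # far past a 256-byte fixed-size slot
print(len(ctx))
```

So any fixed upper bound you pick can be exceeded by a legal policy; only the
contiguous-range shorthand keeps typical MLS contexts short.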

getsockopt SO_PEERSEC, as I mentioned earlier, is for stream sockets, so you are dealing with a connection-oriented model (with its attendant baseline overhead). Server applications opt into getting the security label after the connection is established via getsockopt, which means that the overhead is not imposed on all connections, and which already provides a straightforward way of passing variable-length data. And all of the payload data is typically being copied through the kernel (to and from socket buffers) anyway.

With binder, we are talking about a synchronous IPC mechanism where the kernel copies the sender information (currently PID and UID) to the receiver on every IPC, the protocol currently passes no variable-length data, and the data is copied directly from the sender's memory into kernel-managed buffers that are mapped read-only into the receiver. So introducing a variable-length string copy to binder is a more substantial change, and the baseline performance characteristics, and thus the relative overhead, could differ.

In any event, I think we can wait for some performance data to show whether this is a real concern or not.
