So I am using spin_lock_irqsave now to access shared data. Shared
data being the hints that are accessed in do_select(), do_poll() and
the network stack. The irqsave version of spin_lock is supposed to be
very slow but just using spin_lock leads to deadlocks. Is there any
other way that this could be handled?
The test system I have been using for benchmarking is a 4-way Xeon
450Mhz. When I run httpperf from another machine connected via 100
Mbit ethernet, the reply rate for requests never seems to exceed 1200
r/s. I noticed that there are never more than 4000 sockets in the
system. Is there a way to increase this number? Also most of the
sockets are in TIME_WAIT and readprofile shows that a large amount of
the CPU time is spent in the tcp_timewait_* functions.
I have implemented the hinting system on top of 2.2.3-ac4.
Any ideas?
Greetings,
Niels.
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.rutgers.edu
Please read the FAQ at http://www.tux.org/lkml/