kernel-ark/net
Dipankar Sarma 529bf6be5c [PATCH] fix file counting
I have benchmarked this on an x86_64 NUMA system and see no significant
performance difference on kernbench.  Tested on both x86_64 and powerpc.

The way we do file struct accounting is not very suitable for batched
freeing.  For scalability reasons, file accounting was
constructor/destructor based.  This meant that nr_files was decremented
only when the object was removed from the slab cache.  This is susceptible
to slab fragmentation.  With RCU-based file structures, the consequent batched
freeing, and a test program like Serge's, we just speed this up and end up
with a very fragmented slab:

llm22:~ # cat /proc/sys/fs/file-nr
587730  0       758844

At the same time, I see only 2000+ objects in the filp cache.  The following
patch fixes this problem.

This patch changes the file counting by removing the filp_count_lock.
Instead we use a separate percpu counter, nr_files, for now, and all
accesses to it go through the get_nr_files() API.  In the sysctl handler for
nr_files, we populate files_stat.nr_files before returning to user.

Counting files as and when they are created and destroyed (as opposed to
inside slab) allows us to correctly count open files with RCU.
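
The counting scheme described above can be illustrated with a small userspace
sketch.  It is not the kernel code from this patch: the sketch_* names, the
slot count and the batch threshold are all hypothetical, and the "slots" only
stand in for CPUs.  It shows the general idea of counting at create/destroy
time through per-slot deltas that are folded into a shared total, read through
one accessor in the spirit of get_nr_files().

```c
#include <assert.h>

#define SKETCH_NR_SLOTS 4   /* hypothetical stand-in for the number of CPUs */
#define SKETCH_BATCH    8   /* hypothetical threshold for folding a delta */

struct sketch_counter {
    long global;                    /* folded total, cheap to read */
    long local[SKETCH_NR_SLOTS];    /* per-slot deltas not yet folded */
};

/* Apply delta on one slot; fold into the shared total once it grows large. */
static void sketch_mod(struct sketch_counter *c, int slot, long delta)
{
    assert(slot >= 0 && slot < SKETCH_NR_SLOTS);
    long v = c->local[slot] + delta;

    if (v >= SKETCH_BATCH || v <= -SKETCH_BATCH) {
        c->global += v;
        c->local[slot] = 0;
    } else {
        c->local[slot] = v;
    }
}

/* Count a file in when it is created, independent of slab construction. */
static void sketch_file_alloc(struct sketch_counter *c, int slot)
{
    sketch_mod(c, slot, 1);
}

/* Count a file out when it is destroyed, even if its memory is freed later. */
static void sketch_file_free(struct sketch_counter *c, int slot)
{
    sketch_mod(c, slot, -1);
}

/* Approximate read: only the folded total (assumed accessor shape). */
static long sketch_get_nr_files(const struct sketch_counter *c)
{
    return c->global;
}

/* Exact read: folded total plus every outstanding per-slot delta. */
static long sketch_sum(const struct sketch_counter *c)
{
    long sum = c->global;

    for (int i = 0; i < SKETCH_NR_SLOTS; i++)
        sum += c->local[i];
    return sum;
}
```

In this sketch the count drops as soon as sketch_file_free() runs, so a
fragmented object cache cannot inflate the reported number.  The approximate
read can lag the exact sum by less than SKETCH_NR_SLOTS * SKETCH_BATCH.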

Signed-off-by: Dipankar Sarma <dipankar@in.ibm.com>
Cc: "Paul E. McKenney" <paulmck@us.ibm.com>
Cc: "David S. Miller" <davem@davemloft.net>
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>
2006-03-08 14:14:01 -08:00
..
802 [P8023]: Fix tainting of kernel. 2006-02-13 15:38:42 -08:00
8021q
appletalk
atm [ATM]: keep atmsvc failure messages quiet 2006-03-04 21:06:25 -08:00
ax25
bluetooth [Bluetooth] Fix NULL pointer dereferences of the HCI socket 2006-02-13 11:40:03 +01:00
bridge [BRIDGE]: generate kobject remove event 2006-03-04 21:06:23 -08:00
core [REQSK]: Don't reset rskq_defer_accept in reqsk_queue_alloc 2006-02-27 13:30:43 -08:00
dccp [DCCP] ccid3: Divide by zero fix 2006-03-04 21:06:29 -08:00
decnet
econet
ethernet [NET] ethernet: Fix first packet goes out with MAC 00:00:00:00:00:00 2006-02-23 16:18:01 -08:00
ieee80211 [PATCH] ieee80211_rx.c: is_beacon 2006-02-27 20:12:02 -05:00
ipv4 [NETFILTER] ip_queue: Fix wrong skb->len == nlmsg_len assumption 2006-03-07 14:56:12 -08:00
ipv6 [NETFILTER] ip_queue: Fix wrong skb->len == nlmsg_len assumption 2006-03-07 14:56:12 -08:00
ipx
irda [IRDA]: out of range array access 2006-02-09 16:59:48 -08:00
key [XFRM]: Eliminate refcounting confusion by creating __xfrm_state_put(). 2006-02-23 16:10:53 -08:00
lapb
llc
netfilter [NETFILTER]: nf_queue: fix end-of-list check 2006-02-27 13:03:55 -08:00
netlink [NETLINK] genetlink: Fix bugs spotted by Andrew Morton. 2006-02-13 15:51:24 -08:00
netrom
packet
rose
rxrpc
sched
sctp [SCTP]: Fix 'fast retransmit' to send a TSN only once. 2006-02-02 16:57:31 -08:00
sunrpc SUNRPC: Move upcall out of auth->au_ops->crcreate() 2006-02-01 12:52:25 -05:00
tipc
unix [PATCH] fix file counting 2006-03-08 14:14:01 -08:00
wanrouter
x25
xfrm [IPSEC]: Kill post_input hook and do NAT-T in esp_input directly 2006-02-27 13:00:40 -08:00
compat.c
Kconfig [NET]: Add CONFIG_NETDEBUG to suppress bad packet messages. 2006-02-02 20:40:09 -08:00
Makefile
nonet.c
socket.c [PATCH] percpu data: only iterate over possible CPUs 2006-02-05 11:06:51 -08:00
sysctl_net.c
TUNABLE