Commits · fdba8dd20e328b07327105788e4802203120ca98 · Kirill Smelkov / linux

26 Sep, 2002 24 commits
- Merge · fdba8dd2
  Christoph Hellwig authored Sep 25, 2002
  
  fdba8dd2
- XFS: Avoid writing data out to disk twice! · 919dc9e2
  Stephen Lord authored Sep 27, 2002
```
Modid: 2.5.x-xfs:slinx:128467a
```
  919dc9e2
- XFS: Implement readv/writev · b1eaf041
  Christoph Hellwig authored Sep 27, 2002
```
Modid: 2.5.x-xfs:slinx:128366a
```
  b1eaf041
- XFS: Remove unused function xfs_vn_iget() · 20cebe45
  Eric Sandeen authored Sep 27, 2002
```
Modid: 2.5.x-xfs:slinx:128363a
```
  20cebe45
- XFS: Fold some code paths together in the xfs fsync implementation. · ca825dbc
  Stephen Lord authored Sep 27, 2002
```
Modid: 2.5.x-xfs:slinx:128239a
```
  ca825dbc
- XFS: Fix the mount-cleanup for single-subvolume filesystems. · 7206852c
  Christoph Hellwig authored Sep 27, 2002
```
Modid: 2.5.x-xfs:slinx:128192a
```
  7206852c
- XFS: More mount code cleanups · 1cb2d696
  Christoph Hellwig authored Sep 27, 2002
```
Modid: 2.5.x-xfs:slinx:128159a
```
  1cb2d696
- XFS: Switch to mpage_readpage · f6ce5f1a
  Christoph Hellwig authored Sep 27, 2002
```
Modid: 2.5.x-xfs:slinx:127994a
```
  f6ce5f1a
- XFS: XFS: Remove some dead prototypes in pagebuf · 9b594f1e
  Christoph Hellwig authored Sep 27, 2002
```
Modid: 2.5.x-xfs:slinx:127896a
```
  9b594f1e
- XFS: XFS: Cleanup mount argument manipulation, sanitize xfs_cmountfs and move the · 1f11423a
  Nathan Scott authored Sep 27, 2002
```
Modid: 2.5.x-xfs:slinx:127944a
```
  1f11423a
- XFS: XFS: Simplify xfs_dir_lookup_int · c4caee27
  Christoph Hellwig authored Sep 27, 2002
```
Modid: 2.5.x-xfs:slinx:127879a
```
  c4caee27
- XFS: XFS: Sanitize some names in xfs_aops.c, especially a less offending name for linvfs_pb_bmap · 5a579d2f
  Christoph Hellwig authored Sep 27, 2002
```
Modid: 2.5.x-xfs:slinx:127872a
```
  5a579d2f
- XFS: XFS: Make pagebuf use the generic xfs ASSERT() instead of it's own assert() · 78330667
  Christoph Hellwig authored Sep 27, 2002
```
Modid: 2.5.x-xfs:slinx:127736a
```
  78330667
- XFS: Don't include <asm/softirq.h> in page_buf.c · 6cd43d77
  Christoph Hellwig authored Sep 27, 2002
```
Modid: 2.5.x-xfs:slinx:127734a
```
  6cd43d77
- XFS: Small comment corrections/updates · 5336b84a
  Christoph Hellwig authored Sep 27, 2002
```
Modid: 2.5.x-xfs:slinx:127729a
```
  5336b84a
- XFS: XFS: Use do_gettimeofday() instead of racy direct access to xtime · d5ffce3b
  Christoph Hellwig authored Sep 27, 2002
```
Modid: 2.5.x-xfs:slinx:127568a
```
  d5ffce3b
- [PATCH] export cpu_callout_map for SMP modules · 463b2b8d
  Rusty Russell authored Sep 25, 2002
  
  463b2b8d
- [PATCH] UP cpu_possible · 697dca94
  Rusty Russell authored Sep 25, 2002
```
This patch defines cpu_possible() for non-SMP.
```
  697dca94
- Merge http://gkernel.bkbits.net/misc-2.5 · b0538e0e
  Linus Torvalds authored Sep 25, 2002
```
into home.transmeta.com:/home/torvalds/v2.5/linux
```
  b0538e0e
- Merge http://gkernel.bkbits.net/net-drivers-2.5 · 7beeab1a
  Linus Torvalds authored Sep 25, 2002
```
into home.transmeta.com:/home/torvalds/v2.5/linux
```
  7beeab1a
- Merge http://gkernel.bkbits.net/irda-2.5 · 887e8e0f
  Linus Torvalds authored Sep 25, 2002
```
into home.transmeta.com:/home/torvalds/v2.5/linux
```
  887e8e0f
- Merge http://gkernel.bkbits.net/i2c-2.5 · bef9636e
  Linus Torvalds authored Sep 25, 2002
```
into home.transmeta.com:/home/torvalds/v2.5/linux
```
  bef9636e
- [PATCH] deadline ioscheduler cleanups · 82c8a4cc
  Jens Axboe authored Sep 25, 2002
```
Some various small cleanups, optimizations, and fixes.

o Make fifo_batch=32 as default, from testing this appears a good
  default value. We still get good throughput, and latency is good.

o Reintroduce the merge_cleanup logic. We need it for deadline for
  rehashing requests when they have been merged.

o Cleanup last_merge logic. Move it to the new elv_merged_request(),
  this is where it really belongs. Doing it inside the io scheduler core
  can causes false positives, when the queue merge functions reject an
  otherwise good merge

o Have deadline_move_requests() account from last entry on the dispatch
  queue, if it is non-empty. It doesn't really matter what the last
  extracted sector was, if we are not right behind it.

o Clean/optimize deadline_move_requests()

o Account size of a request just a little bit. Streaming transfer isn't
  for free, it's just a lot cheaper than a seek.

o Make deadline_check_fifo() more readable.
```
  82c8a4cc
- [PATCH] exit-fix-2.5.38-F0 · 9b34eb70
  Ingo Molnar authored Sep 25, 2002
```
From Andrew Morton.

There are a couple of places where we would enable interrupts while
write-holding the tasklist_lock ...  nasty.
```
  9b34eb70
25 Sep, 2002 16 commits

Merge mandrakesoft.com:/home/jgarzik/repo/linus-2.5 · 7878f411
Jeff Garzik authored Sep 25, 2002
```
into mandrakesoft.com:/home/jgarzik/repo/misc-2.5
```
7878f411
Merge mandrakesoft.com:/home/jgarzik/repo/linus-2.5 · 94df6c69
Jeff Garzik authored Sep 25, 2002
```
into mandrakesoft.com:/home/jgarzik/repo/irda-2.5
```
94df6c69
Merge mandrakesoft.com:/home/jgarzik/repo/linus-2.5 · 664a55c9
Jeff Garzik authored Sep 25, 2002
```
into mandrakesoft.com:/home/jgarzik/repo/net-drivers-2.5
```
664a55c9
i2c core/dev/proc cleanups, and a proc-related fix · d5f3435a
Albert Cranford authored Sep 25, 2002

d5f3435a
Merge bk://ldm.bkbits.net/linux-2.5 · 2ce067b0
Linus Torvalds authored Sep 25, 2002
```
into home.transmeta.com:/home/torvalds/v2.5/linux
```
2ce067b0

[PATCH] tighter locking in pdflush · 4ab1a3e6

Andrew Morton authored Sep 25, 2002

Had a weird oops from Bill Irwin - the pdflush_list was corrupt.

The only thing I can think of is that something sprayed out a wakeup
when it shouldn't.  So tighten things up against that, and add some
printks to catch it if it happens again.

4ab1a3e6

[PATCH] speed up sys_sync() · 57eb0613

Andrew Morton authored Sep 25, 2002

Well it's a one-liner.  sys_sync() only syncs one queue at a time, and
can be slow if you have a lot of disks.  So poke pdflush, which knows
how to write all the queues in parallel.

57eb0613

[PATCH] increase traffic on linux-kernel · 4f3e8109

Andrew Morton authored Sep 25, 2002

[This has four scalps already.  Thomas Molina has agreed
 to track things as they are identified ]

Infrastructure to detect sleep-inside-spinlock bugs.  Really only
useful if compiled with CONFIG_PREEMPT=y.  It prints out a whiny
message and a stack backtrace if someone calls a function which might
sleep from within an atomic region.

This patch generates a storm of output at boot, due to
drivers/ide/ide-probe.c:init_irq() calling lots of things which it
shouldn't under ide_lock.

It'll find other bugs too.

4f3e8109

[PATCH] slab reclaim balancing · b65bbded

Andrew Morton authored Sep 25, 2002

A patch from Ed Tomlinson which improves the way in which the kernel
reclaims slab objects.

The theory is: a cached object's usefulness is measured in terms of the
number of disk seeks which it saves.  Furthermore, we assume that one
dentry or inode saves as many seeks as one pagecache page.

So we reap slab objects at the same rate as we reclaim pages.  For each
1% of reclaimed pagecache we reclaim 1% of slab.  (Actually, we _scan_
1% of slab for each 1% of scanned pages).

Furthermore we assume that one swapout costs twice as many seeks as one
pagecache page, and twice as many seeks as one slab object.  So we
double the pressure on slab when anonymous pages are being considered
for eviction.

The code works nicely, and smoothly.  Possibly it does not shrink slab
hard enough, but that is now very easy to tune up and down.  It is just:

	ratio *= 3;

in shrink_caches().

Slab caches no longer hold onto completely empty pages.  Instead, pages
are freed as soon as they have zero objects.  This is possibly a
performance hit for slabs which have constructors, but it's doubtful.
Most allocations after a batch of frees are satisfied from inside
internally-fragmented pages and by the time slab gets back onto using
the wholly-empty pages they'll be cache-cold.  slab would be better off
going and requesting a new, cache-warm page and reconstructing the
objects therein.  (Once we have the per-cpu hot-page allocator in
place.  It's happening).

As a consequence of the above, kmem_cache_shrink() is now unused.  No
great loss there - the serialising effect of kmem_cache_shrink and its
semaphore in front of page reclaim was measurably bad.

Still todo:

- batch up the shrinking so we don't call into prune_dcache and
  friends at high frequency asking for a tiny number of objects.

- Maybe expose the shrink ratio via a tunable.

- clean up slab.c

- highmem page reclaim in prune_icache: highmem pages can pin
  inodes.

b65bbded

[PATCH] use prepare_to_wait in VM/VFS · dfdacf59

Andrew Morton authored Sep 25, 2002

This uses the new wakeup machinery in some hot parts of the VFS and
block layers.

wait_on_buffer(), wait_on_page(), lock_page(), blk_congestion_wait().
Also in get_request_wait(), although the benefit for exclusive wakeups
will be lower.

dfdacf59

[PATCH] prepare_to_wait/finish_wait sleep/wakeup API · 3da08d6c

Andrew Morton authored Sep 25, 2002

This is worth a whopping 2% on spwecweb on an 8-way.  Which is faintly
surprising because __wake_up and other wait/wakeup functions are not
apparent in the specweb profiles which I've seen.


The main objective of this is to reduce the CPU cost of the wait/wakeup
operation.  When a task is woken up, its waitqueue is removed from the
waitqueue_head by the waker (ie: immediately), rather than by the woken
process.

This means that a subsequent wakeup does not need to revisit the
just-woken task.  It also means that the just-woken task does not need
to take the waitqueue_head's lock, which may well reside in another
CPU's cache.

I have no decent measurements on the effect of this change - possibly a
20-30% drop in __wake_up cost in Badari's 40-dds-to-40-disks test (it
was the most expensive function), but it's inconclusive.  And no
quantitative testing of which I am aware has been performed by
networking people.

The API is very simple to use (Linus thought it up):

my_func(waitqueue_head_t *wqh)
{
	DEFINE_WAIT(wait);

	prepare_to_wait(wqh, &wait, TASK_UNINTERRUPTIBLE);
	if (!some_test)
		schedule();
	finish_wait(wqh, &wait);
}

or:

	DEFINE_WAIT(wait);

	while (!some_test_1) {
		prepare_to_wait(wqh, &wait, TASK_UNINTERRUPTIBLE);
		if (!some_test_2)
			schedule();
		...
	}
	finish_wait(wqh, &wait);

You need to bear in mind that once prepare_to_wait has been performed,
your task could be removed from the waitqueue_head and placed into
TASK_RUNNING at any time.  You don't know whether or not you're still
on the waitqueue_head.

Running prepare_to_wait() when you're already on the waitqueue_head is
fine - it will do the right thing.

Running finish_wait() when you're actually not on the waitqueue_head is
fine.

Running finish_wait() when you've _never_ been on the waitqueue_head is
fine, as ling as the DEFINE_WAIT() macro was used to initialise the
waitqueue.

You don't need to fiddle with current->state.  prepare_to_wait() and
finish_wait() will do that.  finish_wait() will always return in state
TASK_RUNNING.

There are plenty of usage examples in vm-wakeups.patch and
tcp-wakeups.patch.

3da08d6c

[PATCH] mprotect_fixup fix · 02b1783c

Andrew Morton authored Sep 25, 2002

From David M-T.

When this function successfully merges the new range into an existing
VMA, it forgets to extend the new protection mode into the just-merged
pages.

02b1783c

[PATCH] hugetlb fix · 5538fdaa

Andrew Morton authored Sep 25, 2002

Patch from Rohit Seth

It fixes the problem which Andrea noted in his initial review of the
hugetlb code:

"In short doing "addr = vma->vm_end" and then checking if vm_end + len
 is below vm_next->vm_start is broken, because there's no guarantee
 that "addr" will be a largepage aligned address.  the LPAGE_ALIGN in
 found_addr should be dropped becaue moving the addr ahead without
 checking that addr+len doesn't then fall into a vma, will generate
 do_munmaps and in turn userspace mem corruption."

5538fdaa

[PATCH] NUMA-Q fixes · bce5aeb5

Martin J. Bligh authored Sep 25, 2002

 - Remove the const that someone incorrectly stuck in there, it type conflicts.
   Alan has a better plan for fixing this long term, but this fixes the compile
   warning for now.

 - Move the printk of the xquad_portio setup *after* we put something in the variable
   so it actually prints something useful, not 0 ;-)

 - To derive the size of the xquad_portio area, multiply the number of nodes by the
   size of each nodes, not the size of two nodes (and remove define). Doh!

bce5aeb5

Remove busy-wait for short RT nanosleeps. It's a random special case · 98ae8e2b
Linus Torvalds authored Sep 25, 2002
```
and does the wrong thing for higher HZ values anyway.
```
98ae8e2b
Merge bk://ldm@bkbits.net/linux-2.5 · 56d8b39d
Patrick Mochel authored Sep 25, 2002
```
into osdl.org:/home/mochel/src/kernel/devel/linux-2.5
```
56d8b39d