Commits · b6b518079e3793de9f1bb1bc63a34e17b9e36ccc · nexedi / linux

05 Oct, 2002 19 commits

[PATCH] ubd switched to alloc_disk() · b6b51807
Alexander Viro authored Oct 05, 2002

b6b51807
[PATCH] dasd switched to alloc_disk() · 91e7ecdc
Alexander Viro authored Oct 05, 2002

91e7ecdc
[PATCH] nbd switched to alloc_disk() · 529ba807
Alexander Viro authored Oct 05, 2002

529ba807
[PATCH] stram/z2ram switched to alloc_disk() · f53197de
Alexander Viro authored Oct 05, 2002

f53197de
[PATCH] i2o switched to alloc_disk() · 2b6df45e
Alexander Viro authored Oct 05, 2002

2b6df45e
[PATCH] acorn mfm switched to alloc_disk() · e102c579
Alexander Viro authored Oct 05, 2002

e102c579
[PATCH] xd switched to alloc_disk() · 9668d370
Alexander Viro authored Oct 05, 2002

9668d370
[PATCH] ps2esdi switched to alloc_disk() · 950dd003
Alexander Viro authored Oct 05, 2002

950dd003
[PATCH] umem switched to alloc_disk() · be95d9fb
Alexander Viro authored Oct 05, 2002

be95d9fb
[PATCH] initrd fix (missing set_capacity) · 7afbe994
Alexander Viro authored Oct 05, 2002

7afbe994
[PATCH] pcd switched to alloc_disk() · af0e4bd3
Alexander Viro authored Oct 05, 2002

af0e4bd3
[PATCH] fix sgalaxy.c driver cli/sti code. · 0163a9e3
Jaroslav Kysela authored Oct 05, 2002

0163a9e3
Merge http://linux-isdn.bkbits.net/linux-2.5.make · 4d4e2fe1
Linus Torvalds authored Oct 05, 2002
```
into home.transmeta.com:/home/torvalds/v2.5/linux
```
4d4e2fe1

kbuild: Put .bss back to the end of vmlinux · 345af2c9

Kai Germaschewski authored Oct 05, 2002

The kallsyms patches added __kallsyms as last section into vmlinux,
behind .bss.

This was done to save two additional kallsyms passes, since as the
added section was last, it did not change the symbols before it.

With the new infrastructure in the top-level Makefile, we do not need
to do full relinks for these passes, so they are cheaper. We now
use one additional link/kallsyms run to be able to place the __kallsyms
section before .bss. The other pass is saved by adding an empty but 
allocated __kallsyms section in kernel/kallsyms.c, so the first kallsyms
pass already generates a section of the final size.

345af2c9

kbuild: Generalize adding of additional sections to vmlinux · 8cc7a297

Kai Germaschewski authored Oct 05, 2002

kallsyms needs to actually have a final vmlinux to extract the symbols,
and then add this information as a new section to the final vmlinux.

Currently, we basically just do the vmlinux link twice, adding
.tmp_kallsyms.o the second time. However, it's actually possible to just
link together the temporary vmlinux generated the first time and the
new object file directly without going back to all the single parts
that the temporary vmlinux was linked from.

This mechanism should be useful for sparc as well, where the btfix
mechanism needs an already linked vmlinux, too.

IMPORTANT: This does only work as desired if the link script can be
used recursively, i.e.

ld <flags> -T arch/$(ARCH)/vmlinux.lds.s -o vmlinux.test vmlinux

generates a vmlinux.test which is identical to vmlinux.
arch/i386/vmlinux.lds.S needed a little tweaking, so probably the
other archs do as well.

8cc7a297

Merge tp1.ruhr-uni-bochum.de:/home/kai/src/kernel/v2.5/linux-2.5 · 91990be9
Kai Germaschewski authored Oct 05, 2002
```
into tp1.ruhr-uni-bochum.de:/home/kai/src/kernel/v2.5/linux-2.5.make
```
91990be9

kbuild: Don't descend into arch/i386/boot · abcdaf4b

Kai Germaschewski authored Oct 05, 2002

We don't descend anymore when building vmlinux, so don't do so for
the i386 specific boot targets, either.

Plus, more cleanup in arch/i386/Makefile

abcdaf4b

Increase the delay in waiting for pcmcia drivers to register. · b74d4bcc

Linus Torvalds authored Oct 05, 2002

Reported by Peter Osterlund.

(Yeah, the real fix would be to make driver services not have to
know about low-level pcmcia core drivers beforehand, but that's not
life as we know it right now).

b74d4bcc

Merge bk://bk.arm.linux.org.uk · 6c3b738c
Linus Torvalds authored Oct 05, 2002
```
into home.transmeta.com:/home/torvalds/v2.5/linux
```
6c3b738c

06 Oct, 2002 2 commits

[SERIAL] Fix serial includes for modversions/modules. · 9700fe23
Russell King authored Oct 06, 2002
```
This fixes the build error that occurs if you have a certain selection
of module/modversions settings.
```
9700fe23

[SERIAL] Allow PCMCIA serial cards to work again. · ae817269

Russell King authored Oct 06, 2002

The PCMCIA layer claims the IO or memory regions for all cards.  This
means that any port registered via 8250_cs must not cause the 8250
code to claim the resources itself.

We also add support for iomem-based ports at initialisation time for
PPC.

ae817269

05 Oct, 2002 19 commits

kbuild: Nicer warnings · 56a8f5d4

Kai Germaschewski authored Oct 05, 2002

Improve the warning messages when using obsolete features, kill one
remaining user of $(list-multi)

(by Sam Ravnborg)

I also made O_TARGET != built-in.o an error, since compatibility code for
that case has already been dropped

56a8f5d4

Merge bk://linux-bt.bkbits.net/bt-2.5 · 6cab0e06
Linus Torvalds authored Oct 05, 2002
```
into home.transmeta.com:/home/torvalds/v2.5/linux
```
6cab0e06

[PATCH] clean up ll_rw_block() · 61c4b8fb

Andrew Morton authored Oct 04, 2002

Hardly anything uses this function, so the debug checks in there are
not of much value.

The check for bdev_readonly() should be done in submit_bio().

Local variable `major' was altogether unused.

61c4b8fb

[PATCH] stricter dirty memory clamping · 3669e824

Andrew Morton authored Oct 04, 2002

The ratelimiting logic in balance_dirty_pages_ratelimited() is designed
to prevent excessive calls to the expensive get_page_state(): On a big
machine we only check to see if we're over dirty memory limits once per
1024 dirtyings per cpu.

This works OK normally, but it has the effect of allowing each process
to go 1024 pages over the dirty limit before it gets throttled.

So if someone runs 16000 tiobench threads, they can go 16G over the
dirty memory threshold and die the death of buffer_head consumption.
Because page dirtiness pins the page's buffer_heads, defeating the
special buffer_head reclaim logic.

I'd left this overshoot artifact in place because it provides a degree
of adaptivity - of someone if running hundreds of dirtying processes
(dbench!) then they do want to overshoot the dirty memory limit.

But it's hard to balance, and is really not worth the futzing around.
So change the logic to only perform the get_page_state() call rate
limiting if we're known to be under the dirty memory threshold.

3669e824

[PATCH] remove page->virtual · a27efcaf

Andrew Morton authored Oct 04, 2002

The patch removes page->virtual for all architectures which do not
define WANT_PAGE_VIRTUAL.  Hash for it instead.

Possibly we could define WANT_PAGE_VIRTUAL for CONFIG_HIGHMEM4G, but it
seems unlikely.

A lot of the pressure went off kmap() and page_address() as a result of
the move to kmap_atomic().  That should be the preferred way to address
CPU load in the set_page_address() and page_address() hashing and
locking.

If kmap_atomic is not usable then the next best approach is for users
to cache the result of kmap() in a local rather than calling
page_address() repeatedly.

One heavy user of kmap() and page_address() is the ext2 directory code.

On a 7G Quad PIII, running four concurrent instances of

	while true
	do
		find /usr/src/linux > /dev/null
	done

on ext2 with everything cached, profiling shows that the new hashed
set_page_address() and page_address() implementations consume 0.4% and
1.3% of CPU time respectively.   I think that's OK.

a27efcaf

[PATCH] use buffer_boundary() for writeback scheduling hints · 343893e6

Andrew Morton authored Oct 04, 2002

This is the replacement for write_mapping_buffers().

Whenever the mpage code sees that it has just written a block which had
buffer_boundary() set, it assumes that the next block is dirty
filesystem metadata.  (This is a good assumption - that's what
buffer_boundary is for).

So we do a lookup in the blockdev mapping for the next block and it if
is present and dirty, then schedule it for IO.

So the indirect blocks in the blockdev mapping get merged with the data
blocks in the file mapping.

This is a bit more general than the write_mapping_buffers() approach.
write_mapping_buffers() required that the fs carefully maintain the
correct buffers on the mapping->private_list, and that the fs call
write_mapping_buffers(), and the implementation was generally rather
yuk.

This version will "just work" for filesystems which implement
buffer_boundary correctly.  Currently this is ext2, ext3 and some
not-yet-merged reiserfs patches.  JFS implements buffer_boundary() but
does not use ext2-like layouts - so there will be no change there.

Works nicely.

343893e6

[PATCH] remove write_mapping_buffers() · 4ac833da

Andrew Morton authored Oct 04, 2002

When the global buffer LRU was present, dirty ext2 indirect blocks were
automatically scheduled for writeback alongside their data.

I added write_mapping_buffers() to replace this - the idea was to
schedule the indirects close in time to the scheduling of their data.

It works OK for small-to-medium sized files but for large, linear writes
it doesn't work: the request queue is completely full of file data and
when we later come to scheduling the indirects, their neighbouring data
has already been written.

So writeback of really huge files tends to be a bit seeky.

So. Kill it. Will fix this problem by other means.

4ac833da

[PATCH] use bio_get_nr_vecs() for sizing direct-io BIOs · e3b12fc1

Andrew Morton authored Oct 04, 2002

From Badari Pulavarty.

Rather than allocating maximum-sized BIOs, use the new
bio_get_nr_vecs() hint when sizing the BIOs.

Also keep track of the approximate upper-bound on the number of pages
remaining to do, so we can again avoid allocating excessively-sized
BIOs.

e3b12fc1

[PATCH] Documentation/filesystems/ext3.txt · 6fb75ca4
Andrew Morton authored Oct 04, 2002
```
By Vincent Hanquez <tab@tuxfamily.org>
```
6fb75ca4
[PATCH] use bio_get_nr_vecs() hint for pagecache writeback · f2b01f8b
Andrew Morton authored Oct 04, 2002
```
Use the bio_get_nr_pages() hint for sizing the BIOs which writeback
allocates.
```
f2b01f8b

[PATCH] fix reclaim for higher-order allocations · 3209a954

Andrew Morton authored Oct 04, 2002

The page reclaim logic will bail out if all zones are at pages_high.
But if the caller is requesting a higher-order allocation we need to go
on and free more memory anyway.  That's the only way we have of
addressing buddy fragmentation.

3209a954

[PATCH] separation of direct-reclaim and kswapd functions · bf3f607a

Andrew Morton authored Oct 04, 2002

There is some lack of clarity in what kswapd does and what
direct-reclaim tasks do; try_to_free_pages() tries to service both
functions, and they are different.

- kswapd's role is to keep all zones on its node at

	zone->free_pages >= zone->pages_high.

  and to never stop as long as any zones do not meet that condition.

- A direct reclaimer's role is to try to free some pages from the
  zones which are suitable for this particular allocation request, and
  to return when that has been achieved, or when all the relevant zones
  are at

	zone->free_pages >= zone->pages_high.

The patch explicitly separates these two code paths; kswapd does not
run try_to_free_pages() any more.  kswapd should not be aware of zone
fallbacks.

bf3f607a

[PATCH] mempool wakeup fix · fe66ad33

Andrew Morton authored Oct 04, 2002

When the mempool is empty, tasks wait on the waitqueue in "exclusive
mode".  So one task is woken for each returned element.

But if the number of tasks which are waiting exceeds the mempool's
specified size (min_nr), mempool_free() ends up deciding that as the
pool is fully replenished, there cannot possibly be anyone waiting for
more elements.

But with 16384 threads running tiobench, it happens.

We could fix this with a waitqueue_active() test in mempool_free().
But rather than adding that test to this fastpath I changed the wait to
be non-exclusive, and used the prepare_to_wait/finish_wait API, which
will be quite beneficial in this case.

Also, convert the schedule() in mempool_alloc() to an io_schedule(), so
this sleep time is accounted as "IO wait".  Which is a bit approximate
- we don't _know_ that the caller is really waiting for IO completion.
But for most current users of mempools, io_schedule() is more accurate
than schedule() here.

fe66ad33

[PATCH] O_DIRECT invalidation fix · a7634cff

Andrew Morton authored Oct 04, 2002

If the alignment checks in generic_direct_IO() fail, we end up not
forcing writeback of dirty pagecache pages, but we still run
invalidate_inode_pages2().  The net result is that dirty pagecache gets
incorrectly removed.  I guess this will expose unwritten disk blocks.

So move the sync up into generic_file_direct_IO(), where we perform the
invalidation.  So we know that pagecache and disk are in sync before we
do anything else.

a7634cff

[PATCH] truncate fixes · 911ceab5

Andrew Morton authored Oct 04, 2002

The new truncate code needs to check page->mapping after acquiring the
page lock.  Because the page could have been unmapped by page reclaim
or by invalidate_inode_pages() while we waited for the page lock.

Also, the page may have been moved between a tmpfs inode and
swapper_space.  Because we don't hold the mapping->page_lock across the
entire truncate operation any more.

Also, change the initial truncate scan (the non-blocking one which is
there to stop as much writeout as possible) so that it is immune to
other CPUs decreasing page->index.

Also fix negated test in invalidate_inode_pages2().  Not sure how that
got in there.

911ceab5

[PATCH] distinguish between address span of a zone and the number · d3975580

Andrew Morton authored Oct 04, 2002

From David Mosberger

The patch below fixes a bug in nr_free_zone_pages() which shows when a
zone has hole.  The problem is due to the fact that "struct zone"
didn't keep track of the amount of real memory in a zone.  Because of
this, nr_free_zone_pages() simply assumed that a zone consists entirely
of real memory.  On machines with large holes, this has catastrophic
effects on VM performance, because the VM system ends up thinking that
there is plenty of memory left over in a zone, when in fact it may be
completely full.

The patch below fixes the problem by replacing the "size" member in
"struct zone" with "spanned_pages" and "present_pages" and updating
page_alloc.c.

d3975580

[PATCH] remove debug code from list_del() · 9d66d9e9

Andrew Morton authored Oct 04, 2002

It hasn't caught any bugs, and it is causing confusion over whether
this is a permanent part of list_del() behaviour.

9d66d9e9

[PATCH] hugetlb kmap fix · db12b88f

Andrew Morton authored Oct 04, 2002

From Bill Irwin

This patch makes alloc_hugetlb_page() kmap() the memory it's zeroing,
and cleans up a tiny bit of list handling on the side.  Without this
fix, it oopses every time it's called.

db12b88f

[PATCH] fix /proc/vmstat:pgpgout/pgpgin · 908325dc

Andrew Morton authored Oct 04, 2002

These numbers are being sent to userspace as number-of-sectors, whereas
they should be number-of-k.

908325dc