Commits · c06fd892405fbedfbf4572a2f8d5df2721a86937 · Kirill Smelkov / linux

23 Sep, 2002 11 commits

Merge http://linux-isdn.bkbits.net/linux-2.5.make · c06fd892
Linus Torvalds authored Sep 22, 2002
```
into home.transmeta.com:/home/torvalds/v2.5/linux
```
c06fd892
kbuild: Convert missed L_TARGET references · a8c7db20
Kai Germaschewski authored Sep 23, 2002
```
When converting all L_TARGETs to lib.a, I missed these instances.
```
a8c7db20

[PATCH] Compile fixes for alpha arch · 41447041

Peter Rival authored Sep 22, 2002

Update alpha port to work with new nanosecond xtime, and the in_atomic()
requirements.

41447041

Merge bk://thebsh.namesys.com/bk/reiser3-linux-2.5 · 59e8b32c
Linus Torvalds authored Sep 22, 2002
```
into home.transmeta.com:/home/torvalds/v2.5/linux
```
59e8b32c

[PATCH] fix UP_APIC linkage problem in 2.5.3[78] · cb45d949

Mikael Pettersson authored Sep 22, 2002

The problem is that the local APIC code references stuff in
mpparse, but 2.5.37 changed arch/i386/kernel/Makefile to only
compile mpparse for SMP.

This patch works around this by enforcing CONFIG_X86_MPPARSE
for all LOCAL_APIC-enabled configs.

cb45d949

[PATCH] bio_get_nr_vecs · 63b9d36d

Jens Axboe authored Sep 22, 2002

Add bio_get_nr_vecs(). It returns an approximate number of pages that
can be added to a block device. It's just a ballpark number, but I think
this is quite fine for the type of thing it is needed for: mpage etc
need to know an approx size of a bio that they need to allocate. It
would be silly to continously allocate 64-page sized bio_vec entries, if
the target cannot do more than 8, for example.

63b9d36d

[PATCH] pdc4030 · 58c1b542
Jens Axboe authored Sep 22, 2002
```
make pdc4030 work
```
58c1b542

[PATCH] trm compile · 6c99eec3

Jens Axboe authored Sep 22, 2002

Bad merge from 2.4.20-pre-ac, ide_build_dmatable() does not need data
direction argument in 2.5 (it's implicit in the request)

6c99eec3

[PATCH] trivial typo in drivers/ide/pci/sl82c105.c · 76dc17b4
Tim Schmielau authored Sep 22, 2002

76dc17b4

[PATCH] Re: 2.5.36 IDE fixes · 7d663f71

Ivan Kokshaysky authored Sep 22, 2002

I'm terribly sorry - I've sent you the wrong diff, it was
some intermediate variant. Actually it added extra breakage to
ide_hwif_configure().

Desired behavior was:

if ctl == base == 0, the device is in "true legacy" mode (as per PCI
spec); use values from the base address registers otherwise.

7d663f71

[PATCH] more bio updates · ef869838

Jens Axboe authored Sep 22, 2002

cleanup end_that_request_first() end_io handling, and fix bug where
partial completes didn't get accounted right wrt blk_recalc_rq_sectors()

ef869838

22 Sep, 2002 29 commits

Merge master.kernel.org:/home/davem/BK/sparc-2.5 · f20bf018
Linus Torvalds authored Sep 22, 2002
```
into home.transmeta.com:/home/torvalds/v2.5/linux
```
f20bf018
Merge master.kernel.org:/home/davem/BK/net-2.5 · e7144e64
Linus Torvalds authored Sep 22, 2002
```
into home.transmeta.com:/home/torvalds/v2.5/linux
```
e7144e64

[PATCH] low-latency page reclaim · 407ee6c8

Andrew Morton authored Sep 22, 2002

Convert the VM to not wait on other people's dirty data.

 - If we find a dirty page and its queue is not congested, do some writeback.

 - If we find a dirty page and its queue _is_ congested then just
   refile the page.

 - If we find a PageWriteback page then just refile the page.

 - There is additional throttling for write(2) callers.  Within
   generic_file_write(), record their backing queue in ->current.
   Within page reclaim, if this tasks encounters a page which is dirty
   or under writeback onthis queue, block on it.  This gives some more
   writer throttling and reduces the page refiling frequency.

It's somewhat CPU expensive - under really heavy load we only get a 50%
reclaim rate in pages coming off the tail of the LRU.  This can be
fixed by splitting the inactive list into reclaimable and
non-reclaimable lists.  But the CPU load isn't too bad, and latency is
much, much more important in these situations.

Example: with `mem=512m', running 4 instances of `dbench 100', 2.5.34
took 35 minutes to compile a kernel.  With this patch, it took three
minutes, 45 seconds.

I haven't done swapcache or MAP_SHARED pages yet.  If there's tons of
dirty swapcache or mmap data around we still stall heavily in page
reclaim.  That's less important.

This patch also has a tweak for swapless machines: don't even bother
bringing anon pages onto the inactive list if there is no swap online.

407ee6c8

[PATCH] use the congestion APIs in pdflush · c9b22619

Andrew Morton authored Sep 22, 2002

The key concept here is that pdflush does not block on request queues
any more.  Instead, it circulates across the queues, keeping any
non-congested queues full of write data.  When all queues are full,
pdflush takes a nap, to be woken when *any* queue exits write
congestion.

This code can keep sixty spindles saturated - we've never been able to
do that before.

 - Add the `nonblocking' flag to struct writeback_control, and teach
   the writeback paths to honour it.

 - Add the `encountered_congestion' flag to struct writeback_control
   and teach the writeback paths to set it.

So as soon as a mapping's backing_dev_info indicates that it is getting
congested, bale out of writeback.  And don't even start writeback
against filesystems whose queues are congested.

 - Convert pdflush's background_writeback() function to use
   nonblocking writeback.

This way, a single pdflush thread will circulate around all the
dirty queues, keeping them filled.

 - Convert the pdlfush `kupdate' function to do the same thing.

This solves the problem of pdflush thread pool exhaustion.

It solves the problem of pdflush startup latency.

It solves the (minor) problem wherein `kupdate' writeback only writes
back a single disk at a time (it was getting blocked on each queue in
turn).

It probably means that we only ever need a single pdflush thread.

c9b22619

[PATCH] use the queue congestion API in ext2_preread_inode() · f3332384

Andrew Morton authored Sep 22, 2002

Use the new queue congestion detector in ext2_preread_inode(). Don't
try the speculative read if the read queue is congested.

Also, don't try it if the disk is write-congested. Presumably it is
more important to get the dirty memory cleaned out.

f3332384

[PATCH] infrastructure for monitoring queue congestion state · 4cef1b04

Andrew Morton authored Sep 22, 2002

The patch provides a means for the VM to be able to determine whether a
request queue is in a "congested" state.  If it is congested, then a
write to (or read from) the queue may cause blockage in
get_request_wait().

So the VM can do:

	if (!bdi_write_congested(page->mapping->backing_dev_info))
		writepage(page);

This is not exact.  The code assumes that if the request queue still
has 1/4 of its capacity (queue_nr_requests) available then a request
will be non-blocking.  There is a small chance that another CPU could
zoom in and consume those requests.  But on the rare occasions where
that may happen the result will mereley be some unexpected latency -
it's not worth doing anything elaborate to prevent this.

The patch decreases the size of `batch_requests'.  batch_requests is
positively harmful - when a "heavy" writer and a "light" writer are
both writing to the same queue, batch_requests provides a means for the
heavy writer to massively stall the light writer.  Instead of waiting
for one or two requests to come free, the light writer has to wait for
32 requests to complete.

Plus batch_requests generally makes things harder to tune, understand
and predict.  I wanted to kill it altogether, but Jens says that it is
important for some hardware - it allows decent size requests to be
submitted.

The VM changes which go along with this code cause batch_requests to be
not so painful anyway - the only processes which sleep in
get_request_wait() are the ones which we elect, by design, to wait in
there - typically heavy writers.


The patch changes the meaning of `queue_nr_requests'.  It used to mean
"total number of requests per queue".  Half of these are for reads, and
half are for writes.  This always confused the heck out of me, and the
code needs to divide queue_nr_requests by two all over the place.

So queue_nr_requests now means "the number of write requests per queue"
and "the number of read requests per queue".  ie: I halved it.

Also, queue_nr_requests was converted to static scope.  Nothing else
uses it.


The accuracy of bdi_read_congested() and bdi_write_congested() depends
upon the accuracy of mapping->backing_dev_info.  With complex block
stacking arrangements it is possible that ->backing_dev_info is
pointing at the wrong queue.  I don't know.

But the cost of getting this wrong is merely latency, and if it is a
problem we can fix it up in the block layer, by getting stacking
devices to communicate their congestion state upwards in some manner.

4cef1b04

[PATCH] don't hold mapping->private_lock while marking a page dirty · b5742733

Andrew Morton authored Sep 22, 2002

__set_page_dirty_buffers() is calling __mark_inode_dirty under
mapping->private_lock.

We don't need to hold ->private_lock across that call.  It's only there
to pin page->buffers.

This simplifies the VM locking heirarchy.

b5742733

[PATCH] fix ext3 in data=writeback mode · c8b254cc

Andrew Morton authored Sep 22, 2002

When I converted ext3 to use to use direct-to-BIO writeback for
data=writeback mode I forgot that we need to hold a transaction open on
behalf of MAP_SHARED pages.  The fileystem is BUGging in get_block()
because there is no transaction open.

So let's forget that idea for now and send data=writeback mode back to
ext3_writepage.

c8b254cc

Merge nuts.ninka.net:/home/davem/src/BK/sparcwork-2.5 · 2d35bd3f
David S. Miller authored Sep 22, 2002
```
into nuts.ninka.net:/home/davem/src/BK/sparc-2.5
```
2d35bd3f
Merge nuts.ninka.net:/home/davem/src/BK/network-2.5 · da29f6a8
David S. Miller authored Sep 22, 2002
```
into nuts.ninka.net:/home/davem/src/BK/net-2.5
```
da29f6a8
Merge master.kernel.org:/home/acme/BK/llc-2.5 · e1ec2e00
David S. Miller authored Sep 22, 2002
```
into nuts.ninka.net:/home/davem/src/BK/net-2.5
```
e1ec2e00
[LLC] move reason to the {station,sap,conn}_ev structs · 1502caff
Arnaldo Carvalho de Melo authored Sep 22, 2002
```
Slowly killing the ugly struct forest.
```
1502caff
[LLC] Make llc_save_primitive ready for dataunit/xid/test DGRAM packets · 86b74abd
Arnaldo Carvalho de Melo authored Sep 22, 2002

86b74abd

[LLC] use the core lists to get info for /proc/net/llc · 5d8c0602

Arnaldo Carvalho de Melo authored Sep 22, 2002

With this llc_ui_sockets is almost not needed anymore, next
changesets will deal with the dataunit/xid/test primitives, that
are still using it.

5d8c0602

[LLC] use sk->state_change when p_flag is cleared or core state changes · 1d84746d
Arnaldo Carvalho de Melo authored Sep 22, 2002

1d84746d
kbuild: arch/um cleanup / O_TARGET removal · 050aa25b
Kai Germaschewski authored Sep 22, 2002

050aa25b
kbuild: arch/sparc64 cleanup / O_TARGET removal · 3c6c1425
Kai Germaschewski authored Sep 22, 2002

3c6c1425
kbuild: arch/sparc cleanup / O_TARGET removal · d779a520
Kai Germaschewski authored Sep 22, 2002

d779a520
kbuild: arch/sh cleanup / O_TARGET removal · 3d234231
Kai Germaschewski authored Sep 22, 2002

3d234231
kbuild: arch/s390x cleanup / O_TARGET removal · 527bbbb9
Kai Germaschewski authored Sep 22, 2002

527bbbb9
kbuild: arch/s390 cleanup / O_TARGET removal · 21b0adc0
Kai Germaschewski authored Sep 22, 2002

21b0adc0
kbuild: arch/ppc64 cleanup / O_TARGET removal · ba84c823
Kai Germaschewski authored Sep 22, 2002

ba84c823
kbuild: arch/ppc cleanup / O_TARGET removal · ef7b31bd
Kai Germaschewski authored Sep 22, 2002

ef7b31bd
kbuild: arch/parisc cleanup / O_TARGET removal · 51345335
Kai Germaschewski authored Sep 22, 2002

51345335
kbuild: arch/mips64 cleanup / O_TARGET removal · f24aadaf
Kai Germaschewski authored Sep 22, 2002

f24aadaf
kbuild: arch/mips cleanup / O_TARGET removal · 94a3c7d6
Kai Germaschewski authored Sep 22, 2002

94a3c7d6
kbuild: arch/m68k cleanup / O_TARGET removal · 61e1f973
Kai Germaschewski authored Sep 22, 2002

61e1f973
kbuild: arch/ia64 cleanup / O_TARGET removal · 08e630ad
Kai Germaschewski authored Sep 22, 2002

08e630ad
kbuild: arch/cris cleanup / O_TARGET removal · 95c1628e
Kai Germaschewski authored Sep 22, 2002

95c1628e