- 30 Apr, 2002 16 commits
-
-
Andrew Morton authored
Implements hashed waitqueues for buffer_heads. Drops twelve bytes from struct buffer_head.
-
Andrew Morton authored
Moves all buffer_head-related stuff out of linux/fs.h and into linux/buffer_head.h. buffer_head.h is currently included at the very end of fs.h. So it is possible to include buffer_head directly from all .c files and remove this nested include. Also rationalises all the set_buffer_foo() and mark_buffer_bar() functions. We have: set_buffer_foo(bh) clear_buffer_foo(bh) buffer_foo(bh) and, in some cases, where needed: test_set_buffer_foo(bh) test_clear_buffer_foo(bh) And that's it. BUFFER_FNS() and TAS_BUFFER_FNS() macros generate all the above real inline functions. Normally not a big fan of cpp abuse, but in this case it fits. These function-generating macros are available to filesystems to expand their own b_state functions. JBD uses this in one case.
-
Andrew Morton authored
Remove show_buffers(). It really has nothing to show any more. just buffermem_pages() - move that out into the callers. There's a lot of duplication in this code. better approach would be to remove all the duplicated code out in the architectures and implement generic show_memory_state(). Later.
-
Andrew Morton authored
Remove PG_skip. Nothing is using it (the change was acked by rmk a while back)
-
Andrew Morton authored
Removes inode.i_dirty_data_buffers. It's no longer used - all dirty buffers have their pages marked dirty and filemap_fdatasync() / filemap_fdatawait() catches it all. Updates all callers. This required a change in JFS - it has "metapages" which are a container around a page which holds metadata. They were holding these pages locked and were relying on fsync_inode_data_buffers for writing them out. So fdatasync() deadlocked. I've changed JFS to not lock those pages. Change was acked by Dave Kleikamp <shaggy@austin.ibm.com> as the right thing to do, but may not be complete. Probably igrab() against ->host is needed to pin the address_space down.
-
Andrew Morton authored
Removal of buffer_head.b_inode. The list_emptiness of b_inode_buffers is used to indicate whether the buffer is on an inode's i_dirty_buffers.
-
Andrew Morton authored
Remove writeout_one_page(), waitfor_one_page() and the now-unused generic_buffer_fdatasync(). Add new write_one_page(struct page *page, int wait) which is exported to modules. Update callers to use that. It's only used for IS_SYNC operations.
-
Andrew Morton authored
Removes some redundant BUG checks - trueness of PageSwapCache() implies that page->mapping is non-NULL, and we've already checked that.
-
Andrew Morton authored
page->flags cleanup. Moves the definitions of the page->flags bits and all the PageFoo macros into linux/page-flags.h. That file is currently included from mm.h, but the stage is set to remove that and include page-flags.h direct in all .c files which require that. (120 of them). The patch also makes all the page flag macros and functions consistent: For PG_foo, the following functions are defined: SetPageFoo ClearPageFoo TestSetPageFoo TestClearPageFoo PageFoo and that's it. - Page_Uptodate is renamed to PageUptodate - LockPage is removed. All users updated to use SetPageLocked - UnlockPage is removed. All callers updated to use unlock_page(). it's a real function - there's no need to hide that fact. - PageTestandClearReferenced renamed to TestClearPageReferenced - PageSetSlab renamed to SetPageSlab - __SetPageReserved is removed. It's an infinitesimally small microoptimisation, and is inconsistent. - TryLockPage is renamed to TestSetPageLocked - PageSwapCache() is renamed to page_swap_cache(), so it doesn't pretend to be a page->flags bit test.
-
Andrew Morton authored
Convert minixfs directory code to not rely on the state of data outside i_size.
-
Andrew Morton authored
Removes the buffer_head unused list. Use a mempool instead. The reduced lock contention provided about a 10% boost on ANton's 12-way.
-
Andrew Morton authored
[ I reversed the order in which writeback walks the superblock's dirty inodes. It sped up dbench's unlink phase greatly. I'm such a sleaze ] The core writeback patch. Switches file writeback from the dirty buffer LRU over to address_space.dirty_pages. - The buffer LRU is removed - The buffer hash is removed (uses blockdev pagecache lookups) - The bdflush and kupdate functions are implemented against address_spaces, via pdflush. - The relationship between pages and buffers is changed. - If a page has dirty buffers, it is marked dirty - If a page is marked dirty, it *may* have dirty buffers. - A dirty page may be "partially dirty". block_write_full_page discovers this. - A bunch of consistency checks of the form if (!something_which_should_be_true()) buffer_error(); have been introduced. These fog the code up but are important for ensuring that the new buffer/page code is working correctly. - New locking (inode.i_bufferlist_lock) is introduced for exclusion from try_to_free_buffers(). This is needed because set_page_dirty is called under spinlock, so it cannot lock the page. But it needs access to page->buffers to set them all dirty. i_bufferlist_lock is also used to protect inode.i_dirty_buffers. - fs/inode.c has been split: all the code related to file data writeback has been moved into fs/fs-writeback.c - Code related to file data writeback at the address_space level is in the new mm/page-writeback.c - try_to_free_buffers() is now non-blocking - Switches vmscan.c over to understand that all pages with dirty data are now marked dirty. - Introduces a new a_op for VM writeback: ->vm_writeback(struct page *page, int *nr_to_write) this is a bit half-baked at present. The intent is that the address_space is given the opportunity to perform clustered writeback. To allow it to opportunistically write out disk-contiguous dirty data which may be in other zones. To allow delayed-allocate filesystems to get good disk layout. - Added address_space.io_pages. Pages which are being prepared for writeback. This is here for two reasons: 1: It will be needed later, when BIOs are assembled direct against pagecache, bypassing the buffer layer. It avoids a deadlock which would occur if someone moved the page back onto the dirty_pages list after it was added to the BIO, but before it was submitted. (hmm. This may not be a problem with PG_writeback logic). 2: Avoids a livelock which would occur if some other thread is continually redirtying pages. - There are two known performance problems in this code: 1: Pages which are locked for writeback cause undesirable blocking when they are being overwritten. A patch which leaves pages unlocked during writeback comes later in the series. 2: While inodes are under writeback, they are locked. This causes namespace lookups against the file to get unnecessarily blocked in wait_on_inode(). This is a fairly minor problem. I don't have a fix for this at present - I'll fix this when I attach dirty address_spaces direct to super_blocks. - The patch vastly increases the amount of dirty data which the kernel permits highmem machines to maintain. This is because the balancing decisions are made against the amount of memory in the machine, not against the amount of buffercache-allocatable memory. This may be very wrong, although it works fine for me (2.5 gigs). We can trivially go back to the old-style throttling with s/nr_free_pagecache_pages/nr_free_buffer_pages/ in balance_dirty_pages(). But better would be to allow blockdev mappings to use highmem (I'm thinking about this one, slowly). And to move writer-throttling and writeback decisions into the VM (modulo the file-overwriting problem). - Drops 24 bytes from struct buffer_head. More to come. - There's some gunk like super_block.flags:MS_FLUSHING which needs to be killed. Need a better way of providing collision avoidance between pdflush threads, to prevent more than one pdflush thread working a disk at the same time. The correct way to do that is to put a flag in the request queue to say "there's a pdlfush thread working this disk". This is easy to do: just generalise the "ra_pages" pointer to point at a struct which includes ra_pages and the new collision-avoidance flag.
-
Andrew Morton authored
Changes the way in which the readahead code locates the readahead setting for the underlying device. - struct block_device and struct address_space gain a *pointer* to the current readahead tunable. - The tunable lives in the request queue and is altered with the traditional ioctl. - The value gets *copied* into the struct file at open() time. So a fcntl() mode to modify it per-fd is simple. - Filesystems which are not request_queue-backed get the address of the global `default_ra_pages'. If we want, this can become a tunable. - Filesystems are at liberty to alter address_space.ra_pages to point at some other fs-private default at new_inode/read_inode/alloc_inode time. - The ra_pages pointer can become a structure pointer if, at some time in the future, high-level code needs more detailed information about device characteristics. In fact, it'll need to become a struct pointer for use by writeback: my current writeback code has the problem that multiple pdflush threads can get stuck on the same request queue. That's a waste of resources. I currently have a silly flag in the superblock to try to avoid this. The proper way to get this exclusion is for the high-level writeback code to be able to do a test-and-set against a per-request_queue flag. That flag can live in a structure alongside ra_pages, conveniently accessible at the pagemap level. One thing still to-be-done is going into all callers of blk_init_queue and blk_queue_make_request and making sure that they're setting up a sensible default. ATA wants 248 sectors, and floppy drives don't want 128kbytes, I suspect. Later.
-
Andrew Morton authored
This patch provides global accounting of locked and dirty pages. It does this via lightweight per-CPU data structures. The page_cache_size accounting has been changed to use this facility as well. Locked and dirty page accounting is needed for making writeback and throttling decisions. The patch also starts to move code which is related to page->flags out of linux/mm.h and into linux/page-flags.h
-
Andrew Morton authored
Convert ext2 directory handling to not rely on the contents of pages outside i_size. This is because block_write_full_page (which is used for all writeback) zaps the page outside i_size.
-
Andrew Morton authored
Emit a printk when a page allocation fails. Considered useful for diagnosing crashes.
-
- 29 Apr, 2002 2 commits
-
-
Alexander Viro authored
OK, here comes. Patch below is an attempt to do the fastwalk stuff in right way and so far it seems to be working. - dentry leak is plugged - locked/unlocked state of nameidata doesn't depend on history - it depends only on point in code. - LOOKUP_LOCKED is gone. - following mounts and .. doesn't drop dcache_lock - light-weight permission check distinguishes between "don't know" and "permission denied", so we don't call full-blown permission() unless we have to. - code that changes root/pwd holds dcache_lock _and_ write lock on current->fs->lock. I.e. if we hold dcache_lock we can safely access our ->fs->{root,pwd}{,mnt} - __d_lookup() does not increment refcount; callers do dget_locked() if they need it (behaviour of d_lookup() didn't change, obviously). - link_path_walk() logics had been (somewhat) cleaned up.
-
Martin Dalecki authored
- Fix bogus set_multimode() change. I tough I had reverted it before diff-ing. This was causing hangs of /dev/hdparm -m8 /dev/hda and similar commands.
-
- 28 Apr, 2002 22 commits
-
-
Linus Torvalds authored
-
Linus Torvalds authored
-
bk://linuxusb.bkbits.net/linus-2.5Linus Torvalds authored
into home.transmeta.com:/home/torvalds/v2.5/linux
-
Andreas Dilger authored
The following patch is a _very minor_ cleanup in the ext3 code for ext3_prepare_write(). It simply removes the setting of "handle" from the current transaction handle, because "handle" is actually set again a couple of lines later, where it properly allocates a new transaction handle for this write. The code removed in this patch is probably left over from some development version of ext3 where the transaction handle was started before ext3_prepare_write was called. The only reason I saw this was because I was trying to find where the handle was allocated for an ext3 file write, and at first glance it didn't appear to be allocated anywhere...
-
Linus Torvalds authored
-
http://fbdev.bkbits.net/fbdev-2.5Linus Torvalds authored
into home.transmeta.com:/home/torvalds/v2.5/linux
-
Ivan Kokshaysky authored
Yes, gdb and strace are broken on alpha since 2.5.6, IIRC. Some necessary 'thread_info' changes still are missing in ptrace.c. Fixed thus.
-
http://lia64.bkbits.net/to-linus-2.5Linus Torvalds authored
into home.transmeta.com:/home/torvalds/v2.5/linux
-
Chris Wright authored
The patch below fixes sem_exit() so that the BKL is always released.
-
Sebastian Dröge authored
Here's a simple patch to allow the bttv driver to be build with newer binutils...
-
Benjamin LaHaise authored
In conjunction with some of the earlier problems found in /proc code, now it turns out that snprintf doesn't work correctly in the kernel... Without the following patch, snprintf can return lengths greater than the size argument passed. The net effect is that code using the return value from snprintf can still buffer overrun. This is fixed by not updating the pointer in the buffer unless there is actually space.
-
Russell King authored
The following patch drops the above functions from the ARM port; we've already removed them from the syscall table on ARM, so we can safely remove these from the ARM build.
-
bk://linux-ntfs.bkbits.net/ntfs-tng-2.5Linus Torvalds authored
into home.transmeta.com:/home/torvalds/v2.5/linux
-
Dave Hansen authored
I added ->readdir(), and a new section referring to the sb_op BKL shift as well.
-
Douglas Gilbert authored
Here is some documentation to describe the interface of lower level drivers (e.g. aic7xxx) in the scsi subsystem. The patch is made up of a text file: drivers/scsi/scsi_mid_low_api.txt and some Documentation/DocBook additions to enable html, ps and pdf renderings.
-
Andrew Morton authored
Here's an array-based implementation.
-
Brian Gerst authored
ARM arch
-
Brian Gerst authored
m68k arch
-
Brian Gerst authored
SuperH arch
-
Brian Gerst authored
CRIS and x86-64 arches
-
Brian Gerst authored
The SYMBOL_NAME macro (and variations) have been obsolete since 2.1.0, when the option to compile the kernel in a.out format was removed. This patch starts the process of removing these macros, starting with x86.
-
Dave Jones authored
Originally by Christoph Hellwig back in February. It recieved no objections when posted to l-k & Jens. o remove DEVICE_REQUEST definitions - never used in blk.h itself. o remove DEVICE_ON() - never used at all. o define LOCAL_END_REQUEST when we do not want end_request() instead of other hacks. o remove DEVICE_OFF() - only used in floppy driver, thus one now has a private end_request(). o use private end_request() functions for drivers not providing randomness. o remove TIMEOUT_VALUE - only ever used in hd.c
-