Commits · 11f6ed36b959131a0d990253f07e5105fc4d8901 · Kirill Smelkov / linux

22 Oct, 2023 40 commits

Kent Overstreet authored Mar 30, 2020

Dropping the wrong kind of lock can't lead to anything good...
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

11f6ed36

bcachefs: Fix inodes pass in fsck · 1d60b999

Kent Overstreet authored Mar 30, 2020

It wasn't updated for the patch that switched inodes to using the offset
field of struct bkey.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

1d60b999

bcachefs: Fix ec_stripe_update_ptrs() · e5e6aaa7

Kent Overstreet authored Mar 30, 2020

bch2_btree_iter_set_pos() invalidates the key returned by peek().
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

e5e6aaa7

bcachefs: Check btree topology at startup · d06c1a0c

Kent Overstreet authored Mar 29, 2020

When initial btree gc was changed to overlay journal keys as it walks
the btree, it also stopped checking btree topology.

Previously, checking btree topology was a fairly complicated affair -
but it's much easier now that btree_ptr_v2 has min_key in the pointer.

This rewrites the old range_checks code and uses it in both runtime and
initial gc.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

d06c1a0c

bcachefs: Don't allocate memory while holding journal reservation · a0e491c0

Kent Overstreet authored Mar 30, 2020

This fixes a lockdep splat - allocating memory can call
bch2_clear_page_bits() which takes mark_lock.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

a0e491c0

bcachefs: Reduce max nr of btree iters when lockdep is on · 2c31e657

Kent Overstreet authored Mar 29, 2020

This is so we don't overflow MAX_LOCK_DEPTH.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

2c31e657

bcachefs: Kill bkey_type_successor · 39fb2983

Kent Overstreet authored Jan 07, 2020

Previously, BTREE_ID_INODES was special - inodes were indexed by the
inode field, which meant the offset field of struct bpos wasn't used,
which led to special cases in e.g. the btree iterator code.

Now, inodes in the inodes btree are indexed by the offset field.

Also: prevously min_key was special for extents btrees, min_key for
extents would equal max_key for the previous node. Now, min_key =
bkey_successor() of the previous node, same as non extent btrees.

This means we can completely get rid of
btree_type_sucessor/predecessor.

Also make some improvements to the metadata IO validate/compat code.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

39fb2983

bcachefs: Switch a BUG_ON() to a warning · b72633ae

Kent Overstreet authored Mar 29, 2020

This has popped and thus needs to be debugged, but the assertion firing
isn't necessarily fatal so switch it to a warning.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

b72633ae

bcachefs: Use kvpmalloc mempools for compression bounce · 22f77698

Kent Overstreet authored Mar 29, 2020

This fixes an issue where mounting would fail because of memory
fragmentation - previously the compression bounce buffers were using
get_free_pages().
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

22f77698

bcachefs: Read journal when keep_journal on · 5a655f06

Kent Overstreet authored Mar 28, 2020

Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

5a655f06

bcachefs: Various fixes for interior update path · 56a40fbc

Kent Overstreet authored Mar 28, 2020

The locking was wrong, and we could get a use after free in the error
path where we weren't taking the entrie being freed off the unwritten
list.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

56a40fbc

bcachefs: Use memalloc_nofs_save() · 4e4758c6

Kent Overstreet authored Mar 27, 2020

vmalloc allocations don't always obey GFP_NOFS - memalloc_nofs_save() is
the prefered approach for the future.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

4e4758c6

bcachefs: Improve error message in fsck · f7005e01

Kent Overstreet authored Mar 25, 2020

Seeing the extents that were overlapping is highly useful for figuring
out what went wrong.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

f7005e01

bcachefs: Add an option for keeping journal entries after startup · f1d786a0

Kent Overstreet authored Mar 25, 2020

This will be used by the userspace debug tools.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

f1d786a0

bcachefs: Fix an assertion when nothing to replay · 2f194e16

Kent Overstreet authored Mar 25, 2020

Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

2f194e16

bcachefs: Journal updates to interior nodes · 6357d607

Kent Overstreet authored Feb 08, 2020

Previously, the btree has always been self contained and internally
consistent on disk without anything from the journal - the journal just
contained pointers to the btree roots.

However, this meant that btree node split or compact operations - i.e.
anything that changes btree node topology and involves updates to
interior nodes - would require that interior btree node to be written
immediately, which means emitting a btree node write that's mostly empty
(using 4k of space on disk if the filesystemm blocksize is 4k to only
write perhaps ~100 bytes of new keys).

More importantly, this meant most btree node writes had to be FUA, and
consumer drives have a history of slow and/or buggy FUA support - other
filesystes have been bit by this.

This patch changes the interior btree update path to journal updates to
interior nodes, after the writes for the new btree nodes have completed.
Best of all, it turns out to simplify the interior node update path
somewhat.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

6357d607

bcachefs: Replay interior node keys · f44a6a71

Kent Overstreet authored Mar 15, 2020

This slightly modifies the journal replay code so that it can replay
updates to interior nodes.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

f44a6a71

bcachefs: trans_commit() path can now insert to interior nodes · e62d65f2

Kent Overstreet authored Mar 15, 2020

This will be needed for the upcoming patches to journal updates to
interior btree nodes.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

e62d65f2

bcachefs: Disable extent merging · 47143a75

Kent Overstreet authored Mar 24, 2020

Extent merging is currently broken, and will be reimplemented
differently soon - right now it only happens when btree nodes are being
compacted, which makes it difficult to test.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

47143a75

bcachefs: Fix a locking bug in fsck · 0728eed7

Kent Overstreet authored Mar 21, 2020

This works around a btree locking issue - we can't be holding read locks
while taking write locks, which currently means we can't have live
iterators holding read locks at commit time.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

0728eed7

bcachefs: Fix count_iters_for_insert() · fa4dc398

Kent Overstreet authored Mar 21, 2020

This fixes a transaction iterator overflow.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

fa4dc398

bcachefs: Fix an iterator bug · 8666a9ad

Kent Overstreet authored Mar 18, 2020

We were incorrectly not restarting the transaction when re-traversing
iterators.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

8666a9ad

bcachefs: Shut down quicker · 6d61724b

Kent Overstreet authored Mar 18, 2020

Internal writes (i.e. copygc/rebalance operations) shouldn't be blocking
on the allocator when we're going RO.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

6d61724b

bcachefs: BCH_FEATURE_new_extent_overwrite is now required · 97328a1a

Kent Overstreet authored Mar 18, 2020

The patch "bcachefs: Move extent overwrite handling out of core btree
code" should have been flipping on this feature bit; extent btree nodes
in the old format have to be rewritten before we can insert into them
with the new extent update path. Not turning on this feature bit was
causing us to go into an infinite loop where we keep rewriting btree
nodes over and over.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

97328a1a

bcachefs: Clear BCH_FEATURE_extents_above_btree_updates on clean shutdown · 5d548743

Kent Overstreet authored Mar 16, 2020

This is needed so that users can roll back to before "d9bb516b2d
bcachefs: Move extent overwrite handling out of core btree code", which
it appears may still be buggy.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

5d548743

bcachefs: Fix another iterator leak · 716254b8

Kent Overstreet authored Mar 16, 2020

This updates bch2_rbio_narrow_crcs() to the current style for
transactional btree code, and fixes a rare panic on iterator overflow.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

716254b8

bcachefs: Don't use peek_filter() unnecessarily · 19f24758

Kent Overstreet authored Mar 16, 2020

Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

19f24758

bcachefs: Fix a use after free in dio write path · 286d8ad0

Kent Overstreet authored Mar 16, 2020

Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

286d8ad0

bcachefs: Drop unused export · 511ed5bf

Kent Overstreet authored Mar 15, 2020

Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

511ed5bf

bcachefs: Move extent overwrite handling out of core btree code · e3e464ac

Kent Overstreet authored Dec 30, 2019

Ever since the btree code was first written, handling of overwriting
existing extents - including partially overwriting and splittin existing
extents - was handled as part of the core btree insert path. The modern
transaction and iterator infrastructure didn't exist then, so that was
the only way for it to be done.

This patch moves that outside of the core btree code to a pass that runs
at transaction commit time.

This is a significant simplification to the btree code and overall
reduction in code size, but more importantly it gets us much closer to
the core btree code being completely independent of extents and is
important prep work for snapshots.

This introduces a new feature bit; the old and new extent update models
are incompatible when the filesystem needs journal replay.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

e3e464ac

bcachefs: btree_iter_peek_with_updates() · 57b0b3db

Kent Overstreet authored Mar 05, 2020

Introduce a new iterator method that provides a consistent view of the
btree plus uncommitted updates.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

57b0b3db

bcachefs: Fix build when CONFIG_BCACHEFS_DEBUG=n · 7d6f9b64

Kent Overstreet authored Mar 15, 2020

Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

7d6f9b64

bcachefs: More btree iter invariants · 2e70ce56

Kent Overstreet authored Feb 18, 2020

Ensure that iter->pos always lies between the start and end of iter->k
(the last key returned). Also, bch2_btree_iter_set_pos() now invalidates
the key that peek() or next() returned.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

2e70ce56

bcachefs: Simplify bch2_btree_iter_peek_slot() · c3801239
Kent Overstreet authored Mar 13, 2020
```
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
```
c3801239

bcachefs: Iterator debug code improvements · 2dac0eae

Kent Overstreet authored Feb 18, 2020

More aggressively checking iterator invariants, and fixing the resulting
bugs. Also greatly simplifying iter_next() and iter_next_slot() - they
were hyper optimized before, but the optimizations were getting too
brittle.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

2dac0eae

bcachefs: Skip 0 size deleted extents in journal replay · 3186c80f

Kent Overstreet authored Mar 05, 2020

These are created by the new extent update path, but not used yet by the
recovery code and they break the existing recovery code, so we can just
skip them.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

3186c80f

bcachefs: Traverse iterator in journal replay · f6d0368e

Kent Overstreet authored Mar 09, 2020

This fixes a bug where we end up spinning in journal replay - in theory
this shouldn't be necessary though, transaction reset should be
re-traversing all iterators.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

f6d0368e

bcachefs: Don't log errors that are expected during shutdown · a7b46a3d

Kent Overstreet authored Mar 09, 2020

Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

a7b46a3d

bcachefs: Fix bch2_dump_bset() · 24e0c3f8

Kent Overstreet authored Mar 07, 2020

It's used in the write path when the bset isn't in the btree node
buffer.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

24e0c3f8

bcachefs: Fix another iterator leak · 27beb810

Kent Overstreet authored Mar 07, 2020

Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

27beb810