- 22 Oct, 2023 40 commits
-
Kent Overstreet authored
This was causing a bug with transaction iterators overflowing; now, if triggers have to be re-executed, we always return -EINTR and retry from the start of the transaction. Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com> Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
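A minimal sketch of the resulting usage pattern (the names here are illustrative stand-ins, not the actual bcachefs API):

    /* Illustrative only: restart the whole transaction when the commit
     * path returns -EINTR (e.g. triggers had to be re-executed), rather
     * than continuing with stale iterators. */
    #include <errno.h>

    struct trans;                        /* opaque stand-in            */
    void trans_begin(struct trans *);    /* hypothetical: reset state  */
    int  queue_updates(struct trans *);  /* hypothetical               */
    int  trans_commit(struct trans *);   /* hypothetical: may -EINTR   */

    int update_with_retry(struct trans *t)
    {
            int ret;

            do {
                    trans_begin(t);
                    ret = queue_updates(t) ?: trans_commit(t);
            } while (ret == -EINTR);     /* retry from the very top    */

            return ret;
    }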
-
Kent Overstreet authored
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com> Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
-
Kent Overstreet authored
BTREE_INSERT_ATOMIC should really be the default mode, and there's not that much code that doesn't need it - so this is prep work for getting rid of the flag. Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com> Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
-
Kent Overstreet authored
It needs to be called when we get -EINTR due to e.g. lock restart - this fixes a transaction iterators overflow bug. Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com> Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
-
Kent Overstreet authored
This assertion was wrong for interior nodes (and wasn't terribly useful to begin with) Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com> Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
-
Kent Overstreet authored
The code that checked the current free space and waited while there was too much was causing issues - btree node allocations do not increment the write IO clock (perhaps they should); but more broadly, the check wouldn't run copygc at all until the device was mostly full, at which point it might have to do a lot of work at once. This redoes that logic so that copygc starts to run earlier, smoothly running more and more often as the device gets closer to full. Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com> Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
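A sketch of the general shape of such a heuristic (the constants and names are made up for illustration, not taken from the patch):

    /* Illustrative heuristic: scale the wait between copygc passes by the
     * remaining free space, so copygc runs rarely on an empty device and
     * nearly continuously as it approaches full. */
    #include <stdint.h>

    uint64_t copygc_wait_sectors(uint64_t capacity, uint64_t used)
    {
            uint64_t free = capacity - used;

            /* run again after roughly half the remaining free space has
             * been written, instead of waiting for a fullness threshold */
            return free / 2;
    }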
-
Kent Overstreet authored
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com> Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
-
Kent Overstreet authored
In __bch2_sb_field_resize, when a field's old and new sizes were both 0, we were doing an invalid write just past the end of the superblock. Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com> Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
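One plausible shape of the fix, as a hedged sketch (layout and names invented here):

    /* Illustrative: if the field doesn't exist before or after the resize
     * (old and new size both 0), there's no header to rewrite - and the
     * place we'd write it is just past the end of the superblock. */
    #include <stdint.h>

    struct sb_field { uint32_t u64s; };

    void sb_field_resize(struct sb_field *f, uint32_t old_u64s,
                         uint32_t new_u64s)
    {
            if (!old_u64s && !new_u64s)
                    return;                 /* nothing to move or write */

            /* ... shift the following fields, then update the header: */
            f->u64s = new_u64s;
    }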
-
Kent Overstreet authored
When disk space accounting was changed to be tracked by replicas entry, the ioctl interface was never updated; this patch finally does that. Additionally, the BCH_IOCTL_USAGE ioctl is now broken out into separate ioctls for filesystem and device usage. Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com> Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
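A rough sketch of what the split looks like (these struct layouts are invented for illustration, not the real uapi):

    /* Illustrative only: separate per-filesystem and per-device usage
     * ioctl arguments, replacing one combined BCH_IOCTL_USAGE. */
    #include <stdint.h>

    struct ioctl_fs_usage {
            uint64_t capacity;
            uint64_t used;
            uint32_t replica_entries_bytes; /* trailing per-replicas-entry
                                             * accounting follows */
    };

    struct ioctl_dev_usage {
            uint32_t dev;                   /* in: device index         */
            uint64_t buckets_used;          /* out: per-device counters */
            uint64_t sectors_used;
    };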
-
Justin Husted authored
Timestamp updates on the directory during a link operation were cached. This is inconsistent with other metadata operations such as rename, as well as being less efficient. Signed-off-by: Justin Husted <sigstop@gmail.com> Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
-
Kent Overstreet authored
Normally the in memory i_size is always greater than or equal to i_size on disk; this doesn't hold on filesystem error. Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com> Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
-
Kent Overstreet authored
When traversing nodes and we've reached the end of the btree, the current btree node will be NULL. Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com> Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
-
Kent Overstreet authored
Long overdue cleanup - this converts btree_node_iter_large uses to sort_iter. Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com> Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
-
Kent Overstreet authored
We're not really supposed to allocate from the same mempool more than once. Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com> Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
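The usual way around this, sketched with stand-in names: allocate once and carve the buffer up, since a mempool only guarantees forward progress for one outstanding element per user, and a second allocation in the same call chain can deadlock under memory pressure.

    /* Illustrative: one pool allocation split in two, instead of two
     * allocations from the same pool in one call chain. */
    #include <stddef.h>

    void *pool_alloc(size_t);   /* mempool stand-in: may block until free */
    void  pool_free(void *);

    void process(size_t half)
    {
            char *buf = pool_alloc(2 * half);
            char *a = buf, *b = buf + half;

            /* ... use a and b independently ... */
            (void) a; (void) b;

            pool_free(buf);
    }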
-
Kent Overstreet authored
The whiteout compaction path - as opposed to just dropping whiteouts - is now only needed for extents, and soon will only be needed for extent btree nodes in the old format. Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com> Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
-
Kent Overstreet authored
More prep work for snapshots: extents will soon be using KEY_TYPE_deleted for whiteouts, with 0 size. But we won't be able to keep these whiteouts with the rest of the extents in the btree node, since that would break the sorting invariants. We can deal with this by immediately moving the new whiteouts to the unwritten whiteouts area - this just means those whiteouts won't be sorted, so we need new code to sort them prior to merging them with the rest of the keys to be written. Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com> Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
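In outline (a standalone sketch, not the actual write path): sort the unwritten whiteouts, then do an ordinary merge with the already-sorted keys.

    /* Illustrative: whiteouts accumulate in insertion order, so they get
     * sorted at write time and merged with the sorted keys. Keys are
     * modeled as bare u64s here. */
    #include <stdint.h>
    #include <stdlib.h>

    static int cmp_u64(const void *a, const void *b)
    {
            uint64_t x = *(const uint64_t *) a, y = *(const uint64_t *) b;

            return x < y ? -1 : x > y;
    }

    void prep_write(uint64_t *whiteouts, size_t nr_w,
                    const uint64_t *keys, size_t nr_k, uint64_t *out)
    {
            size_t i = 0, j = 0;

            qsort(whiteouts, nr_w, sizeof(*whiteouts), cmp_u64);

            while (i < nr_w && j < nr_k)
                    *out++ = whiteouts[i] < keys[j] ? whiteouts[i++] : keys[j++];
            while (i < nr_w)
                    *out++ = whiteouts[i++];
            while (j < nr_k)
                    *out++ = keys[j++];
    }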
-
Kent Overstreet authored
This is prep work for snapshots: the algorithm in bch2_extent_sort_fix_overlapping() will break when we have multiple overlapping extents in unrelated snapshots. But we'll be able to make extents work like regular keys and use bch2_key_sort_fix_overlapping() for extent btree nodes if we make a couple of changes - the main one being to always emit new extents when we partially overwrite an existing (written) extent. Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com> Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
-
Kent Overstreet authored
We were calling __btree_node_key_to_offset() on a key that wasn't in the btree node. Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com> Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
-
Kent Overstreet authored
Inline data extents + reflink is still broken Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com> Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
-
Kent Overstreet authored
New helper function for setting incompatible feature bits Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com> Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
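Presumably something of this shape (a hedged sketch; the name and persistence details are invented):

    /* Illustrative: set a feature bit, persisting the superblock only if
     * the bit wasn't already set. */
    #include <stdint.h>

    void check_set_feature(uint64_t *sb_features, unsigned bit)
    {
            if (!(*sb_features & (1ULL << bit))) {
                    *sb_features |= 1ULL << bit;
                    /* ... mark the superblock dirty and write it out ... */
            }
    }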
-
Kent Overstreet authored
Older versions of gcc refuse to compile it the other way Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com> Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
-
Kent Overstreet authored
Small helper function. Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com> Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
-
Kent Overstreet authored
All uses were supposed to be switched over to c->freelist_lock Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
-
Kent Overstreet authored
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com> Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
-
Kent Overstreet authored
This implements extents that have their data inline, in the value, instead of the bkey value being pointers to the data. The read and write paths are updated to read from these new extent types and to write them out when the write size is small enough. Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
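The core idea, as a sketch (the threshold, names, and layout are made up for illustration):

    /* Illustrative: small writes store file data directly in the key's
     * value; larger writes fall back to normal pointer extents. */
    #include <stdint.h>
    #include <string.h>

    #define INLINE_DATA_MAX 512          /* hypothetical threshold */

    struct inline_data_val {
            uint16_t len;
            uint8_t  data[INLINE_DATA_MAX];
    };

    int write_inline(struct inline_data_val *v, const void *buf, size_t len)
    {
            if (len > INLINE_DATA_MAX)
                    return -1;           /* caller allocates a real extent */

            memcpy(v->data, buf, len);
            v->len = (uint16_t) len;
            return 0;
    }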
-
Kent Overstreet authored
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com> Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
-
Kent Overstreet authored
This changes bch2_cut_front and bch2_cut_back so that they're able to shorten the size of the value, and it also changes the extent update path to update the accounting in the btree node when this happens. When the size of the value is shortened, the space that's no longer used is zeroed out, so it's interpreted as noops (as implemented in the previous patch). Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com> Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
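A sketch of the shortening step, using a toy key layout (invented fields, not the real bkey):

    /* Illustrative: shrink a key's value in place; the freed tail is
     * zeroed, so each leftover word reads back as u64s == 0 noop padding. */
    #include <stdint.h>
    #include <string.h>

    struct xkey {
            uint8_t u64s;                /* total key size in 64-bit words */
            /* ... rest of header and value follow ... */
    };

    void shorten_value(struct xkey *k, uint8_t new_u64s)
    {
            uint64_t *p = (uint64_t *) k;

            memset(p + new_u64s, 0, (k->u64s - new_u64s) * sizeof(*p));
            k->u64s = new_u64s;
    }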
-
Kent Overstreet authored
For upcoming inline data extents, we're going to need to be able to shorten the value of existing bkeys in the btree - and to make that work we're going to need to pad out the space the value previously took up with something. This patch changes the various code that iterates over bkeys to handle k->u64s == 0 as meaning "skip the next 8 bytes". Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com> Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
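The iteration change, on the same kind of toy layout (the fields are invented; the u64s == 0 convention is from the patch):

    /* Illustrative: walking packed keys where u64s == 0 marks one u64 of
     * padding left behind by a shortened value. */
    #include <stdint.h>

    struct xkey {
            uint8_t u64s;                /* total key size in 64-bit words */
    };

    uint64_t *next_key(uint64_t *p)
    {
            const struct xkey *k = (const struct xkey *) p;

            return p + (k->u64s ? k->u64s : 1);  /* 0 => skip 8 bytes */
    }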
-
Kent Overstreet authored
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com> Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
-
Kent Overstreet authored
This implements code for storing small bkeys on the stack and allocating out of a mempool if they're too big. Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com> Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
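The pattern, roughly (stand-in names; malloc stands in for the mempool):

    /* Illustrative: small keys live in an on-stack buffer; oversized ones
     * fall back to the allocator. */
    #include <stdint.h>
    #include <stdlib.h>

    #define ONSTACK_U64S 12              /* hypothetical cutoff */

    struct key_buf {
            uint64_t *k;
            uint64_t  onstack[ONSTACK_U64S];
    };

    void key_buf_init(struct key_buf *b)
    {
            b->k = b->onstack;
    }

    void key_buf_exit(struct key_buf *b)
    {
            if (b->k != b->onstack)
                    free(b->k);
    }

    void key_buf_realloc(struct key_buf *b, size_t u64s)
    {
            if (u64s <= ONSTACK_U64S)
                    return;

            key_buf_exit(b);             /* drop any old heap buffer */
            b->k = malloc(u64s * sizeof(uint64_t));
    }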
-
Kent Overstreet authored
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com> Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
-
Kent Overstreet authored
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com> Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
-
Kent Overstreet authored
The error path in bch2_write wasn't updated when the end_io callback was added to bch_write_op. Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com> Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
-
Justin Husted authored
For security and conformance with other filesystems, the lost+found directory should not be world or group accessible. Signed-off-by: Justin Husted <sigstop@gmail.com> Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
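For example, with the plain POSIX call (a trivial sketch of the policy, not the fsck code):

    /* Illustrative: create lost+found with owner-only access. */
    #include <sys/stat.h>
    #include <sys/types.h>

    int make_lost_and_found(const char *path)
    {
            return mkdir(path, S_IRWXU);   /* mode 0700 */
    }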
-
Kent Overstreet authored
This is to fix a valgrind complaint - the code was correct, but too tricky for valgrind to know that. Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com> Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
-
Kent Overstreet authored
Prep work for extents with inline data Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com> Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
-
Kent Overstreet authored
The previous optimizations mean that using 32 bit mantissas is now a net loss - having bkey_float be only 4 bytes is good for prefetching. Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com> Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
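Roughly the shape being described (the field widths here are hypothetical):

    /* Illustrative: a 4-byte lookup-tree entry with a 16 bit mantissa;
     * four entries per 16 bytes keeps the tree dense for prefetching. */
    #include <stdint.h>

    struct bkey_float {
            uint32_t exponent   : 8;
            uint32_t key_offset : 8;
            uint32_t mantissa   : 16;
    };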
-
Kent Overstreet authored
This is basically equivalent to the original strategy of falling back to checking against the original key when the original key and previous key didn't differ in the required bits - except, now we only fall back when the search key doesn't differ in the required bits, which ends up being a bit faster. Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com> Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
-
Kent Overstreet authored
The assumption underlying BFLOAT_FAILED_PREV was wrong; the comparison we're doing in bset_search_tree() doesn't have to tell the pivot apart from the previous key, it just has to tell if search is definitely greater than or equal to the pivot. Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com> Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
-
Kent Overstreet authored
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com> Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
-