Commits · ac7eef0318c34c87e7ef9d574175917de1817ae6 · Kirill Smelkov / linux

22 Oct, 2023 40 commits

bcachefs: Don't report inodes to statfs · ac7eef03

Kent Overstreet authored Aug 15, 2020

We don't have a limit on the number of inodes in a filesystem, so this
is apparently the right way to report that.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

ac7eef03

bcachefs: Add a cond_resched() to bch2_alloc_write() · f9adbb7d

Kent Overstreet authored Aug 12, 2020

Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

f9adbb7d

bcachefs: Fix a couple null ptr derefs when no disk groups exist · 2d8c0da1

Kent Overstreet authored Aug 06, 2020

Normally successfully parsing a target means disk groups should exist,
but we don't want a BUG() or null ptr deref if we end up with an invalid
target.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

2d8c0da1

bcachefs: Fix disk groups not being updated when set via sysfs · 01566db2

Kent Overstreet authored Aug 12, 2020

Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

01566db2

bcachefs: Change copygc to consider bucket fragmentation · 142cbdff

Kent Overstreet authored Aug 12, 2020

When devices have different sized buckets this is more correct.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

142cbdff

bcachefs: Don't block on allocations when only writing to specific device · 1421bea3

Kent Overstreet authored Aug 12, 2020

Since the copygc thread is now global and not per device, we're not
freeing up space on any one device in bounded time - and indeed we never
really were, since rebalance wasn't moving data around between devices
with that objective.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

1421bea3

bcachefs: Fix a bug with the journal_seq_blacklist mechanism · 9f115ce9

Kent Overstreet authored Aug 04, 2020

Previously, we would start doing btree updates before writing the first
journal entry; if this was after an unclean shutdown, this could cause
those btree updates to not be blacklisted.

Also, move some code to headers for userspace debug tools.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

9f115ce9

bcachefs: Fix bch2_new_stripes_to_text() · 00c24f53

Kent Overstreet authored Aug 04, 2020

Painful-looking typo, fortunately difficult to hit.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

00c24f53

bcachefs: Don't disallow btree writes to RO devices · 768b42a7

Kent Overstreet authored Aug 03, 2020

There's an inherent race with setting devices RO when they have dirty
btree nodes on them. We already check if a btree node is on an RO device
before we dirty it, so this patch just allows those writes so that we
don't have errors forcing the entire filesystem read only when trying to
remove a device.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

768b42a7

bcachefs: Fix maximum btree node size · 79e72a90

Kent Overstreet authored Aug 03, 2020

Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

79e72a90

bcachefs: Convert various code to printbuf · 7807e143

Kent Overstreet authored Jul 25, 2020

printbufs know how big the buffer is that was allocated, so we can get
rid of the random PAGE_SIZEs all over the place.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

7807e143

bcachefs: Remove some uses of PAGE_SIZE in the btree code · 4580baec

Kent Overstreet authored Jul 25, 2020

For portability to userspace, we should try to avoid working in kernel
pages.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

4580baec

bcachefs: Ensure we wake up threads locking node when reusing it · 760992aa

Kent Overstreet authored Jul 25, 2020

Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

760992aa

bcachefs: Fix bch2_btree_node_insert_fits() · f8058242

Kent Overstreet authored Jul 25, 2020

It should be checking for the recently added flag
btree_node_needs_rewrite.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

f8058242

bcachefs: Ensure we only allocate one EC bucket per writepoint · d3a2b5d8

Kent Overstreet authored Jul 23, 2020

Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

d3a2b5d8

bcachefs: Fix a race with BCH_WRITE_SKIP_CLOSURE_PUT · 33e33961

Kent Overstreet authored Jul 22, 2020

Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

33e33961

bcachefs: Don't let copygc buckets be stolen by other threads · 74ed7e56

Kent Overstreet authored Jul 21, 2020

And assorted other copygc fixes.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

74ed7e56

bcachefs: Delete unused arguments · 3d080aa5

Kent Overstreet authored Jul 22, 2020

Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

3d080aa5

bcachefs: Fix an error path · 4fe7efa1

Kent Overstreet authored Jul 22, 2020

We were missing a 'goto retry' and continuing on with an error pointer.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

4fe7efa1

bcachefs: Refactor replicas code · 988e98cf

Kent Overstreet authored Jul 10, 2020

A while back the mechanism for garbage collecting unused replicas entries
was significantly improved, but some cleanup was missed - this patch
does that now.

This is also prep work for a patch to account for erasure coded parity
blocks separately - we need to consolidate the logic for
checking/marking the various replicas entries from one bkey into a
single function.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

988e98cf

bcachefs: Don't restrict copygc writes to the same device · 8f3b41ab

Kent Overstreet authored Jul 11, 2020

This no longer makes any sense, since copygc is now one thread per
filesystem, not per device, with a single write point.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

8f3b41ab

bcachefs: Add bch2_blk_status_to_str() · 63b214e7

Kent Overstreet authored Jul 21, 2020

We define our own BLK_STS_REMOVED, so we need our own to_str helper too.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

63b214e7

bcachefs: Fix a faulty assertion · a2b5313a

Kent Overstreet authored Jul 21, 2020

Now that updates to interior nodes are journalled, we shouldn't be
checking topology of interior nodes until we've finished replaying
updates to that node.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

a2b5313a

bcachefs: Wrap write path in memalloc_nofs_save() · e8306e3b

Kent Overstreet authored Jul 20, 2020

This fixes a lockdep splat where we're allocating memory with vmalloc in
the compression bounce path, which doesn't always obey GFP_NOFS.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

e8306e3b

bcachefs: Add an option for rebuilding the replicas section · f621e152

Kent Overstreet authored Jul 20, 2020

There is a bug where we can end up clearing the data_has field in the
superblock members section, which causes us to skip reading the journal
and thus journal replay fails. This option tells the recovery path to
not trust those fields.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

f621e152

bcachefs: Make copygc thread global · e6d11615

Kent Overstreet authored Jul 11, 2020

Per-device copygc threads don't move data to different devices and they
make fragmentation worse - they don't make much sense anymore.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

e6d11615

bcachefs: Drop extra pointers when marking data as in a stripe · f793bc15

Kent Overstreet authored Jul 11, 2020

We ideally want the buckets used for the extra initial replicas to be
reused right away.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

f793bc15

bcachefs: Fix extent_ptr_durability() calculation for erasure coded data · 1d2ff0a6

Kent Overstreet authored Jul 11, 2020

Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

1d2ff0a6

bcachefs: Use x-macros for data types · 89fd25be

Kent Overstreet authored Jul 09, 2020

Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

89fd25be

bcachefs: Fix short buffered writes · 912bdf17

Kent Overstreet authored Jul 09, 2020

In the buffered write path, we have to check for short writes that write
to the full page, where the page wasn't UpToDate; when this happens, the
page is partly garbage, so we have to zero it out and revert that part
of the write.

This check was wrong - we reverted total from copied, but didn't revert
the iov_iter, probably also leading to corrupted writes.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

912bdf17

bcachefs: Allow existing stripes to be updated with new data buckets · 0ba95acc

Kent Overstreet authored Jun 30, 2020

This solves internal fragmentation within stripes. We already have
copygc, which evacuates buckets that are partially or mostly empty, but
it's up to the ec code that manages stripes to deal with stripes that
have empty buckets in them.

This patch changes the path for creating new stripes to check if there
are existing stripes with empty buckets - and if so, update them with new
data buckets instead of creating new stripes.

TODO: improve the disk space accounting so that we only use this
(more expensive) path when we have too much fragmentation in existing
stripes.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

0ba95acc

bcachefs: Refactor stripe creation · f6b94a3b

Kent Overstreet authored Jul 06, 2020

Prep work for the patch to update existing stripes with new data blocks.
This moves allocating new stripes into ec.c, and also sets up the data
structures so that we can handle only allocating some of the blocks in a
stripe.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

f6b94a3b

bcachefs: Move stripe creation to workqueue · 703e2a43

Kent Overstreet authored Jul 06, 2020

This is mainly to solve a lock ordering issue, and also simplifies the
code a bit.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

703e2a43

bcachefs: Improve stripe triggers/heap code · ba6dd1dd

Kent Overstreet authored Jul 06, 2020

Soon we'll be able to modify existing stripes - replacing empty blocks
with new blocks and new p/q blocks. This patch updates the trigger code
to handle pointers changing in an existing stripe; also, it
significantly improves how the stripes heap works, which means we can
get rid of the stripe creation/deletion lock.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

ba6dd1dd

bcachefs: Rework triggers interface · e63534a2

Kent Overstreet authored Jul 06, 2020

The trigger for stripe keys is shortly going to need both the old and
the new key passed to the trigger - this patch does that rework.

For now, this just changes the in memory triggers, and this doesn't
change how extent triggers work.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

e63534a2

bcachefs: Kill BTREE_TRIGGER_NOOVERWRITES · 697e45b2

Kent Overstreet authored Jul 06, 2020

This is prep work for reworking the triggers machinery - we have
triggers that need to know both the old and the new key.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

697e45b2

bcachefs: Mark btree nodes as needing rewrite when not all replicas are RW · fff899b1

Kent Overstreet authored Jul 03, 2020

This fixes a bug where recovery fails when one of the devices is read
only.

Also - consolidate the "must rewrite this node to insert it" behind a
new btree node flag.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

fff899b1

bcachefs: Use blk_status_to_str() · 306d40df

Kent Overstreet authored Jul 02, 2020

Improved error messages are always a good thing.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

306d40df

bcachefs: Don't cap ios in dio write path at 2 MB · 52fbb7c8

Kent Overstreet authored Jun 30, 2020

It appears this was erroneous; a different bug was responsible.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

52fbb7c8

bcachefs: Refactor dio write code to reinit bch_write_op · 042a1f26

Kent Overstreet authored Jun 29, 2020

This fixes a bug where the BCH_WRITE_SKIP_CLOSURE_PUT flag was set
incorrectly, causing the completion to be delivered multiple times.
Oops.
Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

042a1f26