- 30 Nov, 2020 (40 commits)
-
Chuck Lever authored
Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
-
Chuck Lever authored
Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
-
Chuck Lever authored
Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
-
Chuck Lever authored
Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
-
Chuck Lever authored
Refactor for clarity and to move infrequently-used code out of line.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
-
Chuck Lever authored
Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
-
Chuck Lever authored
Because the fattr4 is now managed in an xdr_stream, all that is needed is to store the initial position of the stream before decoding the attribute list. Then the actual length of the list is computed using the final stream position, after decoding is complete.

No behavior change is expected.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
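A minimal user-space sketch of the length computation described above, using a toy cursor type; toy_stream and its helpers are illustrative stand-ins, not the kernel's struct xdr_stream:

#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Toy stand-in for an xdr_stream decode cursor. */
struct toy_stream {
	const uint8_t *buf;	/* start of the encoded message */
	size_t pos;		/* current decode offset in bytes */
};

static size_t toy_stream_pos(const struct toy_stream *xdr)
{
	return xdr->pos;
}

/* Pretend to decode a fattr4 attribute list, advancing the cursor. */
static void toy_decode_attrs(struct toy_stream *xdr, size_t attr_bytes)
{
	xdr->pos += attr_bytes;
}

int main(void)
{
	uint8_t buf[64] = { 0 };
	struct toy_stream xdr = { .buf = buf, .pos = 8 };

	size_t start = toy_stream_pos(&xdr);	/* position before the list */
	toy_decode_attrs(&xdr, 20);		/* decode the attributes */
	size_t attrlen = toy_stream_pos(&xdr) - start;	/* actual list length */

	assert(attrlen == 20);
	return 0;
}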
-
Chuck Lever authored
Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
-
Chuck Lever authored
Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
-
Chuck Lever authored
Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
-
Chuck Lever authored
Convert the READ_BUF macro in nfs4xdr.c from open code to instead use the new xdr_stream-style decoders already in use by the encode side (and by the in-kernel NFS client implementation).

Once this conversion is done, each individual NFSv4 argument decoder can be independently cleaned up to replace these macros with C code.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
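A rough user-space model of the xdr_stream-style decoding that replaces open-coded READ_BUF-style macros; toy_stream and toy_decode_u32 are hypothetical stand-ins for kernel helpers such as xdr_stream_decode_u32():

#include <arpa/inet.h>	/* ntohl() */
#include <stdbool.h>
#include <stdint.h>
#include <stdio.h>
#include <string.h>

struct toy_stream {
	const uint8_t *p;	/* next byte to decode */
	const uint8_t *end;	/* one past the last valid byte */
};

/*
 * Stream-style decoder: bounds-check, copy out one 32-bit XDR
 * integer, and advance the cursor. Returns false on overrun,
 * centralizing what an open-coded macro would repeat at each
 * call site.
 */
static bool toy_decode_u32(struct toy_stream *xdr, uint32_t *val)
{
	if (xdr->end - xdr->p < 4)
		return false;
	memcpy(val, xdr->p, 4);
	*val = ntohl(*val);
	xdr->p += 4;
	return true;
}

int main(void)
{
	uint8_t wire[] = { 0x00, 0x00, 0x00, 0x2a };	/* 42, big-endian */
	struct toy_stream xdr = { wire, wire + sizeof(wire) };
	uint32_t v;

	if (!toy_decode_u32(&xdr, &v))
		return 1;
	printf("decoded %u\n", v);	/* prints "decoded 42" */
	return 0;
}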
-
Chuck Lever authored
For troubleshooting purposes, record failures to decode NFSv4 operation arguments and encode operation results.

trace_nfsd_compound_decode_err() replaces the dprintk() call sites that are embedded in READ_* macros that are about to be removed.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
-
Chuck Lever authored
For troubleshooting purposes, record GARBAGE_ARGS and CANT_ENCODE failures.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
-
Chuck Lever authored
Start off the conversion to xdr_stream by de-duplicating the functions that decode void arguments and encode void results.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
-
Chuck Lever authored
A "permanent" struct xdr_stream is allocated in struct svc_rqst so that it is usable by all server-side decoders. A per-rqst scratch buffer is also allocated to handle decoding XDR data items that cross page boundaries.

To demonstrate how it will be used, add the first call site for the new svcxdr_init_decode() API.

As an additional part of the overall conversion, add symbolic constants for successful and failed XDR operations. Returning "0" is overloaded: sometimes it means something failed, but sometimes it means success. To make it clearer when XDR decoding functions succeed or fail, introduce symbolic constants.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
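A compilable sketch of the symbolic-constant idea; the names here are hypothetical stand-ins, and the kernel's actual constant names may differ:

#include <stdio.h>

/*
 * Hypothetical stand-ins for the symbolic constants described above.
 * The point: a bare "0" return is ambiguous, a named constant is not.
 */
enum {
	TOY_XDR_FAIL = 0,	/* decoding failed */
	TOY_XDR_OK   = 1,	/* decoding succeeded */
};

/* A void argument has no body, so decoding trivially succeeds. */
static int toy_decode_void(void)
{
	return TOY_XDR_OK;
}

int main(void)
{
	printf("%s\n", toy_decode_void() == TOY_XDR_OK ? "ok" : "fail");
	return 0;
}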
-
Chuck Lever authored
Clean up: De-duplicate some frequently-used code.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
-
Huang Guobin authored
Fix to return a PTR_ERR() error code from the error handling path in nfsd_file_cache_init() instead of 0, as is done elsewhere in this function.

Fixes: 65294c1f ("nfsd: add a new struct file caching facility to nfsd")
Signed-off-by: Huang Guobin <huangguobin4@huawei.com>
Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
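A user-space model of the bug class this patch fixes; the names are illustrative, not the nfsd_file_cache_init() code itself:

#include <errno.h>
#include <stdio.h>

/*
 * Model of the bug: an error path that falls through to "return ret;"
 * while ret still holds 0. In the kernel the object would be an
 * ERR_PTR() value and the fix assigns PTR_ERR() to ret before jumping
 * to the error label.
 */
static long fake_create(int fail)
{
	return fail ? -ENOMEM : 1234;	/* negative value models ERR_PTR() */
}

static int cache_init(int fail)
{
	int ret = 0;
	long obj = fake_create(fail);

	if (obj < 0) {
		ret = (int)obj;	/* the fix: propagate the error, not 0 */
		goto out_err;
	}
	return 0;
out_err:
	return ret;
}

int main(void)
{
	printf("ok path: %d, error path: %d\n", cache_init(0), cache_init(1));
	return 0;
}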
-
Chuck Lever authored
Clean up. The file was contributed in 2014 by Christoph Hellwig in commit 31ef83dc ("nfsd: add trace events").

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
-
Chuck Lever authored
Clean up: the %p format directive already adds its own "0x" prefix.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
-
Chuck Lever authored
Display all currently possible NFSD_MAY permission flags. Move show_nf_may and give it a more generic name, because the NFSD_MAY permission flags are used in other places besides the file cache.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
-
Chuck Lever authored
Commit c509f15a ("SUNRPC: Split the xdr_buf event class") added display of the rqst's XID to the svc_xdr_buf_class. However, when the recvfrom tracepoint fires, rq_xid has yet to be filled in with the current XID. So it ends up recording the previous XID that was handled by that svc_rqst.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
-
Alex Shi authored
The macro is unused; remove it to quiet the gcc warning:

fs/nfsd/nfs3proc.c:702:0: warning: macro "nfsd3_fhandleres" is not used [-Wunused-macros]

Signed-off-by: Alex Shi <alex.shi@linux.alibaba.com>
Cc: "J. Bruce Fields" <bfields@fieldses.org>
Cc: Chuck Lever <chuck.lever@oracle.com>
Cc: linux-nfs@vger.kernel.org
Cc: linux-kernel@vger.kernel.org
Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
-
Tom Rix authored
Signed-off-by: Tom Rix <trix@redhat.com>
Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
-
Chuck Lever authored
An efficient way to handle multiple Read chunks is to post them all together and then take a single completion. This is also how the code is already structured: when the Read completion fires, all portions of the incoming RPC message are available to be assembled.

The difficult problem is setting up the Read sink buffers so that the server pulls the client's data into place, making subsequent pull-up unnecessary. There are several cases:

* No Read chunks. No-op.

* One data item Read chunk. This is the fast case, where the inline part of the RPC-over-RDMA message becomes the head and tail, and the data item chunk is placed in buf->pages.

* A Position-zero Read chunk. Treated like TCP: the Read chunk is pulled into contiguous pages.

+ A Position-zero Read chunk with data item chunks. Treated like TCP: all of the Read chunks are pulled into contiguous pages.

+ Multiple data item chunks. Treated like TCP: the inline part is copied and the data item chunks are pulled into contiguous pages.

The "*" cases are already supported. This patch adds support for the "+" cases.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
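A toy dispatch function modeling the case analysis above; the names are hypothetical, and the real decision logic lives in svcrdma's Read chunk handling:

#include <stdbool.h>
#include <stdio.h>

enum read_strategy {
	READ_NOOP,		/* no Read chunks */
	READ_ONE_DATA_ITEM,	/* fast case: chunk lands in buf->pages */
	READ_PULLUP,		/* "treated like TCP": contiguous pages */
};

static enum read_strategy choose_strategy(unsigned int nchunks,
					   bool has_position_zero)
{
	if (nchunks == 0)
		return READ_NOOP;
	if (!has_position_zero && nchunks == 1)
		return READ_ONE_DATA_ITEM;
	/* Position-zero chunk, or multiple data items: pull up. */
	return READ_PULLUP;
}

int main(void)
{
	printf("none=%d one=%d multi=%d pz=%d\n",
	       choose_strategy(0, false), choose_strategy(1, false),
	       choose_strategy(2, false), choose_strategy(1, true));
	return 0;
}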
-
Chuck Lever authored
As a prerequisite for handling multiple Read chunks in each Read list, convert svc_rdma_recv_read_chunk() to use the new parsed Read chunk list.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
-
Chuck Lever authored
I'm about to change the purpose of ri_chunklen: instead of tracking the number of bytes in one Read chunk, it will track the total number of bytes in the Read list. Rename it for clarity.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
-
Chuck Lever authored
We already have trace_svcrdma_decode_rseg(), which records each ingress Read segment. Instead of reporting those again when they are about to be posted as RDMA Reads, let's fire one tracepoint before posting each type of chunk. So we'll get:

nfsd-1998 [002] 321.666615: svcrdma_decode_rseg: cq.id=4 cid=42 segno=0 position=0 192@0x013ca9ebfae14000:0xb0010b05
nfsd-1998 [002] 321.666615: svcrdma_decode_rseg: cq.id=4 cid=42 segno=1 position=0 7688@0x013ca9ebf914e000:0xb0010a05
nfsd-1998 [002] 321.666615: svcrdma_decode_rseg: cq.id=4 cid=42 segno=2 position=0 28@0x013ca9ebfae15000:0xb0010905
nfsd-1998 [002] 321.666622: svcrdma_decode_rqst: cq.id=4 cid=42 xid=0x013ca9eb vers=1 credits=128 proc=RDMA_NOMSG hdrlen=100
nfsd-1998 [002] 321.666642: svcrdma_post_read_chunk: cq.id=3 cid=112 sqecount=3
kworker/2:1H-221 [002] 321.673949: svcrdma_wc_read: cq.id=3 cid=112 status=SUCCESS (0/0x0)

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
-
Chuck Lever authored
Clean up: These pointers are no longer used.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
-
Chuck Lever authored
Refactor svc_rdma_send_reply_chunk() so that it Sends only the parts of rq_res that do not contain a result payload.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
-
Chuck Lever authored
Refactor: svc_rdma_map_reply_msg() is restructured to DMA map only the parts of rq_res that do not contain a result payload.

This change has been tested to confirm that it does not cause a regression in the no Write chunk and single Write chunk cases. Multiple Write chunks have not been tested.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
-
Chuck Lever authored
When counting the number of SGEs needed to construct a Send request, do not count result payloads. Similarly, when copying the Reply message into the pull-up buffer, do not copy result payloads into the Send buffer.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
-
Chuck Lever authored
Refactor: Instead of re-parsing the ingress RPC Call transport header when constructing the egress RPC Reply transport header, use the new parsed Write list and Reply chunk, which are version-agnostic and already XDR-decoded.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
-
Chuck Lever authored
Refactor: Instead of re-parsing the ingress RPC Call transport header when constructing RDMA Writes, use the new parsed chunk lists for the Write list and Reply chunk, which are version-agnostic and already XDR-decoded.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
-
Chuck Lever authored
Refactor: Don't duplicate header decoding smarts here. Instead, use the new parsed chunk lists.

Note that the XID sanity test is also removed. The XID is already looked up by the cb handler, and is rejected if it's not recognized.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
-
Chuck Lever authored
Refactor: Don't duplicate header decoding smarts here. Instead, use the new parsed chunk lists.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
-
Chuck Lever authored
This simple data structure binds the location of each data payload inside of an RPC message to the chunk that will be used to push it to or pull it from the client. There are several benefits to this small additional overhead:

* It enables support for more than one chunk in incoming Read and Write lists.

* It translates the version-specific on-the-wire format into a generic in-memory structure, enabling support for multiple versions of the RPC/RDMA transport protocol.

* It enables the server to re-organize a chunk list if it needs to adjust where Read chunk data lands in server memory without altering the contents of the XDR-encoded Receive buffer.

Construction of these lists is done while sanity checking each incoming RPC/RDMA header. Subsequent patches will make use of the generated data structures.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
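A hypothetical in-memory model of such a parsed chunk list; field and type names are illustrative, not the kernel's actual structures, and the example values are borrowed from the svcrdma_decode_rseg trace output shown earlier in this log:

#include <stdint.h>
#include <stdio.h>

struct model_segment {
	uint32_t handle;	/* RDMA rkey for this segment */
	uint32_t length;	/* bytes covered by this segment */
	uint64_t offset;	/* remote (client) address */
};

struct model_chunk {
	uint32_t position;	/* XDR offset of the payload in the message */
	unsigned int nsegments;
	struct model_segment segments[4];	/* fixed-size for the sketch */
};

static uint32_t chunk_length(const struct model_chunk *ch)
{
	uint32_t total = 0;

	for (unsigned int i = 0; i < ch->nsegments; i++)
		total += ch->segments[i].length;
	return total;
}

int main(void)
{
	struct model_chunk ch = {
		.position = 0,
		.nsegments = 2,
		.segments = {
			{ 0xb0010b05, 192, 0x013ca9ebfae14000ULL },
			{ 0xb0010a05, 7688, 0x013ca9ebf914e000ULL },
		},
	};

	printf("chunk at position %u carries %u bytes in %u segments\n",
	       ch.position, chunk_length(&ch), ch.nsegments);
	return 0;
}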
-
Chuck Lever authored
Refactor: Match the control flow of svc_rdma_encode_write_list().

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
-
Chuck Lever authored
The only RPC/RDMA ordering requirement between RDMA Writes and RDMA Sends is that the responder must post the Writes on the Send queue before posting the Send that conveys the RPC Reply for that Write payload.

The Linux NFS server implementation now has a transport method that can post result payload Writes earlier than svc_rdma_sendto:

->xpo_result_payload()

This gets RDMA Writes going earlier so they are more likely to be complete at the remote end before the Send completes.

Some care must be taken with pulled-up Replies. We don't want to push the Write chunk and then send the same payload data via Send.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
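A toy model of the ordering rule, under the assumption stated above: the Writes for a result payload are posted before the Send that carries the RPC Reply. Real code posts ib_send_wr chains; this sketch only prints the posting order:

#include <stdio.h>

static void post_write(int payload_id)
{
	printf("post RDMA Write for result payload %d\n", payload_id);
}

static void post_send(void)
{
	printf("post RDMA Send carrying the RPC Reply\n");
}

int main(void)
{
	/* Result-payload Writes go on the Send queue first ... */
	post_write(1);
	post_write(2);
	/* ... then the Send that conveys the Reply is posted last. */
	post_send();
	return 0;
}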
-
Chuck Lever authored
Have the NFSD encoders annotate the boundaries of every direct-data-placement eligible result data payload. Then change svcrdma to use that annotation instead of the xdr->page_len when handling Write chunks.

For NFSv4 on RDMA, that makes it possible to recognize multiple result payloads per compound. This is a prerequisite for supporting multiple Write chunks per RPC transaction.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
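A small sketch of the annotation idea: a hypothetical mark_result_payload() helper records each payload's offset and length as the encoder emits the reply. All names here are assumptions for illustration; the kernel's actual hook is the result-payload transport method described in the preceding entry:

#include <stddef.h>
#include <stdio.h>

struct reply_ctx {
	char buf[128];		/* stand-in for rq_res */
	size_t len;		/* bytes encoded so far */
	size_t payload_off[2];
	size_t payload_len[2];
	unsigned int npayloads;
};

/* Record where one DDP-eligible result payload sits in the reply. */
static void mark_result_payload(struct reply_ctx *r, size_t off, size_t len)
{
	r->payload_off[r->npayloads] = off;
	r->payload_len[r->npayloads] = len;
	r->npayloads++;
}

int main(void)
{
	struct reply_ctx r = { .len = 0 };

	r.len = 4;				/* encoded op header */
	mark_result_payload(&r, r.len, 8);	/* first READ payload */
	r.len += 8;
	mark_result_payload(&r, r.len, 16);	/* second READ payload */
	r.len += 16;

	printf("annotated %u result payloads\n", r.npayloads);
	return 0;
}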
-
Chuck Lever authored
Clean up: "result payload" is a less confusing name for these payloads. "READ payload" reflects only the NFS usage.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
-