Commit 77fe6612 authored by Vinod Koul's avatar Vinod Koul Committed by Jonathan Corbet

dmaengine: doc: ReSTize provider doc

This moves and converts provider file with some format changes
for RST style
Signed-off-by: default avatarVinod Koul <vinod.koul@intel.com>
Signed-off-by: default avatarJonathan Corbet <corbet@lwn.net>
parent 8a0698c1
...@@ -5,6 +5,17 @@ DMAEngine documentation ...@@ -5,6 +5,17 @@ DMAEngine documentation
DMAEngine documentation provides documents for various aspects of DMAEngine DMAEngine documentation provides documents for various aspects of DMAEngine
framework. framework.
DMAEngine documentation
-----------------------
This book helps with DMAengine internal APIs and guide for DMAEngine device
driver writers.
.. toctree::
:maxdepth: 1
provider
.. only:: subproject .. only:: subproject
Indices Indices
......
==================================
DMAengine controller documentation DMAengine controller documentation
================================== ==================================
Hardware Introduction Hardware Introduction
+++++++++++++++++++++ =====================
Most of the Slave DMA controllers have the same general principles of Most of the Slave DMA controllers have the same general principles of
operations. operations.
...@@ -81,7 +82,7 @@ support other kind of transfers or memory operations that dmaengine ...@@ -81,7 +82,7 @@ support other kind of transfers or memory operations that dmaengine
support and will be detailed later in this document. support and will be detailed later in this document.
DMA Support in Linux DMA Support in Linux
++++++++++++++++++++ ====================
Historically, DMA controller drivers have been implemented using the Historically, DMA controller drivers have been implemented using the
async TX API, to offload operations such as memory copy, XOR, async TX API, to offload operations such as memory copy, XOR,
...@@ -96,11 +97,11 @@ ensure that it stayed compatible. ...@@ -96,11 +97,11 @@ ensure that it stayed compatible.
For more information on the Async TX API, please look the relevant For more information on the Async TX API, please look the relevant
documentation file in Documentation/crypto/async-tx-api.txt. documentation file in Documentation/crypto/async-tx-api.txt.
DMAEngine Registration DMAEngine APIs
++++++++++++++++++++++ ==============
struct dma_device Initialization ``struct dma_device`` Initialization
-------------------------------- ------------------------------------
Just like any other kernel framework, the whole DMAEngine registration Just like any other kernel framework, the whole DMAEngine registration
relies on the driver filling a structure and registering against the relies on the driver filling a structure and registering against the
...@@ -110,36 +111,40 @@ The first thing you need to do in your driver is to allocate this ...@@ -110,36 +111,40 @@ The first thing you need to do in your driver is to allocate this
structure. Any of the usual memory allocators will do, but you'll also structure. Any of the usual memory allocators will do, but you'll also
need to initialize a few fields in there: need to initialize a few fields in there:
* channels: should be initialized as a list using the - channels: should be initialized as a list using the
INIT_LIST_HEAD macro for example INIT_LIST_HEAD macro for example
* src_addr_widths: - src_addr_widths:
- should contain a bitmask of the supported source transfer width should contain a bitmask of the supported source transfer width
* dst_addr_widths: - dst_addr_widths:
- should contain a bitmask of the supported destination transfer should contain a bitmask of the supported destination transfer width
width
- directions:
* directions: should contain a bitmask of the supported slave directions
- should contain a bitmask of the supported slave directions (i.e. excluding mem2mem transfers)
(i.e. excluding mem2mem transfers)
- residue_granularity:
* residue_granularity:
- Granularity of the transfer residue reported to dma_set_residue. - Granularity of the transfer residue reported to dma_set_residue.
- This can be either: This can be either:
+ Descriptor
-> Your device doesn't support any kind of residue - Descriptor
reporting. The framework will only know that a particular
transaction descriptor is done. - Your device doesn't support any kind of residue
+ Segment reporting. The framework will only know that a particular
-> Your device is able to report which chunks have been transaction descriptor is done.
transferred
+ Burst - Segment
-> Your device is able to report which burst have been
transferred - Your device is able to report which chunks have been transferred
* dev: should hold the pointer to the struct device associated - Burst
to your current driver instance.
- Your device is able to report which burst have been transferred
- dev: should hold the pointer to the ``struct device`` associated
to your current driver instance.
Supported transaction types Supported transaction types
--------------------------- ---------------------------
...@@ -147,75 +152,96 @@ Supported transaction types ...@@ -147,75 +152,96 @@ Supported transaction types
The next thing you need is to set which transaction types your device The next thing you need is to set which transaction types your device
(and driver) supports. (and driver) supports.
Our dma_device structure has a field called cap_mask that holds the Our ``dma_device structure`` has a field called cap_mask that holds the
various types of transaction supported, and you need to modify this various types of transaction supported, and you need to modify this
mask using the dma_cap_set function, with various flags depending on mask using the dma_cap_set function, with various flags depending on
transaction types you support as an argument. transaction types you support as an argument.
All those capabilities are defined in the dma_transaction_type enum, All those capabilities are defined in the ``dma_transaction_type enum``,
in include/linux/dmaengine.h in ``include/linux/dmaengine.h``
Currently, the types available are: Currently, the types available are:
* DMA_MEMCPY
- The device is able to do memory to memory copies - DMA_MEMCPY
* DMA_XOR - The device is able to do memory to memory copies
- The device is able to perform XOR operations on memory areas
- Used to accelerate XOR intensive tasks, such as RAID5 - DMA_XOR
* DMA_XOR_VAL - The device is able to perform XOR operations on memory areas
- The device is able to perform parity check using the XOR
algorithm against a memory buffer. - Used to accelerate XOR intensive tasks, such as RAID5
* DMA_PQ - DMA_XOR_VAL
- The device is able to perform RAID6 P+Q computations, P being a
simple XOR, and Q being a Reed-Solomon algorithm. - The device is able to perform parity check using the XOR
algorithm against a memory buffer.
* DMA_PQ_VAL
- The device is able to perform parity check using RAID6 P+Q - DMA_PQ
algorithm against a memory buffer.
- The device is able to perform RAID6 P+Q computations, P being a
* DMA_INTERRUPT simple XOR, and Q being a Reed-Solomon algorithm.
- The device is able to trigger a dummy transfer that will
generate periodic interrupts - DMA_PQ_VAL
- Used by the client drivers to register a callback that will be
called on a regular basis through the DMA controller interrupt - The device is able to perform parity check using RAID6 P+Q
algorithm against a memory buffer.
* DMA_PRIVATE
- The devices only supports slave transfers, and as such isn't - DMA_INTERRUPT
available for async transfers.
- The device is able to trigger a dummy transfer that will
* DMA_ASYNC_TX generate periodic interrupts
- Must not be set by the device, and will be set by the framework
if needed - Used by the client drivers to register a callback that will be
- /* TODO: What is it about? */ called on a regular basis through the DMA controller interrupt
* DMA_SLAVE - DMA_PRIVATE
- The device can handle device to memory transfers, including
scatter-gather transfers. - The devices only supports slave transfers, and as such isn't
- While in the mem2mem case we were having two distinct types to available for async transfers.
deal with a single chunk to copy or a collection of them, here,
we just have a single transaction type that is supposed to - DMA_ASYNC_TX
handle both.
- If you want to transfer a single contiguous memory buffer, - Must not be set by the device, and will be set by the framework
simply build a scatter list with only one item. if needed
* DMA_CYCLIC - TODO: What is it about?
- The device can handle cyclic transfers.
- A cyclic transfer is a transfer where the chunk collection will - DMA_SLAVE
loop over itself, with the last item pointing to the first.
- It's usually used for audio transfers, where you want to operate - The device can handle device to memory transfers, including
on a single ring buffer that you will fill with your audio data. scatter-gather transfers.
* DMA_INTERLEAVE - While in the mem2mem case we were having two distinct types to
- The device supports interleaved transfer. deal with a single chunk to copy or a collection of them, here,
- These transfers can transfer data from a non-contiguous buffer we just have a single transaction type that is supposed to
to a non-contiguous buffer, opposed to DMA_SLAVE that can handle both.
transfer data from a non-contiguous data set to a continuous
destination buffer. - If you want to transfer a single contiguous memory buffer,
- It's usually used for 2d content transfers, in which case you simply build a scatter list with only one item.
want to transfer a portion of uncompressed data directly to the
display to print it - DMA_CYCLIC
- The device can handle cyclic transfers.
- A cyclic transfer is a transfer where the chunk collection will
loop over itself, with the last item pointing to the first.
- It's usually used for audio transfers, where you want to operate
on a single ring buffer that you will fill with your audio data.
- DMA_INTERLEAVE
- The device supports interleaved transfer.
- These transfers can transfer data from a non-contiguous buffer
to a non-contiguous buffer, opposed to DMA_SLAVE that can
transfer data from a non-contiguous data set to a continuous
destination buffer.
- It's usually used for 2d content transfers, in which case you
want to transfer a portion of uncompressed data directly to the
display to print it
These various types will also affect how the source and destination These various types will also affect how the source and destination
addresses change over time. addresses change over time.
...@@ -236,168 +262,225 @@ The functions that we have to fill in there, and hence have to ...@@ -236,168 +262,225 @@ The functions that we have to fill in there, and hence have to
implement, obviously depend on the transaction types you reported as implement, obviously depend on the transaction types you reported as
supported. supported.
* device_alloc_chan_resources - ``device_alloc_chan_resources``
* device_free_chan_resources
- These functions will be called whenever a driver will call - ``device_free_chan_resources``
dma_request_channel or dma_release_channel for the first/last
time on the channel associated to that driver. - These functions will be called whenever a driver will call
- They are in charge of allocating/freeing all the needed ``dma_request_channel`` or ``dma_release_channel`` for the first/last
resources in order for that channel to be useful for your time on the channel associated to that driver.
driver.
- These functions can sleep. - They are in charge of allocating/freeing all the needed
resources in order for that channel to be useful for your driver.
* device_prep_dma_*
- These functions are matching the capabilities you registered - These functions can sleep.
previously.
- These functions all take the buffer or the scatterlist relevant - ``device_prep_dma_*``
for the transfer being prepared, and should create a hardware
descriptor or a list of hardware descriptors from it - These functions are matching the capabilities you registered
- These functions can be called from an interrupt context previously.
- Any allocation you might do should be using the GFP_NOWAIT
flag, in order not to potentially sleep, but without depleting - These functions all take the buffer or the scatterlist relevant
the emergency pool either. for the transfer being prepared, and should create a hardware
- Drivers should try to pre-allocate any memory they might need descriptor or a list of hardware descriptors from it
during the transfer setup at probe time to avoid putting to
much pressure on the nowait allocator. - These functions can be called from an interrupt context
- It should return a unique instance of the - Any allocation you might do should be using the GFP_NOWAIT
dma_async_tx_descriptor structure, that further represents this flag, in order not to potentially sleep, but without depleting
particular transfer. the emergency pool either.
- This structure can be initialized using the function - Drivers should try to pre-allocate any memory they might need
dma_async_tx_descriptor_init. during the transfer setup at probe time to avoid putting to
- You'll also need to set two fields in this structure: much pressure on the nowait allocator.
+ flags:
TODO: Can it be modified by the driver itself, or - It should return a unique instance of the
should it be always the flags passed in the arguments ``dma_async_tx_descriptor structure``, that further represents this
particular transfer.
+ tx_submit: A pointer to a function you have to implement,
that is supposed to push the current - This structure can be initialized using the function
transaction descriptor to a pending queue, waiting ``dma_async_tx_descriptor_init``.
for issue_pending to be called.
- In this structure the function pointer callback_result can be - You'll also need to set two fields in this structure:
initialized in order for the submitter to be notified that a
transaction has completed. In the earlier code the function pointer - flags:
callback has been used. However it does not provide any status to the TODO: Can it be modified by the driver itself, or
transaction and will be deprecated. The result structure defined as should it be always the flags passed in the arguments
dmaengine_result that is passed in to callback_result has two fields:
+ result: This provides the transfer result defined by - tx_submit: A pointer to a function you have to implement,
dmaengine_tx_result. Either success or some error that is supposed to push the current transaction descriptor to a
condition. pending queue, waiting for issue_pending to be called.
+ residue: Provides the residue bytes of the transfer for those that
support residue. - In this structure the function pointer callback_result can be
initialized in order for the submitter to be notified that a
* device_issue_pending transaction has completed. In the earlier code the function pointer
- Takes the first transaction descriptor in the pending queue, callback has been used. However it does not provide any status to the
and starts the transfer. Whenever that transfer is done, it transaction and will be deprecated. The result structure defined as
should move to the next transaction in the list. ``dmaengine_result`` that is passed in to callback_result
- This function can be called in an interrupt context has two fields:
* device_tx_status - result: This provides the transfer result defined by
- Should report the bytes left to go over on the given channel ``dmaengine_tx_result``. Either success or some error condition.
- Should only care about the transaction descriptor passed as
argument, not the currently active one on a given channel - residue: Provides the residue bytes of the transfer for those that
- The tx_state argument might be NULL support residue.
- Should use dma_set_residue to report it
- In the case of a cyclic transfer, it should only take into - ``device_issue_pending``
account the current period.
- This function can be called in an interrupt context. - Takes the first transaction descriptor in the pending queue,
and starts the transfer. Whenever that transfer is done, it
* device_config should move to the next transaction in the list.
- Reconfigures the channel with the configuration given as
argument - This function can be called in an interrupt context
- This command should NOT perform synchronously, or on any
currently queued transfers, but only on subsequent ones - ``device_tx_status``
- In this case, the function will receive a dma_slave_config
structure pointer as an argument, that will detail which - Should report the bytes left to go over on the given channel
configuration to use.
- Even though that structure contains a direction field, this - Should only care about the transaction descriptor passed as
field is deprecated in favor of the direction argument given to argument, not the currently active one on a given channel
the prep_* functions
- This call is mandatory for slave operations only. This should NOT be - The tx_state argument might be NULL
set or expected to be set for memcpy operations.
If a driver support both, it should use this call for slave - Should use dma_set_residue to report it
operations only and not for memcpy ones.
- In the case of a cyclic transfer, it should only take into
* device_pause account the current period.
- Pauses a transfer on the channel
- This command should operate synchronously on the channel, - This function can be called in an interrupt context.
pausing right away the work of the given channel
- device_config
* device_resume
- Resumes a transfer on the channel - Reconfigures the channel with the configuration given as argument
- This command should operate synchronously on the channel,
resuming right away the work of the given channel - This command should NOT perform synchronously, or on any
currently queued transfers, but only on subsequent ones
* device_terminate_all
- Aborts all the pending and ongoing transfers on the channel - In this case, the function will receive a ``dma_slave_config``
- For aborted transfers the complete callback should not be called structure pointer as an argument, that will detail which
- Can be called from atomic context or from within a complete configuration to use.
callback of a descriptor. Must not sleep. Drivers must be able
to handle this correctly. - Even though that structure contains a direction field, this
- Termination may be asynchronous. The driver does not have to field is deprecated in favor of the direction argument given to
wait until the currently active transfer has completely stopped. the prep_* functions
See device_synchronize.
- This call is mandatory for slave operations only. This should NOT be
* device_synchronize set or expected to be set for memcpy operations.
- Must synchronize the termination of a channel to the current If a driver support both, it should use this call for slave
context. operations only and not for memcpy ones.
- Must make sure that memory for previously submitted
descriptors is no longer accessed by the DMA controller. - device_pause
- Must make sure that all complete callbacks for previously
submitted descriptors have finished running and none are - Pauses a transfer on the channel
scheduled to run.
- May sleep. - This command should operate synchronously on the channel,
pausing right away the work of the given channel
Misc notes (stuff that should be documented, but don't really know - device_resume
- Resumes a transfer on the channel
- This command should operate synchronously on the channel,
resuming right away the work of the given channel
- device_terminate_all
- Aborts all the pending and ongoing transfers on the channel
- For aborted transfers the complete callback should not be called
- Can be called from atomic context or from within a complete
callback of a descriptor. Must not sleep. Drivers must be able
to handle this correctly.
- Termination may be asynchronous. The driver does not have to
wait until the currently active transfer has completely stopped.
See device_synchronize.
- device_synchronize
- Must synchronize the termination of a channel to the current
context.
- Must make sure that memory for previously submitted
descriptors is no longer accessed by the DMA controller.
- Must make sure that all complete callbacks for previously
submitted descriptors have finished running and none are
scheduled to run.
- May sleep.
Misc notes
==========
(stuff that should be documented, but don't really know
where to put them) where to put them)
------------------------------------------------------------------
* dma_run_dependencies ``dma_run_dependencies``
- Should be called at the end of an async TX transfer, and can be
ignored in the slave transfers case. - Should be called at the end of an async TX transfer, and can be
- Makes sure that dependent operations are run before marking it ignored in the slave transfers case.
as complete.
- Makes sure that dependent operations are run before marking it
* dma_cookie_t as complete.
- it's a DMA transaction ID that will increment over time.
- Not really relevant any more since the introduction of virt-dma dma_cookie_t
that abstracts it away.
- it's a DMA transaction ID that will increment over time.
* DMA_CTRL_ACK
- If clear, the descriptor cannot be reused by provider until the - Not really relevant any more since the introduction of ``virt-dma``
client acknowledges receipt, i.e. has has a chance to establish any that abstracts it away.
dependency chains
- This can be acked by invoking async_tx_ack() DMA_CTRL_ACK
- If set, does not mean descriptor can be reused
- If clear, the descriptor cannot be reused by provider until the
* DMA_CTRL_REUSE client acknowledges receipt, i.e. has has a chance to establish any
- If set, the descriptor can be reused after being completed. It should dependency chains
not be freed by provider if this flag is set.
- The descriptor should be prepared for reuse by invoking - This can be acked by invoking async_tx_ack()
dmaengine_desc_set_reuse() which will set DMA_CTRL_REUSE.
- dmaengine_desc_set_reuse() will succeed only when channel support - If set, does not mean descriptor can be reused
reusable descriptor as exhibited by capabilities
- As a consequence, if a device driver wants to skip the dma_map_sg() and DMA_CTRL_REUSE
dma_unmap_sg() in between 2 transfers, because the DMA'd data wasn't used,
it can resubmit the transfer right after its completion. - If set, the descriptor can be reused after being completed. It should
- Descriptor can be freed in few ways not be freed by provider if this flag is set.
- Clearing DMA_CTRL_REUSE by invoking dmaengine_desc_clear_reuse()
and submitting for last txn - The descriptor should be prepared for reuse by invoking
- Explicitly invoking dmaengine_desc_free(), this can succeed only ``dmaengine_desc_set_reuse()`` which will set DMA_CTRL_REUSE.
when DMA_CTRL_REUSE is already set
- Terminating the channel - ``dmaengine_desc_set_reuse()`` will succeed only when channel support
reusable descriptor as exhibited by capabilities
* DMA_PREP_CMD
- If set, the client driver tells DMA controller that passed data in DMA - As a consequence, if a device driver wants to skip the
API is command data. ``dma_map_sg()`` and ``dma_unmap_sg()`` in between 2 transfers,
- Interpretation of command data is DMA controller specific. It can be because the DMA'd data wasn't used, it can resubmit the transfer right after
used for issuing commands to other peripherals/register reads/register its completion.
writes for which the descriptor should be in different format from
normal data descriptors. - Descriptor can be freed in few ways
- Clearing DMA_CTRL_REUSE by invoking
``dmaengine_desc_clear_reuse()`` and submitting for last txn
- Explicitly invoking ``dmaengine_desc_free()``, this can succeed only
when DMA_CTRL_REUSE is already set
- Terminating the channel
- DMA_PREP_CMD
- If set, the client driver tells DMA controller that passed data in DMA
API is command data.
- Interpretation of command data is DMA controller specific. It can be
used for issuing commands to other peripherals/register reads/register
writes for which the descriptor should be in different format from
normal data descriptors.
General Design Notes General Design Notes
-------------------- ====================
Most of the DMAEngine drivers you'll see are based on a similar design Most of the DMAEngine drivers you'll see are based on a similar design
that handles the end of transfer interrupts in the handler, but defer that handles the end of transfer interrupts in the handler, but defer
...@@ -415,10 +498,11 @@ order to have a shorter idle window (that we can't really avoid ...@@ -415,10 +498,11 @@ order to have a shorter idle window (that we can't really avoid
anyway). anyway).
Glossary Glossary
-------- ========
- Burst: A number of consecutive read or write operations that
can be queued to buffers before being flushed to memory.
- Chunk: A contiguous collection of bursts
Burst: A number of consecutive read or write operations - Transfer: A collection of chunks (be it contiguous or not)
that can be queued to buffers before being flushed to
memory.
Chunk: A contiguous collection of bursts
Transfer: A collection of chunks (be it contiguous or not)
Markdown is supported
0%
or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment