ICP  1.1.0
 Hosted by GitHub
Public Types | Public Member Functions | Public Attributes | List of all members
cl_algo::ICP::ICPWeights Class Reference

Interface class for the icpComputeReduceWeights kernel. More...

#include <algorithms.hpp>

Public Types

enum  Memory : uint8_t {
  Memory::H_IN, Memory::H_OUT_W, Memory::H_OUT_SUM_W, Memory::D_IN,
  Memory::D_OUT_W, Memory::D_GW, Memory::D_OUT_SUM_W
}
 Enumerates the memory objects handled by the class. More...
 

Public Member Functions

 ICPWeights (clutils::CLEnv &_env, clutils::CLEnvInfo< 1 > _info)
 Configures an OpenCL environment as specified by _info. More...
 
cl::Memory & get (ICPWeights::Memory mem)
 Returns a reference to an internal memory object. More...
 
void init (unsigned int _n, Staging _staging=Staging::IO)
 Configures kernel execution parameters. More...
 
void write (ICPWeights::Memory mem=ICPWeights::Memory::D_IN, void *ptr=nullptr, bool block=CL_FALSE, const std::vector< cl::Event > *events=nullptr, cl::Event *event=nullptr)
 Performs a data transfer to a device buffer. More...
 
void * read (ICPWeights::Memory mem=ICPWeights::Memory::H_OUT_SUM_W, bool block=CL_TRUE, const std::vector< cl::Event > *events=nullptr, cl::Event *event=nullptr)
 Performs a data transfer to a staging buffer. More...
 
void run (const std::vector< cl::Event > *events=nullptr, cl::Event *event=nullptr)
 Executes the necessary kernels. More...
 
template<typename period >
double run (clutils::GPUTimer< period > &timer, const std::vector< cl::Event > *events=nullptr)
 Executes the necessary kernels. More...
 

Public Attributes

rbc_dist_id * hPtrIn
 
cl_float * hPtrOutW
 
cl_double * hPtrOutSW
 

Detailed Description

Interface class for the icpComputeReduceWeights kernel.

The icpComputeReduceWeights kernel computes weights for pairs of points in the fixed and moving sets, and also reduces them to get their sum. For more details, look at the kernel's documentation.

Note
The icpComputeReduceWeights kernel is available in kernels/icp_kernels.cl.
The class creates its own buffers. If you would like to provide your own buffers, call get to get references to the placeholders within the class and assign them to your buffers. You will have to do this strictly before the call to init. You can also call get (after the call to init) to get a reference to a buffer within the class and assign it to another kernel class instance further down in your task pipeline.

The following input/output OpenCL memory objects are created by an ICPWeights instance:

Name Type Placement I/O Use Properties Size
H_IN Buffer Host I Staging CL_MEM_READ_WRITE \(n*sizeof\ (rbc\_dist\_id)\)
H_OUT_W Buffer Host O Staging CL_MEM_READ_WRITE \(n*sizeof\ (cl\_float) \)
H_OUT_SUM_W Buffer Host O Staging CL_MEM_READ_WRITE \( sizeof\ (cl\_double) \)
D_IN Buffer Device I Processing CL_MEM_READ_ONLY \(n*sizeof\ (rbc\_dist\_id)\)
D_OUT_W Buffer Device O Processing CL_MEM_WRITE_ONLY \(n*sizeof\ (cl\_float) \)
D_OUT_SUM_W Buffer Device O Processing CL_MEM_WRITE_ONLY \( sizeof\ (cl\_double) \)

Member Enumeration Documentation

enum cl_algo::ICP::ICPWeights::Memory : uint8_t
strong

Enumerates the memory objects handled by the class.

Note
H_* names refer to staging buffers on the host.
D_* names refer to buffers on the device.
Enumerator
H_IN 

Input staging buffer for the distances between pairs of points in the fixed and moving sets.

H_OUT_W 

Output staging buffer for the weights.

H_OUT_SUM_W 

Output staging buffer for the sum of the weights.

D_IN 

Input buffer for the distances between pairs of points in the fixed and moving sets.

D_OUT_W 

Output buffer for the weights.

D_GW 

Buffer of block sums of weights.

D_OUT_SUM_W 

Output buffer for the sum of the weights.

Constructor & Destructor Documentation

cl_algo::ICP::ICPWeights::ICPWeights ( clutils::CLEnv &  _env,
clutils::CLEnvInfo< 1 >  _info 
)

Configures an OpenCL environment as specified by _info.

Parameters
[in]_envopencl environment.
[in]_infoopencl configuration. It specifies the context, queue, etc, to be used.

Member Function Documentation

cl::Memory & cl_algo::ICP::ICPWeights::get ( ICPWeights::Memory  mem)

Returns a reference to an internal memory object.

This interface exists to allow CL memory sharing between different kernels.

Parameters
[in]memenumeration value specifying the requested memory object.
Returns
A reference to the requested memory object.
void cl_algo::ICP::ICPWeights::init ( unsigned int  _n,
Staging  _staging = Staging::IO 
)

Configures kernel execution parameters.

Sets up memory objects as necessary, and defines the kernel workspaces.

Note
If you have assigned a memory object to one member variable of the class before the call to init, then that memory will be maintained. Otherwise, a new memory object will be created.
Parameters
[in]_nnumber of elements in the input sets.
[in]_stagingflag to indicate whether or not to instantiate the staging buffers.
void * cl_algo::ICP::ICPWeights::read ( ICPWeights::Memory  mem = ICPWeights::Memory::H_OUT_SUM_W,
bool  block = CL_TRUE,
const std::vector< cl::Event > *  events = nullptr,
cl::Event *  event = nullptr 
)

Performs a data transfer to a staging buffer.

The transfer happens from a device buffer to the associated (specified) staging buffer on the host.

Parameters
[in]memenumeration value specifying an output staging buffer.
[in]blocka flag to indicate whether to perform a blocking or a non-blocking operation.
[in]eventsa wait-list of events.
[out]eventevent associated with the read operation to the staging buffer.
void cl_algo::ICP::ICPWeights::run ( const std::vector< cl::Event > *  events = nullptr,
cl::Event *  event = nullptr 
)

Executes the necessary kernels.

The function call is non-blocking.

Parameters
[in]eventsa wait-list of events.
[out]eventevent associated with the kernel execution.
template<typename period >
double cl_algo::ICP::ICPWeights::run ( clutils::GPUTimer< period > &  timer,
const std::vector< cl::Event > *  events = nullptr 
)
inline

Executes the necessary kernels.

This run instance is used for profiling.

Parameters
[in]timerGPUTimer that does the profiling of the kernel executions.
[in]eventsa wait-list of events.
Returns
Τhe total execution time measured by the timer.
void cl_algo::ICP::ICPWeights::write ( ICPWeights::Memory  mem = ICPWeights::Memory::D_IN,
void *  ptr = nullptr,
bool  block = CL_FALSE,
const std::vector< cl::Event > *  events = nullptr,
cl::Event *  event = nullptr 
)

Performs a data transfer to a device buffer.

The transfer happens from a staging buffer on the host to the associated (specified) device buffer.

Parameters
[in]memenumeration value specifying an input device buffer.
[in]ptra pointer to an array holding input data. If not NULL, the data from ptr will be copied to the associated staging buffer.
[in]blocka flag to indicate whether to perform a blocking or a non-blocking operation.
[in]eventsa wait-list of events.
[out]eventevent associated with the write operation to the device buffer.

Member Data Documentation

rbc_dist_id* cl_algo::ICP::ICPWeights::hPtrIn

Mapping of the input staging buffer for the distances.

cl_double* cl_algo::ICP::ICPWeights::hPtrOutSW

Mapping of the output staging buffer for the sum of weights.

cl_float* cl_algo::ICP::ICPWeights::hPtrOutW

Mapping of the output staging buffer for the weights.


The documentation for this class was generated from the following files: