image_convolution

HIP-Doc Image Convolution Example

Description

This example demonstrates 2D image convolution using HIP, implementing a box blur filter on images. The application uses the stb_image library for image loading and saving, making it easy to work with common image formats like JPEG and PNG.

For more information on HIP programming and stencil operations, please refer to the HIP documentation.

Application flow

An input image is loaded from disk using the stb_image library.
A convolution mask (box blur filter) is initialized on the host.
Device memory is allocated for the input image, output image, and convolution mask.
The input image and mask are copied from host to device memory.
A 2D grid of thread blocks is configured based on the image dimensions.
The convolution kernel is launched on the GPU.
Each thread processes one pixel across all color channels:
- Applies the convolution mask to the neighborhood around the pixel
- Handles boundary conditions with zero-padding
- Normalizes pixel values between 0-255
The kernel launch is checked for errors and the device is synchronized.
The processed output image is copied back from device to host memory.
The output image is saved to disk in JPEG format.
All device memory is freed.

Convolution Implementation

The kernel implements 2D convolution with the following features:

Parallel Processing: Each thread processes one pixel location
Multi-channel Support: Handles RGB images by processing each channel independently
Boundary Handling: Uses zero-padding for pixels near image edges
Box Blur Filter: Applies a uniform averaging filter (33x33 default)
Normalized Output: Maintains pixel values in valid 0-255 range

The box blur filter computes the average of all pixels in the mask region, creating a smoothing/blurring effect.

Key APIs and Concepts

HIP Runtime APIs

hipMalloc: Allocates device memory
hipMemcpy: Transfers data between host and device
hipFree: Frees device memory
hipGetLastError: Retrieves the last error from a runtime call
hipDeviceSynchronize: Blocks until all device operations complete

Device Code Features

__global__: Declares a kernel function callable from host
blockIdx, blockDim, threadIdx: Built-in variables for grid/block indexing
2D thread indexing for image processing

Stencil Pattern

The convolution operation is a classic stencil computation where each output element depends on a neighborhood of input elements. Key characteristics:

Regular access pattern (structured grid)
Halo region handling (boundary conditions)
Data reuse opportunities (same input pixels used by multiple output pixels)

Image Processing

Uses stb_image.h for loading images (JPEG, PNG, BMP, etc.)
Uses stb_image_write.h for saving images
Processes images in row-major order with interleaved color channels

Configuration

Default input: test.jpg
Default output: test_out.jpg
Default mask size: 33x33 (box blur)
Block size: 16x16 threads
Command line usage: ./hip_image_convolution [input.jpg] [output.jpg]

Performance Considerations

Potential optimizations for this algorithm:

Use shared memory to cache frequently accessed pixels
Separate kernels for different color channels to improve memory coalescing
Use texture memory for automatic caching and filtering
Implement separable convolution for larger kernels (two 1D passes instead of one 2D pass)

Demonstrated API calls

HIP runtime

Device symbols

blockDim
blockIdx
threadIdx

Host symbols

hipDeviceSynchronize
hipFree
hipGetLastError
hipMalloc
hipMemcpy
hipMemcpyHostToDevice
hipMemcpyDeviceToHost

External Libraries

stb_image.h: Image loading (supports JPEG, PNG, BMP, TGA, etc.)
stb_image_write.h: Image saving (JPEG, PNG, BMP, TGA)

Name		Name	Last commit message	Last commit date
parent directory ..
.gitignore		.gitignore
CMakeLists.txt		CMakeLists.txt
Makefile		Makefile
README.md		README.md
image.h		image.h
image_convolution_vs2017.sln		image_convolution_vs2017.sln
image_convolution_vs2017.vcxproj		image_convolution_vs2017.vcxproj
image_convolution_vs2017.vcxproj.filters		image_convolution_vs2017.vcxproj.filters
image_convolution_vs2019.sln		image_convolution_vs2019.sln
image_convolution_vs2019.vcxproj		image_convolution_vs2019.vcxproj
image_convolution_vs2019.vcxproj.filters		image_convolution_vs2019.vcxproj.filters
image_convolution_vs2022.sln		image_convolution_vs2022.sln
image_convolution_vs2022.vcxproj		image_convolution_vs2022.vcxproj
image_convolution_vs2022.vcxproj.filters		image_convolution_vs2022.vcxproj.filters
main.hip		main.hip
stb_image.h		stb_image.h
stb_image_write.h		stb_image_write.h
test.jpg		test.jpg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

HIP-Doc Image Convolution Example

Description

Application flow

Convolution Implementation

Key APIs and Concepts

HIP Runtime APIs

Device Code Features

Stencil Pattern

Image Processing

Configuration

Performance Considerations

Demonstrated API calls

HIP runtime

Device symbols

Host symbols

External Libraries

FilesExpand file tree

image_convolution

Directory actions

More options

Directory actions

More options

Latest commit

History

image_convolution

Folders and files

parent directory

README.md

HIP-Doc Image Convolution Example

Description

Application flow

Convolution Implementation

Key APIs and Concepts

HIP Runtime APIs

Device Code Features

Stencil Pattern

Image Processing

Configuration

Performance Considerations

Demonstrated API calls

HIP runtime

Device symbols

Host symbols

External Libraries