Open links in new tab
  1. GitHub - thomasplantin/cuda-image-processing

    • This repository contains the codebase to run various parallel GPU based algorithms for image processing. Some of the algorithms implemented are image blurring, image flipping, and more. These p… See more

    Test Cases

    To run our test cases, run chmod u+x ex_single.sh chmod u+x ex_batch.sh
    To run our single image mode test cases, run ex_single.sh

    Github
    Repository structure

    main.cu simply parses arguments and calls the necessary filters stb_image/ contains the image library we used. image.h is our wrapper for the image library. filters/ contains each filter implemented in a header file filters/convolve.h is called from every convolution filter. Other filters have their own kernels. expected_output/ stores the expected...

    Github
  1. CUDA (Compute Unified Device Architecture) enables parallel programming on NVIDIA GPUs, allowing you to accelerate compute-intensive tasks by running thousands of threads simultaneously. Below is a minimal CUDA C++ example to help you understand the basic concepts: kernel functions, memory allocation, and data transfer between CPU (host) and GPU (device).

    Steps to Create a Basic CUDA Program

    1. Define a Kernel Function A kernel is a function that runs on the GPU. Use the __global__ qualifier to declare it.

    2. Allocate Memory on the GPU Use cudaMallocManaged() for unified memory accessible by both CPU and GPU.

    3. Launch the Kernel Use the triple angle bracket syntax <<<blocks, threads>>> to specify execution configuration.

    4. Synchronize and Free Memory Call cudaDeviceSynchronize() to wait for GPU execution to finish, then free memory with cudaFree().

    Example Code: Vector Addition

    Feedback
    1. An Even Easier Introduction to CUDA (Updated) - NVIDIA …

      May 2, 2025 · A quick and easy introduction to CUDA programming for GPUs. This post dives into CUDA C++ with a simple, step-by-step parallel programming …

    2. CUDA Processing Examples | dusty-nv/jetson-utils | DeepWiki

      May 31, 2025 · This page provides practical examples demonstrating CUDA-accelerated image processing workflows using jetson-utils. It covers basic image manipulation, drawing operations, …

    3. Intro to image processing with CUDA | The …

      Sep 20, 2011 · CUDA is great for any compute intensive task, and that includes image processing. In this tutorial, we’ll be going over why CUDA is ideal for image …

    4. Getting Started with OpenCV CUDA Module - GeeksforGeeks

      Jun 20, 2024 · When combined with OpenCV, developers can carry out computationally intensive image and video processing much faster than purely CPU-based methods. The OpenCV CUDA module does …

    5. Image Processing with CUDA C++ - GitHub

      The objective of this project is to implement from scratch in CUDA C++ various image processing algorithms. A Cpu and a Gpu version of the following algorithms is implemented and commented:

    6. NVIDIA/cuda-samples | DeepWiki

      Sep 28, 2025 · Currently supporting CUDA Toolkit 13.0, these samples span from basic introductory concepts to advanced domain-specific applications and performance optimization techniques.

    7. Accelerating Image Processing with NVIDIA’s CUDA

      Dec 4, 2024 · Process Image: Capture a frame from the webcam, allocate GPU memory, convert image data, execute the function, and read back the processed …

    8. NVIDIA CUDA - Image Processing

      Simple example that demonstrates use of 3D Textures in CUDA. This sample demonstrates how Discrete Cosine Transform (DCT) for blocks of 8 by 8 pixels can be performed using CUDA: a naive …

    9. Unleashing the Power of PyTorch CUDA for Image Processing

      Nov 14, 2025 · In this blog post, we have explored the fundamental concepts, usage methods, common practices, and best practices of using PyTorch with CUDA for image processing.

  2. People also ask