
Understanding interpolation and image perception

2nd January 2025
Caitlin Gittins

Interpolation is a mathematical technique used to estimate unknown values that lie between known data points. In embedded vision systems, it helps transform raw sensor data into full-colour images.

In imaging, interpolation is the mathematical operation that reconstructs a full-colour image, close to what we see in the real world, from incomplete sensor data. It also underpins many of the advanced image processing capabilities of image signal processors (ISPs).

Here is a brief look at how interpolation entered the camera scene. To capture an image, we need photons from a light source, a lens, a sensor, and memory to store the data. But these are not enough to capture colour. Image sensors comprise many individual photosensors, all of which capture light. These photosensors natively capture the intensity of light but not its wavelength (colour).

To tackle this problem, image sensors were overlaid with a 'colour filter array' (CFA), also called a 'colour filter mosaic'. This overlay consists of many tiny filters, one per pixel, that allow the pixels to record colour information. However, each filter lets only a single colour pass through: either R, G, or B.

To obtain all of these colours in an image, earlier cameras used prisms to split the light coming from the scene. The prisms separated the incoming light into the primary colours R, G, and B. Multiple sensors, placed in different directions around the prism, each captured one of these colours, which made the cameras bulky and complicated. However, this method captured all three colours at every point in the image.

Then came modern cameras with CMOS sensors. These came with a Bayer filter array that made it possible to capture colour using a single sensor. However, one major drawback is that a Bayer-filter CMOS camera captures only one colour component per pixel, as each filter allows only one colour to pass through. This results in incomplete colour data. This is where interpolation comes in: it estimates the missing colours at each pixel, thereby completing the image.
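To make the "one colour per pixel" limitation concrete, here is a minimal Python sketch. It assumes an RGGB Bayer layout (red in the top-left corner), and the helper names `bayer_channel` and `mosaic` are hypothetical, chosen only for this illustration.

```python
# Illustrative sketch: RGGB layout and helper names are assumptions.

def bayer_channel(row, col):
    """Return which colour an RGGB Bayer filter passes at (row, col)."""
    if row % 2 == 0:
        return "R" if col % 2 == 0 else "G"
    return "G" if col % 2 == 0 else "B"


def mosaic(rgb_image):
    """Keep only the filtered channel at each pixel of a full RGB image.

    rgb_image is a list of rows, each row a list of (r, g, b) tuples.
    The result is a 2D list of single values, like raw sensor output.
    """
    index = {"R": 0, "G": 1, "B": 2}
    return [
        [pixel[index[bayer_channel(r, c)]] for c, pixel in enumerate(row)]
        for r, row in enumerate(rgb_image)
    ]


# A uniform orange patch: every pixel is (255, 128, 0).
patch = [[(255, 128, 0)] * 4 for _ in range(4)]
raw = mosaic(patch)
for row in raw:
    print(row)
```

Each printed value is a single number: two thirds of the colour information at every pixel has been discarded by the filter and must later be estimated.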

 

Figure 1: Bayer filter in a CMOS camera capturing only one colour element per pixel

Figure 2: Original image at 200%, and what your camera sees through a Bayer array

What is interpolation?

Interpolation is a mathematical technique used to estimate unknown values that lie between known data points. In simpler terms, it bridges gaps in data by predicting intermediate values. While interpolation focuses on filling gaps within the known range, extrapolation deals with predicting values outside the known data points.
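As a small illustration of the difference, the sketch below applies straight-line (linear) interpolation to two known samples. The function name `lerp` and the sample values are assumptions made for this example.

```python
def lerp(x0, y0, x1, y1, x):
    """Estimate y at x on the straight line through (x0, y0) and (x1, y1)."""
    t = (x - x0) / (x1 - x0)
    return y0 + t * (y1 - y0)


# Known samples: intensity 10 at position 0 and intensity 30 at position 4.
inside = lerp(0, 10, 4, 30, 1)   # x lies within [0, 4]: interpolation
outside = lerp(0, 10, 4, 30, 6)  # x lies beyond 4: extrapolation

print(inside, outside)
```

The same formula serves both cases; only the position of `x` relative to the known range differs. Extrapolated values are generally less trustworthy because nothing on the far side constrains them.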

Figure 3: Mathematical/graphical representation of interpolation

In the context of imaging, interpolation helps reconstruct missing colour and intensity information in images, playing an important role in creating visually accurate outputs.

What is an interpolation kernel?

A kernel can be thought of as a weighted function applied to neighbouring pixel values to compute an interpolated value. Essentially, it’s a mathematical tool that 'spreads out' or weights the contribution of known data points to predict or calculate unknown ones.

For example, in a grid of pixels, if you have data values at certain points and need to estimate the values in between, the kernel determines how much each neighbouring point influences the calculation. Kernels vary in complexity, from simple linear functions to advanced techniques using waveforms or gradients, depending on the accuracy and smoothness required.

In tasks like resizing or demosaicing, kernels define how the missing pixel values are computed. The choice of kernel impacts the following:

  • Sharpness: Some kernels prioritise preserving edges and fine details
  • Smoothing: Others focus on minimising noise and blending transitions
  • Accuracy: Advanced kernels adapt dynamically to image content, reducing artifacts and improving colour fidelity
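The idea of a kernel as a weighting function can be sketched in one dimension. The code below is an illustrative assumption rather than any particular ISP's implementation: `triangle_kernel` is the simple linear ("tent") kernel, and `interpolate` accepts any kernel function, so swapping the kernel changes the character of the result.

```python
import math


def triangle_kernel(distance):
    """Linear ('tent') kernel: weight falls from 1 to 0 over one sample."""
    return max(0.0, 1.0 - abs(distance))


def interpolate(samples, x, kernel, radius):
    """Estimate the value at fractional position x from integer-spaced samples.

    Each sample within `radius` of x contributes in proportion to the
    kernel weight for its distance from x. Indices past the ends of the
    list are clamped to the nearest edge sample.
    """
    base = math.floor(x)
    last = len(samples) - 1
    total = 0.0
    weight_sum = 0.0
    for i in range(base - radius + 1, base + radius + 1):
        weight = kernel(x - i)
        total += weight * samples[min(max(i, 0), last)]
        weight_sum += weight
    return total / weight_sum


samples = [0, 10, 20, 40]
print(interpolate(samples, 1.25, triangle_kernel, radius=1))
```

With the tent kernel and `radius=1`, only the two nearest samples contribute, which is exactly linear interpolation. Wider kernels draw on more neighbours.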

Types of kernels and their characteristics

Gaussian Kernel: A Gaussian kernel applies weights that follow a bell-shaped curve. Pixels closer to the point being interpolated receive higher weights, while those farther away contribute less. This kernel is primarily used for smoothing and noise reduction. Its weights fall off quickly with distance, giving it a smaller effective area of influence, and it tends to soften sharp edges and transitions. This makes the Gaussian kernel less effective for high-frequency details.
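A minimal sketch of Gaussian weighting in one dimension follows. The function names, the radius of 2 and the sigma of 1.0 are assumptions chosen for illustration.

```python
import math


def gaussian_weights(radius, sigma):
    """Return normalised Gaussian weights for offsets -radius..radius."""
    raw = [
        math.exp(-(d * d) / (2.0 * sigma * sigma))
        for d in range(-radius, radius + 1)
    ]
    total = sum(raw)
    return [w / total for w in raw]


def smooth(signal, radius=2, sigma=1.0):
    """Blur a 1D signal with Gaussian weights, repeating edge samples."""
    weights = gaussian_weights(radius, sigma)
    last = len(signal) - 1
    out = []
    for i in range(len(signal)):
        acc = 0.0
        for k, w in enumerate(weights):
            j = min(max(i + k - radius, 0), last)
            acc += w * signal[j]
        out.append(acc)
    return out


# A sharp step edge between a dark and a bright region.
step = [0, 0, 0, 0, 100, 100, 100, 100]
blurred = smooth(step)
print([round(v, 1) for v in blurred])
```

After smoothing, the abrupt jump from 0 to 100 becomes a gradual ramp. That is helpful for suppressing noise, but it illustrates why this kernel softens edges.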

SinC Kernel: The sinc kernel is based on the sinc function, sinc(x) = sin(πx)/(πx), whose weights oscillate and decay slowly with distance. Its influence therefore extends over a larger area than that of a Gaussian kernel, which lets it capture more complex, oscillatory patterns in the data. This kernel is suited to high-precision tasks, such as upscaling images or handling complex textures, and it provides sharper results than a Gaussian kernel.

Bilinear Kernel: The bilinear kernel extends linear interpolation into two dimensions. It uses the values of the four neighbouring pixels in a grid to estimate the value of a new pixel. It is used for simple image scaling tasks where computational efficiency is a priority. While computationally efficient, it may introduce blocky or overly smooth regions in detailed images.
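The four-pixel calculation can be sketched as follows. This is a minimal illustration that assumes the image is a list of rows of single-channel values and that the requested coordinates lie within the image. `bilinear_sample` is a hypothetical name.

```python
import math


def bilinear_sample(image, x, y):
    """Estimate the value at fractional (x, y) from the four surrounding pixels.

    x is the column coordinate and y the row coordinate, both assumed to
    lie within the image (0 <= x <= width - 1, 0 <= y <= height - 1).
    """
    height, width = len(image), len(image[0])
    x0 = min(max(math.floor(x), 0), width - 1)
    y0 = min(max(math.floor(y), 0), height - 1)
    x1 = min(x0 + 1, width - 1)
    y1 = min(y0 + 1, height - 1)
    fx = x - x0  # horizontal distance from the left-hand pixels
    fy = y - y0  # vertical distance from the upper pixels

    # Interpolate along the top and bottom rows, then between the two rows.
    top = image[y0][x0] * (1 - fx) + image[y0][x1] * fx
    bottom = image[y1][x0] * (1 - fx) + image[y1][x1] * fx
    return top * (1 - fy) + bottom * fy


image = [[10, 20],
         [30, 40]]
print(bilinear_sample(image, 0.5, 0.5))
```

At the centre of the four pixels each contributes equally, so the result is their plain average. Closer to one pixel, that pixel's weight grows linearly.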

Bicubic Kernel: Bicubic interpolation is an enhancement of bilinear interpolation. It considers a 4×4 neighbourhood of 16 pixels and takes into account the gradients (rate of change) of the known data points, applying cubic polynomials to produce smoother transitions. This kernel is preferred for high-quality image scaling, as it preserves sharpness and natural gradients. The bicubic kernel balances computational cost and image quality, making it widely used in image processing for OEM cameras.
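The sketch below shows the one-dimensional building block of bicubic interpolation, using the widely used cubic convolution kernel with parameter a = -0.5. That parameter value is an assumption here, since the article does not specify a variant. In two dimensions the same weights are applied along rows and then along columns. The function names are hypothetical.

```python
import math


def cubic_kernel(d, a=-0.5):
    """Cubic convolution kernel; a = -0.5 is an assumed, common choice."""
    d = abs(d)
    if d <= 1:
        return (a + 2) * d ** 3 - (a + 3) * d ** 2 + 1
    if d < 2:
        return a * d ** 3 - 5 * a * d ** 2 + 8 * a * d - 4 * a
    return 0.0


def cubic_sample(samples, x):
    """Estimate the value at fractional x from the four nearest samples."""
    base = math.floor(x)
    last = len(samples) - 1
    total = 0.0
    for i in range(base - 1, base + 3):
        j = min(max(i, 0), last)  # repeat edge samples at the borders
        total += cubic_kernel(x - i) * samples[j]
    return total


ramp = [0, 10, 20, 30, 40]
edge = [0, 0, 100, 100]
print(cubic_sample(ramp, 1.5))  # smooth gradient is reproduced
print(cubic_sample(edge, 0.5))  # slight undershoot just before the edge
```

The outer two weights are negative, so the kernel can dip slightly below the darker value next to an edge. This small undershoot is what makes bicubic results look crisper than bilinear ones, at the cost of possible ringing on very hard edges.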

Techniques of interpolation in CMOS sensors

Various interpolation methods serve distinct roles in camera systems, each with its own trade-off between computational efficiency and image quality. Let us look at the interpolation techniques most commonly used with CMOS sensors.

Nearest-neighbour interpolation: Nearest-neighbour interpolation is the simplest approach: it copies the value of the nearest known pixel. It is the fastest method but can result in blocky, low-quality images, making it unsuitable for premium OEM applications.
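A minimal sketch of nearest-neighbour upscaling by an integer factor is shown below. The function name `upscale_nearest` is an assumption for this example.

```python
def upscale_nearest(image, factor):
    """Enlarge a 2D list by an integer factor by copying the nearest source pixel."""
    height, width = len(image), len(image[0])
    return [
        [image[r // factor][c // factor] for c in range(width * factor)]
        for r in range(height * factor)
    ]


small = [[1, 2],
         [3, 4]]
big = upscale_nearest(small, 2)
for row in big:
    print(row)
```

Every source pixel simply becomes a 2×2 block of identical values. No new intermediate values are created, which is why the method is fast and why enlarged images look blocky.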

Bilinear and bicubic interpolation: These methods consider four (bilinear) or sixteen (bicubic) neighbouring pixels, which produces smoother transitions and better image quality than nearest-neighbour interpolation. Bicubic interpolation is particularly favoured for balancing sharpness and smooth gradients, which is useful in embedded applications like robotics and autonomous navigation.

Lanczos resampling: Lanczos resampling is a higher-order technique that uses sinc-based kernels. It offers superior edge detail, making it well suited to surveillance or medical imaging cameras where precision is important.
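The Lanczos kernel is a sinc function multiplied by a wider sinc "window" that cuts it off after a few lobes. The one-dimensional sketch below assumes a window size of a = 3, a common choice that the article itself does not specify. The function names are hypothetical.

```python
import math


def sinc(x):
    """Normalised sinc: sin(pi x) / (pi x), with sinc(0) = 1."""
    if x == 0:
        return 1.0
    return math.sin(math.pi * x) / (math.pi * x)


def lanczos_kernel(x, a=3):
    """Sinc windowed by a stretched sinc; zero outside |x| < a."""
    if abs(x) >= a:
        return 0.0
    return sinc(x) * sinc(x / a)


def lanczos_sample(samples, x, a=3):
    """Estimate the value at fractional x from the 2a nearest samples."""
    base = math.floor(x)
    last = len(samples) - 1
    total = 0.0
    weight_sum = 0.0
    for i in range(base - a + 1, base + a + 1):
        weight = lanczos_kernel(x - i, a)
        total += weight * samples[min(max(i, 0), last)]
        weight_sum += weight
    # Normalise so that flat regions keep their brightness.
    return total / weight_sum


samples = [0, 1, 4, 9, 16, 25, 36]
print(lanczos_sample(samples, 2.5))
```

With a = 3 each output value draws on six input samples, compared with two for linear and four for cubic interpolation in one dimension. This wider support is where both the extra detail and the extra computational cost come from.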

Adaptive methods: Techniques like Variable Number of Gradients (VNG) or Adaptive Homogeneity-Directed (AHD) interpolation adapt dynamically to image content. These are widely adopted in demosaicing algorithms to reduce colour artifacts, ensuring natural and accurate image reproduction.

Figure 4: Visual comparison of different interpolation methods.

a: nearest neighbour, b: bilinear, c: bicubic, d: original HR image (4x)

How ISPs use interpolation

Interpolation in ISPs (Image Signal Processors) serves various purposes. Let's explore a few of them.

Resizing images: ISPs use interpolation to resize images, for example when scaling a 4×4-pixel array up to 8×8. For this, ISPs use linear or bilinear interpolation, which helps smooth the output. Nearest-neighbour interpolation is avoided here because it can produce jagged, blocky edges (pixelation).

Demosaicing: Interpolation is used to convert raw sensor data into a full-colour image, a process called demosaicing. This can be done using weighted interpolation, which determines the missing values for each colour channel from neighbouring pixel values.
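A very simple form of this weighted interpolation averages the nearest same-colour neighbours, as sketched below. The sketch assumes an RGGB Bayer layout and an image of at least 2×2 pixels. Production ISPs typically use more sophisticated, edge-aware variants, and the function names here are hypothetical.

```python
def bayer_channel(row, col):
    """Return which colour an RGGB Bayer filter passes at (row, col)."""
    if row % 2 == 0:
        return "R" if col % 2 == 0 else "G"
    return "G" if col % 2 == 0 else "B"


def demosaic_average(raw):
    """Rebuild (r, g, b) per pixel by averaging same-colour neighbours.

    For each pixel, every channel is estimated as the mean of the samples
    of that colour inside the surrounding 3x3 window (clipped at borders).
    The channel actually measured at the pixel is kept unchanged.
    """
    height, width = len(raw), len(raw[0])
    out = []
    for r in range(height):
        out_row = []
        for c in range(width):
            sums = {"R": 0.0, "G": 0.0, "B": 0.0}
            counts = {"R": 0, "G": 0, "B": 0}
            for rr in range(max(r - 1, 0), min(r + 2, height)):
                for cc in range(max(c - 1, 0), min(c + 2, width)):
                    channel = bayer_channel(rr, cc)
                    sums[channel] += raw[rr][cc]
                    counts[channel] += 1
            pixel = {ch: sums[ch] / counts[ch] for ch in sums}
            pixel[bayer_channel(r, c)] = float(raw[r][c])
            out_row.append((pixel["R"], pixel["G"], pixel["B"]))
        out.append(out_row)
    return out


# Raw data for a uniform orange patch (255, 128, 0) behind an RGGB filter.
raw = [[255, 128, 255, 128],
       [128, 0, 128, 0],
       [255, 128, 255, 128],
       [128, 0, 128, 0]]
rgb = demosaic_average(raw)
print(rgb[1][1])
```

On a flat patch the original colour is recovered exactly. On real images, averaging across edges is what produces the colour fringing and zippering that more advanced methods try to avoid.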

Deblurring: ISPs preserve image sharpness during interpolation by cutting and realigning edges. Techniques like edge-directed interpolation and constant hue interpolation further refine the output.

Bicubic interpolation: Bicubic interpolation improves upon bilinear by considering gradients, which represent the rate of change between data points. This allows for smoother transitions and more detailed reconstructions.

Artifacts caused by interpolation

Despite its benefits, interpolation introduces certain artifacts:

Colour merging: When two colours blend, dark spots may appear between transitions. This occurs because cameras interpret contrast linearly, whereas human vision is logarithmic.

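Data storage and compression: Cameras store approximately the square root of each pixel's intensity, rather than the intensity itself, to match the logarithmic perception of the human eye, especially in darker regions. While this approach saves space, it can lead to slight deviations in brightness when the stored values are blended without first being converted back to linear intensity.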
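The effect of blending encoded values can be sketched numerically. The example below assumes that the stored value is the square root of linear intensity on a 0 to 1 scale, which is only a rough approximation of real transfer curves such as sRGB. All function names are hypothetical.

```python
def encode(linear):
    """Assumed storage encoding: square root of linear intensity (0..1)."""
    return linear ** 0.5


def decode(stored):
    """Inverse of encode: recover linear intensity from a stored value."""
    return stored ** 2


def blend_naive(a, b):
    """Average two stored values directly, ignoring the encoding."""
    return (a + b) / 2


def blend_linear(a, b):
    """Decode to linear light, average, then re-encode."""
    return encode((decode(a) + decode(b)) / 2)


dark, bright = encode(0.0), encode(1.0)
naive = blend_naive(dark, bright)
correct = blend_linear(dark, bright)

print(decode(naive))    # linear intensity of the naive blend
print(decode(correct))  # linear intensity of the linear-light blend
```

Averaging the stored values of black and white yields only a quarter of full intensity instead of half. This is consistent with the darker-than-expected bands that can appear between blended colours.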

Combatting demosaicing artifacts in OEM cameras

Demosaicing, a pivotal step for colour reconstruction, is prone to challenges such as colour moiré patterns and zippering artifacts in high-frequency regions. OEM cameras tackle these challenges with:

Gradient-based methods: These interpolate along directions where the gradient is low, that is, along edges rather than across them, minimising distortions.
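A simplified sketch of the gradient-based idea is shown below for the green channel at a red or blue site of a Bayer mosaic. It compares only the horizontal and vertical green differences, and it assumes the site is not on the image border. Real algorithms such as VNG or AHD are considerably more elaborate, and the function name `green_at` is hypothetical.

```python
def green_at(raw, r, c):
    """Estimate green at a red or blue site (r, c) of a Bayer mosaic.

    The four edge-adjacent neighbours of such a site are all green.
    Interpolation follows the direction in which green changes least,
    so values are averaged along an edge rather than across it.
    """
    left, right = raw[r][c - 1], raw[r][c + 1]
    up, down = raw[r - 1][c], raw[r + 1][c]
    grad_h = abs(left - right)
    grad_v = abs(up - down)
    if grad_h < grad_v:
        return (left + right) / 2
    if grad_v < grad_h:
        return (up + down) / 2
    return (left + right + up + down) / 4


# A vertical edge just to the right of the centre pixel:
# the green neighbours are 20 on the dark side and 200 on the bright side.
raw = [[0, 20, 0],
       [20, 0, 200],
       [0, 20, 0]]
print(green_at(raw, 1, 1))
```

A plain four-neighbour average would give 65 here, smearing the bright side into the dark side. The gradient test instead selects the vertical neighbours and keeps the pixel on the dark side of the edge.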

Edge-aware techniques: By detecting and preserving edges, such methods ensure that critical details remain intact, particularly in automotive or industrial cameras where accuracy can influence decision-making.

Modern advancements have unified super-resolution and demosaicing, addressing shared issues like aliasing, and have been implemented in multi-frame video reconstruction. This dual-purpose approach enhances the performance of cameras used in dynamic environments, such as drones or delivery robots.

The OEM advantage: Optimised for industry needs

For an OEM camera manufacturer, the choice of interpolation algorithm isn’t arbitrary. It must align with:

Application-specific requirements: Surveillance cameras benefit from Lanczos resampling, whereas bilinear or bicubic techniques are better suited for high-speed operations like warehouse robotics.

Computational constraints: Cameras integrated with low-power systems may favour efficient methods like bilinear interpolation to maintain real-time processing.

Sensor design: High-quality CMOS sensors with advanced colour filter arrays (CFAs) like panchromatic CFAs complement linear interpolation methods for superior results.

By fine-tuning interpolation strategies to these factors, OEM cameras can achieve high precision across diverse industries.

© Copyright 2025 Electronic Specifier