AI Research

PECCAVI

Identifying AI Generated Content

PECCAVI : Visual Paraphrase Attack Safe and Distortion Free Image Watermarking Technique for AI-Generated Images

The line between real and AI-generated visual content is becoming increasingly blurred. As state-of-the-art image generation and enhancement models like Stable Diffusion 3 and Imagen3 advance, AI-generated images are becoming nearly indistinguishable from real ones to the naked eye. These models offer users a creative platform with limitless possibilities, but they also pose significant risks when exploited by malicious actors.

The pace at which the research community is working on improving these models is unthinkable and hence this makes it more crucial than ever to focus on advancements in visual watermarking area.

Visual Paraphrase

Visual Paraphrase is a technique that generates variations of an image while maintaining the same semantic content but altering its visual presentation, similar to how paraphrasing works in natural language. The effectiveness of this technique depends on two main parameters: Paraphrasing Strength (s) and Guidance Scale (gs). Paraphrasing Strength ranges from 0 to 1, determining how much the new image deviates from the original. A higher value (s = 1) results in a more distinct image, while a lower value (s = 0) keeps the image closer to the original. Guidance Scale controls how closely the new image follows the original text prompt, with higher values ensuring closer adherence. Optimal results are achieved when the paraphrasing strength is low (s ≤ 0.4) and the guidance scale is between 4 and 7.

Visual Paraphrase Example

Where does PECCAVI Watermark and How?

PECCAVI embeds watermarks in specific regions of an image known as Non-Melting Points (NMPs). These strategically selected areas are the most crucial and resistant to adversarial attacks, including visual paraphrasing. The watermarking process begins by generating several paraphrased versions of the original image. For each version, the most important areas are identified using an advanced method called the XRAI algorithm. The most significant regions are then selected, and the system focuses on these areas while disregarding less important ones. Once the critical regions are identified, bounding boxes are drawn around them. These boxes are aggregated from all paraphrased versions and undergo a filtering process to select the most distinct, non-overlapping areas. This ensures that the watermark is placed in areas that are sufficiently large to be effective yet not too small to lose their impact. The approach also ensures that there are no overlapping boxes that could interfere with the watermark’s integrity. By embedding the watermark within these Non-Melting Points, PECCAVI ensures that the watermark remains robust and resistant to adversarial manipulation while preserving the overall visual quality of the image.

Once the Non-Melting Points (NMPs) are identified, PECCAVI employs several watermarking strategies to enhance resilience against attacks. Four distinct methods were tested: (1) baseline watermarking with state-of-the-art techniques such as ZoDiac, Stable Signature, and WAM, (2) adjusting the strength of the watermark, (3) single-channel watermarking, and (4) multi-channel watermarking. Among these, multi-channel watermarking proved to be the most effective, as embedding watermarks across multiple channels significantly increased resistance to attacks like visual paraphrasing. Additionally, varying the watermark strength, determined by the distance between the rings, further increased the system’s robustness, with smaller distances resulting in stronger, more resilient watermarks.

Results

Watermark Detection Scores after Paraphrase attack

ZoDiac
0.81
WAM
0.63
PECCAVI
0.91 ↑
Stable Signature
0.65
DwtDctSVD
0.65

PECCAVI showcases consistent 10% improvement over the current SOTA models like ZoDiac and WAM in different Dewatermarking tests and Paraphrase Attacks.